I have been building and maintaining a data lake in AWS for the past year or so, and it has been a learning experience, to say the least. Recently I ran into an issue where an AWS Glue crawler stopped updating a catalog table that represented the raw syslog data being imported.
The error shown was:
INFO : Multiple tables are found under location [S3 bucket and path]. Table [table name] is skipped.
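If you already have the crawler's log lines as plain text, a quick way to see which tables are affected is to scan them for this message. The following is only a minimal sketch: the pattern is derived from the single message quoted above, and the function name `find_skipped_tables`, the bucket path, and the table name in the sample are hypothetical placeholders rather than values from my environment.

```python
import re

# Pattern built from the message format quoted above. The two bracketed
# values (S3 location and table name) are captured as named groups.
SKIPPED_TABLE = re.compile(
    r"Multiple tables are found under location \[(?P<location>[^\]]*)\]\. "
    r"Table \[(?P<table>[^\]]*)\] is skipped\."
)


def find_skipped_tables(log_lines):
    """Return a (location, table) pair for each 'is skipped' message found."""
    skipped = []
    for line in log_lines:
        match = SKIPPED_TABLE.search(line)
        if match:
            skipped.append((match.group("location"), match.group("table")))
    return skipped


# Hypothetical log lines, for illustration only.
sample = [
    "INFO : some unrelated message",
    "INFO : Multiple tables are found under location "
    "[s3://example-bucket/raw/syslog/]. Table [syslog] is skipped.",
]

for location, table in find_skipped_tables(sample):
    print(f"{table} skipped at {location}")
```

How you collect the log lines is up to you; the sketch only assumes they arrive as an iterable of strings.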