Gene Information |
Locus tag | EcE24377A_4106 |
Symbol | |
ID | 5589966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4091324 |
End bp | 4096174 |
Gene Length | 4851 bp |
Protein Length | 1616 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640927725 |
Product | putative haemagglutinins/invasins protein |
Protein accession | YP_001465085 |
Protein GI | 157157415 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
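The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 4091324
END_BP = 4096174
GENE_LENGTH_BP = 4851
PROTEIN_LENGTH_AA = 1616


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the listed values: the span covers 4851 bp, which is 1617 codons, giving 1616 residues once the stop codon is excluded.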
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA TATTTAAAGT TATCTGGAAC CCTGCGACAG GGAATTATAC TGTTACCAGC GAAACGGCAA AAAGCCGTGG CAAGAAATCT GGGCGCAGTA AGCTGTTAAT TTCTGCGCTG GTTGCGGGTG GAATGTTGTC GTCGTTTGGG GCATTGGCGA ATGCCGGGAA TGACAACGGT CAGGGTGTTG ATTACGGTAG TGGATCAGCT GGCGACGGCT GGGTTGCTAT AGGCAAAGGG GCGAAAGCAA ATACTTTTAT GAACACCAGT GGTTCCAGTA CTGCTGTGGG TTATGACGCT ATAGCTGAAG GCCAATATAG CTCTGCCATC GGGTCAAAAA CCCATGCGAT TGGCGGTGCA TCAATGGCCT TTGGGGTTAG TGCAATATCA GAAGGCGATA GAAGTATAGC ACTGGGTGCC TCTTCGTATT CATTGGGCCA ATACTCAATG GCCCTCGGCC GTTATTCAAA AGCATTGGGT AAATTGTCTA TTGCTATGGG GGACTCTTCC AAAGCGGAAG GAGCAAACGC CATTGCCCTG GGAAATGCCA CTAAAGCTAC TGAGATTATG AGTATTGCTC TTGGCGACAC CGCCAATGCG TCAAAAGCGT ATTCAATGGC GCTGGGAGCA AGTAGCGTCG CATCTGAAGA AAACGCAATT GCCCTGGGGC GTAGCAGTGT AGCTAGCGGT ACTGACAGCC TCGCATTTGG CAGACAATCA CTTGCCAGCG CAGCGAACGC TATTGCGATA GGTGCTGAGA CCGAAGCCGC TGAAAATGCA ACTGCTATTG GCAATAATGC GAAGGCAAAA GGGACTAATA GCATGGCAAT GGGGTTCGGA AGCCTTGCCG ATAAAGTCAA TACTATCGCA TTAGGAAATG GCAGCCAGGC TCTGGCAGAT AATGCAATCG CCATAGGCCA GGGCAACAAA GCTGATGGCG TGGATGCCAT CGCTCTGGGT AATGGTAGCC AGTCGAGAGG CTTAAACACC ATTGCCTTAG GCACAGCCAG TAATGCAACT GGTGATAAGA GTCTTGCGCT TGGTAGTAAT AGCAGTGCCA ACGGTATTAA CTCTGTCGCG CTGGGCGCAG ATTCCATTGC GGATTTAGAC AATACCGTCT CTGTCGGCAA TAGTTCATTA AAACGCAAGA TCGTTAATGT GAAAAATGGC GCGATCAAGT CTGACAGTTA CGATGCCATT AATGGTTCAC AGCTTTATGC CATTAGCGAC TCGGTAGCAA AAAGGCTTGG AGGAGGGGCT GCAGTAGATG TTGATGACGG TACTGTTACA GCACCAACCT ACAATTTAAA AAATGGTAGC AAAAATAACG TAGGGGCTGC GCTCGCTGTA CTTGATGAAA ACACCCTGCA ATGGGACCAA ACCAAAGGCA AATACAGCGC TGCTCATGGT ACTAGTAGCC CAACTGCCAG CGTAATCACC GATGTTGCGG ATGGCACGAT TTCAGCCTCC AGTAAGGATG CGGTTAACGG TTCCCAACTG AAAGCTACCA ATGACGATGT CGAAGCCAAC ACCGCCAATA TCGCTACTAA TACCAGCAAC ATTGCCACGA ATACGGCAAA TATTGCCACC AATACCACCA ATATCACCAA CCTGACGGAT TCCGTTGGTG ACCTTCAGGC TGATGCCCTG CTCTGGAACG AAACTAAAAA GGCATTCAGT GCAGCTCACG GCCAGGATAC CACCAGCAAA ATCACCAACG TTAAAGATGC CGACCTGACG GCTGACAGCA CTGATGCTGT TAACGGCTCT CAGCTGAAAA CCACCAACGA TGCTGTGGCG 
ACGAATACCA CCAATATCGC CAATAACACT TCCAATATTG CCACTAACAC CACCAACATC TCTAACCTGA CTGAGACGGT GACTAATCTT GGTGAGGATG CGCTGAAATG GGATAAGGAC AATGGTGTAT TCACGGCAGC TCATGGCACC GAGACCACCA GCAAAATCAC CAACGTTAAA GATGGCGACC TGACGACTGG CAGCACCGAT GCCGTTAACG GCTCTCAGCT GAAAACCACC AACGATGCCG TGGCGACGAA TACCACCAAT ATCGCCACTA ACACCACCAA CATCTCTAAT CTGACTGAGA CGGTGACTAA TCTTGGTGAG GATGCGCTGA AATGGGATAA GGACAATGGT GTCTTCACTG CAGCTCATGG CAACAATACC GCCAGCAAAA TCACCAATAT CCTGGACGGC ACAGTCACTG CAACCAGTTC CGATGCCATT AACGGTAGCC AGCTTTATGA CTTAAGCAGC AATATCGCCA CCTACTTCGG CGGCAATGCT TCTGTGAATA CTGACGGTGT GTTTACCGGT CCAACCTACA AAATCGGTGA AACAAATTAT TATAACGTCG GCGATGCACT GGCTGCGATT AACTCCTCAT TTAGCACGTC TCTCGGCGAT GCTCTGCTTT GGGATGCCAC CGCAGGTAAA TTCAGTGCCA AACACGGTAC TAATGGTGAC GCAAGCGTGA TCACTGATGT CGCAGATGGT GAAATTTCAG ACTCCAGTTC TGACGCAGTA AACGGCTCAC AACTCCACGG CGTGAGCAGT TATGTTGTTG ATGCGCTGGG GGGTGGTGCC GAAGTCAATG CAGACGGCAC CATCACTGCG CCGACGTACA CCATTGCTAA TGCTGATTAC GATAATGTCG GTGATGCCCT GAATGCTATC GATACCACTC TTGACGACGC TCTGCTCTGG GATGCGGACG CCGGTGAAAA TGGTGCATTT AGCGCCGCTC ACGGAAAAGA TAAAACTGCC AGTGTAATCA CTAACGTCGC TAACGGTGCA ATCTCTGCTG CCAGCAGCGA CGCGATTAAC GGCTCACAAC TCTATACCAC CAATAAGTAC ATCGCTGATG CGCTGGGTGG TGACGCAGAA GTCAACGCTG ACGGCACCAT CACCGCACCG ACTTACACCA TTGCGAACGC CGAGTACAAC AACGTCGGTG ACGCCCTGGA TGCGCTTGAT GATAACGCCC TGCTGTGGGA TGAGACTGCC AATGGCGGTG CTGGAGCCTA CAATGCCAGC CATGACGGTA AAGCCAGCAT CATCACTAAT GTCGCTAATG GCAGTATTAG TGAGGACAGT ACCGATGCAG TGAACGGTTC TCAGTTGAAT GCGACGAATA TGATGATTGA GCAGAACACC CAAATTATCA ATCAGCTCGC TGGTAACACC GACGCAACCT ATATCCAAGA AAACGGTGCG GGTATTAACT ATGTGCGTAC TAACGACGAC GGCTTAGCGT TCAACGACGC CAGCGCACAG GGTGTTGGCG CTACAGCTAT AGGTTATAAC TCTGTCGCCA AAGGCGATAG CAGCGTAGCT ATTGGTCAGG GCAGCTACAG CGACGTTGAT ACGGGTATCG CCCTGGGTAG CAGCTCTGTT TCCAGCCGAG TGATTGCCAA AGGCTCCCGT GACACCAGCA TAACGGAAAA TGGCGTTGTT ATTGGTTACG ACACCACGGA TGGCGAACTG CTCGGTGCAT TGTCTATCGG TGATGACGGT AAATATCGTC AAATCATCAA CGTAGCCGAT GGTTCCGAAG CCCATGACGC CGTTACGGTT CGTCAATTGC 
AGAATGCGAT TGGTGCGGTC GCAACCACGC CGACTAAATA CTTCCACGCT AATTCAACGG AAGAAGATTC ACTGGCAGTG GGAACTGACT CGCTGGCAAT GGGTGCGAAA ACCATCGTGA ATGGCGATAA AGGTATTGGT ATCGGTTATG GTGCCTACGT GGACGCGAAT GCACTTAACG GCATTGCCAT TGGTAGCAAT GCGCAAGTCA TTCATGTCAA CAGTATTGCG ATAGGTAATG GTTCTACGAC CACTCGTGGC GCTCAAACCA ATTATACCGC CTACAACATG GACGCACCGC AGAACTCTGT CGGTGAATTC TCAGTCGGTA GTGCGGATGG TCAACGTCAG ATCACTAACG TCGCAGCAGG TTCGGCTGAT ACCGATGCGG TCAACGTGGG TCAGTTGAAA GTAACGGATG CGCAGGTTTC CCAGAATACC CAGAGCATTA CTAACCTGGA TAATCGGGTA ACGAATCTTG ATTCACGCGT CACCAATATC GAAAACGGTA TTGGCGATAT CGTCACCACC GGTAGCACCA AGTACTTCAA GACCAATACC GATGGTGTAG ATGCCAGCGC GCAGGGTAAA GATAGCGTCG CGATTGGTTC TGGCTCCATT GCTGCCGCGG ATAACAGCGT CGCACTGGGT ACAGGGTCTG TGGCAACCGA AGAAAATACG ATCTCTGTAG GCTCATCTAC TAATCAACGT CGTATCACCA ACGTAGCCGC AGGTAAAAAC GACACCGATG CTGTTAACGT GGCACAGTTG AAGTCTTCCG AAGCTGGCGG TGTGCGTTAC GACACCAAAG CTGATGGTTC TATCGACTAT AGCAATATCA CCCTCGGTGG CGGCAACGGC GGTACGACTC GTATCAGCAA CGTCTCCGCT GGCGTCAACA ACAACGACGC GGTGAATTAC GCGCAGTTGA AGCAAAGCGT GCAGGAAACG AAGCAATACA CCGATCAGCG AATGGTTGAG ATGGATAACA AACTGTCTAA AACTGAAAGC AAGTTGAGCG GTGGTATCGC TTCTGCAATG GCAATGACCG GTCTGCCGCA GGCTTATACA CCGGGTGCCA GCATGGCTTC TATTGGTGGC GGTACTTACA ACGGTGAATC GGCAGTTGCT TTAGGTGTAT CGATGGTGAG CGCCAATGGT CGTTGGGTCT ACAAATTACA AGGTAGTACC AATAGCCAGG GTGAATACTC CGCCGCACTC GGTGCCGGTA TTCAGTGGTA A
|
Protein sequence | MNKIFKVIWN PATGNYTVTS ETAKSRGKKS GRSKLLISAL VAGGMLSSFG ALANAGNDNG QGVDYGSGSA GDGWVAIGKG AKANTFMNTS GSSTAVGYDA IAEGQYSSAI GSKTHAIGGA SMAFGVSAIS EGDRSIALGA SSYSLGQYSM ALGRYSKALG KLSIAMGDSS KAEGANAIAL GNATKATEIM SIALGDTANA SKAYSMALGA SSVASEENAI ALGRSSVASG TDSLAFGRQS LASAANAIAI GAETEAAENA TAIGNNAKAK GTNSMAMGFG SLADKVNTIA LGNGSQALAD NAIAIGQGNK ADGVDAIALG NGSQSRGLNT IALGTASNAT GDKSLALGSN SSANGINSVA LGADSIADLD NTVSVGNSSL KRKIVNVKNG AIKSDSYDAI NGSQLYAISD SVAKRLGGGA AVDVDDGTVT APTYNLKNGS KNNVGAALAV LDENTLQWDQ TKGKYSAAHG TSSPTASVIT DVADGTISAS SKDAVNGSQL KATNDDVEAN TANIATNTSN IATNTANIAT NTTNITNLTD SVGDLQADAL LWNETKKAFS AAHGQDTTSK ITNVKDADLT ADSTDAVNGS QLKTTNDAVA TNTTNIANNT SNIATNTTNI SNLTETVTNL GEDALKWDKD NGVFTAAHGT ETTSKITNVK DGDLTTGSTD AVNGSQLKTT NDAVATNTTN IATNTTNISN LTETVTNLGE DALKWDKDNG VFTAAHGNNT ASKITNILDG TVTATSSDAI NGSQLYDLSS NIATYFGGNA SVNTDGVFTG PTYKIGETNY YNVGDALAAI NSSFSTSLGD ALLWDATAGK FSAKHGTNGD ASVITDVADG EISDSSSDAV NGSQLHGVSS YVVDALGGGA EVNADGTITA PTYTIANADY DNVGDALNAI DTTLDDALLW DADAGENGAF SAAHGKDKTA SVITNVANGA ISAASSDAIN GSQLYTTNKY IADALGGDAE VNADGTITAP TYTIANAEYN NVGDALDALD DNALLWDETA NGGAGAYNAS HDGKASIITN VANGSISEDS TDAVNGSQLN ATNMMIEQNT QIINQLAGNT DATYIQENGA GINYVRTNDD GLAFNDASAQ GVGATAIGYN SVAKGDSSVA IGQGSYSDVD TGIALGSSSV SSRVIAKGSR DTSITENGVV IGYDTTDGEL LGALSIGDDG KYRQIINVAD GSEAHDAVTV RQLQNAIGAV ATTPTKYFHA NSTEEDSLAV GTDSLAMGAK TIVNGDKGIG IGYGAYVDAN ALNGIAIGSN AQVIHVNSIA IGNGSTTTRG AQTNYTAYNM DAPQNSVGEF SVGSADGQRQ ITNVAAGSAD TDAVNVGQLK VTDAQVSQNT QSITNLDNRV TNLDSRVTNI ENGIGDIVTT GSTKYFKTNT DGVDASAQGK DSVAIGSGSI AAADNSVALG TGSVATEENT ISVGSSTNQR RITNVAAGKN DTDAVNVAQL KSSEAGGVRY DTKADGSIDY SNITLGGGNG GTTRISNVSA GVNNNDAVNY AQLKQSVQET KQYTDQRMVE MDNKLSKTES KLSGGIASAM AMTGLPQAYT PGASMASIGG GTYNGESAVA LGVSMVSANG RWVYKLQGST NSQGEYSAAL GAGIQW
|