Gene EcE24377A_4106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4106 
Symbol 
ID5589966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4091324 
End bp4096174 
Gene Length4851 bp 
Protein Length1616 aa 
Translation table11 
GC content51% 
IMG OID640927725 
Productputative haemagglutinins/invasins protein 
Protein accessionYP_001465085 
Protein GI157157415 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TATTTAAAGT TATCTGGAAC CCTGCGACAG GGAATTATAC TGTTACCAGC 
GAAACGGCAA AAAGCCGTGG CAAGAAATCT GGGCGCAGTA AGCTGTTAAT TTCTGCGCTG
GTTGCGGGTG GAATGTTGTC GTCGTTTGGG GCATTGGCGA ATGCCGGGAA TGACAACGGT
CAGGGTGTTG ATTACGGTAG TGGATCAGCT GGCGACGGCT GGGTTGCTAT AGGCAAAGGG
GCGAAAGCAA ATACTTTTAT GAACACCAGT GGTTCCAGTA CTGCTGTGGG TTATGACGCT
ATAGCTGAAG GCCAATATAG CTCTGCCATC GGGTCAAAAA CCCATGCGAT TGGCGGTGCA
TCAATGGCCT TTGGGGTTAG TGCAATATCA GAAGGCGATA GAAGTATAGC ACTGGGTGCC
TCTTCGTATT CATTGGGCCA ATACTCAATG GCCCTCGGCC GTTATTCAAA AGCATTGGGT
AAATTGTCTA TTGCTATGGG GGACTCTTCC AAAGCGGAAG GAGCAAACGC CATTGCCCTG
GGAAATGCCA CTAAAGCTAC TGAGATTATG AGTATTGCTC TTGGCGACAC CGCCAATGCG
TCAAAAGCGT ATTCAATGGC GCTGGGAGCA AGTAGCGTCG CATCTGAAGA AAACGCAATT
GCCCTGGGGC GTAGCAGTGT AGCTAGCGGT ACTGACAGCC TCGCATTTGG CAGACAATCA
CTTGCCAGCG CAGCGAACGC TATTGCGATA GGTGCTGAGA CCGAAGCCGC TGAAAATGCA
ACTGCTATTG GCAATAATGC GAAGGCAAAA GGGACTAATA GCATGGCAAT GGGGTTCGGA
AGCCTTGCCG ATAAAGTCAA TACTATCGCA TTAGGAAATG GCAGCCAGGC TCTGGCAGAT
AATGCAATCG CCATAGGCCA GGGCAACAAA GCTGATGGCG TGGATGCCAT CGCTCTGGGT
AATGGTAGCC AGTCGAGAGG CTTAAACACC ATTGCCTTAG GCACAGCCAG TAATGCAACT
GGTGATAAGA GTCTTGCGCT TGGTAGTAAT AGCAGTGCCA ACGGTATTAA CTCTGTCGCG
CTGGGCGCAG ATTCCATTGC GGATTTAGAC AATACCGTCT CTGTCGGCAA TAGTTCATTA
AAACGCAAGA TCGTTAATGT GAAAAATGGC GCGATCAAGT CTGACAGTTA CGATGCCATT
AATGGTTCAC AGCTTTATGC CATTAGCGAC TCGGTAGCAA AAAGGCTTGG AGGAGGGGCT
GCAGTAGATG TTGATGACGG TACTGTTACA GCACCAACCT ACAATTTAAA AAATGGTAGC
AAAAATAACG TAGGGGCTGC GCTCGCTGTA CTTGATGAAA ACACCCTGCA ATGGGACCAA
ACCAAAGGCA AATACAGCGC TGCTCATGGT ACTAGTAGCC CAACTGCCAG CGTAATCACC
GATGTTGCGG ATGGCACGAT TTCAGCCTCC AGTAAGGATG CGGTTAACGG TTCCCAACTG
AAAGCTACCA ATGACGATGT CGAAGCCAAC ACCGCCAATA TCGCTACTAA TACCAGCAAC
ATTGCCACGA ATACGGCAAA TATTGCCACC AATACCACCA ATATCACCAA CCTGACGGAT
TCCGTTGGTG ACCTTCAGGC TGATGCCCTG CTCTGGAACG AAACTAAAAA GGCATTCAGT
GCAGCTCACG GCCAGGATAC CACCAGCAAA ATCACCAACG TTAAAGATGC CGACCTGACG
GCTGACAGCA CTGATGCTGT TAACGGCTCT CAGCTGAAAA CCACCAACGA TGCTGTGGCG
ACGAATACCA CCAATATCGC CAATAACACT TCCAATATTG CCACTAACAC CACCAACATC
TCTAACCTGA CTGAGACGGT GACTAATCTT GGTGAGGATG CGCTGAAATG GGATAAGGAC
AATGGTGTAT TCACGGCAGC TCATGGCACC GAGACCACCA GCAAAATCAC CAACGTTAAA
GATGGCGACC TGACGACTGG CAGCACCGAT GCCGTTAACG GCTCTCAGCT GAAAACCACC
AACGATGCCG TGGCGACGAA TACCACCAAT ATCGCCACTA ACACCACCAA CATCTCTAAT
CTGACTGAGA CGGTGACTAA TCTTGGTGAG GATGCGCTGA AATGGGATAA GGACAATGGT
GTCTTCACTG CAGCTCATGG CAACAATACC GCCAGCAAAA TCACCAATAT CCTGGACGGC
ACAGTCACTG CAACCAGTTC CGATGCCATT AACGGTAGCC AGCTTTATGA CTTAAGCAGC
AATATCGCCA CCTACTTCGG CGGCAATGCT TCTGTGAATA CTGACGGTGT GTTTACCGGT
CCAACCTACA AAATCGGTGA AACAAATTAT TATAACGTCG GCGATGCACT GGCTGCGATT
AACTCCTCAT TTAGCACGTC TCTCGGCGAT GCTCTGCTTT GGGATGCCAC CGCAGGTAAA
TTCAGTGCCA AACACGGTAC TAATGGTGAC GCAAGCGTGA TCACTGATGT CGCAGATGGT
GAAATTTCAG ACTCCAGTTC TGACGCAGTA AACGGCTCAC AACTCCACGG CGTGAGCAGT
TATGTTGTTG ATGCGCTGGG GGGTGGTGCC GAAGTCAATG CAGACGGCAC CATCACTGCG
CCGACGTACA CCATTGCTAA TGCTGATTAC GATAATGTCG GTGATGCCCT GAATGCTATC
GATACCACTC TTGACGACGC TCTGCTCTGG GATGCGGACG CCGGTGAAAA TGGTGCATTT
AGCGCCGCTC ACGGAAAAGA TAAAACTGCC AGTGTAATCA CTAACGTCGC TAACGGTGCA
ATCTCTGCTG CCAGCAGCGA CGCGATTAAC GGCTCACAAC TCTATACCAC CAATAAGTAC
ATCGCTGATG CGCTGGGTGG TGACGCAGAA GTCAACGCTG ACGGCACCAT CACCGCACCG
ACTTACACCA TTGCGAACGC CGAGTACAAC AACGTCGGTG ACGCCCTGGA TGCGCTTGAT
GATAACGCCC TGCTGTGGGA TGAGACTGCC AATGGCGGTG CTGGAGCCTA CAATGCCAGC
CATGACGGTA AAGCCAGCAT CATCACTAAT GTCGCTAATG GCAGTATTAG TGAGGACAGT
ACCGATGCAG TGAACGGTTC TCAGTTGAAT GCGACGAATA TGATGATTGA GCAGAACACC
CAAATTATCA ATCAGCTCGC TGGTAACACC GACGCAACCT ATATCCAAGA AAACGGTGCG
GGTATTAACT ATGTGCGTAC TAACGACGAC GGCTTAGCGT TCAACGACGC CAGCGCACAG
GGTGTTGGCG CTACAGCTAT AGGTTATAAC TCTGTCGCCA AAGGCGATAG CAGCGTAGCT
ATTGGTCAGG GCAGCTACAG CGACGTTGAT ACGGGTATCG CCCTGGGTAG CAGCTCTGTT
TCCAGCCGAG TGATTGCCAA AGGCTCCCGT GACACCAGCA TAACGGAAAA TGGCGTTGTT
ATTGGTTACG ACACCACGGA TGGCGAACTG CTCGGTGCAT TGTCTATCGG TGATGACGGT
AAATATCGTC AAATCATCAA CGTAGCCGAT GGTTCCGAAG CCCATGACGC CGTTACGGTT
CGTCAATTGC AGAATGCGAT TGGTGCGGTC GCAACCACGC CGACTAAATA CTTCCACGCT
AATTCAACGG AAGAAGATTC ACTGGCAGTG GGAACTGACT CGCTGGCAAT GGGTGCGAAA
ACCATCGTGA ATGGCGATAA AGGTATTGGT ATCGGTTATG GTGCCTACGT GGACGCGAAT
GCACTTAACG GCATTGCCAT TGGTAGCAAT GCGCAAGTCA TTCATGTCAA CAGTATTGCG
ATAGGTAATG GTTCTACGAC CACTCGTGGC GCTCAAACCA ATTATACCGC CTACAACATG
GACGCACCGC AGAACTCTGT CGGTGAATTC TCAGTCGGTA GTGCGGATGG TCAACGTCAG
ATCACTAACG TCGCAGCAGG TTCGGCTGAT ACCGATGCGG TCAACGTGGG TCAGTTGAAA
GTAACGGATG CGCAGGTTTC CCAGAATACC CAGAGCATTA CTAACCTGGA TAATCGGGTA
ACGAATCTTG ATTCACGCGT CACCAATATC GAAAACGGTA TTGGCGATAT CGTCACCACC
GGTAGCACCA AGTACTTCAA GACCAATACC GATGGTGTAG ATGCCAGCGC GCAGGGTAAA
GATAGCGTCG CGATTGGTTC TGGCTCCATT GCTGCCGCGG ATAACAGCGT CGCACTGGGT
ACAGGGTCTG TGGCAACCGA AGAAAATACG ATCTCTGTAG GCTCATCTAC TAATCAACGT
CGTATCACCA ACGTAGCCGC AGGTAAAAAC GACACCGATG CTGTTAACGT GGCACAGTTG
AAGTCTTCCG AAGCTGGCGG TGTGCGTTAC GACACCAAAG CTGATGGTTC TATCGACTAT
AGCAATATCA CCCTCGGTGG CGGCAACGGC GGTACGACTC GTATCAGCAA CGTCTCCGCT
GGCGTCAACA ACAACGACGC GGTGAATTAC GCGCAGTTGA AGCAAAGCGT GCAGGAAACG
AAGCAATACA CCGATCAGCG AATGGTTGAG ATGGATAACA AACTGTCTAA AACTGAAAGC
AAGTTGAGCG GTGGTATCGC TTCTGCAATG GCAATGACCG GTCTGCCGCA GGCTTATACA
CCGGGTGCCA GCATGGCTTC TATTGGTGGC GGTACTTACA ACGGTGAATC GGCAGTTGCT
TTAGGTGTAT CGATGGTGAG CGCCAATGGT CGTTGGGTCT ACAAATTACA AGGTAGTACC
AATAGCCAGG GTGAATACTC CGCCGCACTC GGTGCCGGTA TTCAGTGGTA A
 
Protein sequence
MNKIFKVIWN PATGNYTVTS ETAKSRGKKS GRSKLLISAL VAGGMLSSFG ALANAGNDNG 
QGVDYGSGSA GDGWVAIGKG AKANTFMNTS GSSTAVGYDA IAEGQYSSAI GSKTHAIGGA
SMAFGVSAIS EGDRSIALGA SSYSLGQYSM ALGRYSKALG KLSIAMGDSS KAEGANAIAL
GNATKATEIM SIALGDTANA SKAYSMALGA SSVASEENAI ALGRSSVASG TDSLAFGRQS
LASAANAIAI GAETEAAENA TAIGNNAKAK GTNSMAMGFG SLADKVNTIA LGNGSQALAD
NAIAIGQGNK ADGVDAIALG NGSQSRGLNT IALGTASNAT GDKSLALGSN SSANGINSVA
LGADSIADLD NTVSVGNSSL KRKIVNVKNG AIKSDSYDAI NGSQLYAISD SVAKRLGGGA
AVDVDDGTVT APTYNLKNGS KNNVGAALAV LDENTLQWDQ TKGKYSAAHG TSSPTASVIT
DVADGTISAS SKDAVNGSQL KATNDDVEAN TANIATNTSN IATNTANIAT NTTNITNLTD
SVGDLQADAL LWNETKKAFS AAHGQDTTSK ITNVKDADLT ADSTDAVNGS QLKTTNDAVA
TNTTNIANNT SNIATNTTNI SNLTETVTNL GEDALKWDKD NGVFTAAHGT ETTSKITNVK
DGDLTTGSTD AVNGSQLKTT NDAVATNTTN IATNTTNISN LTETVTNLGE DALKWDKDNG
VFTAAHGNNT ASKITNILDG TVTATSSDAI NGSQLYDLSS NIATYFGGNA SVNTDGVFTG
PTYKIGETNY YNVGDALAAI NSSFSTSLGD ALLWDATAGK FSAKHGTNGD ASVITDVADG
EISDSSSDAV NGSQLHGVSS YVVDALGGGA EVNADGTITA PTYTIANADY DNVGDALNAI
DTTLDDALLW DADAGENGAF SAAHGKDKTA SVITNVANGA ISAASSDAIN GSQLYTTNKY
IADALGGDAE VNADGTITAP TYTIANAEYN NVGDALDALD DNALLWDETA NGGAGAYNAS
HDGKASIITN VANGSISEDS TDAVNGSQLN ATNMMIEQNT QIINQLAGNT DATYIQENGA
GINYVRTNDD GLAFNDASAQ GVGATAIGYN SVAKGDSSVA IGQGSYSDVD TGIALGSSSV
SSRVIAKGSR DTSITENGVV IGYDTTDGEL LGALSIGDDG KYRQIINVAD GSEAHDAVTV
RQLQNAIGAV ATTPTKYFHA NSTEEDSLAV GTDSLAMGAK TIVNGDKGIG IGYGAYVDAN
ALNGIAIGSN AQVIHVNSIA IGNGSTTTRG AQTNYTAYNM DAPQNSVGEF SVGSADGQRQ
ITNVAAGSAD TDAVNVGQLK VTDAQVSQNT QSITNLDNRV TNLDSRVTNI ENGIGDIVTT
GSTKYFKTNT DGVDASAQGK DSVAIGSGSI AAADNSVALG TGSVATEENT ISVGSSTNQR
RITNVAAGKN DTDAVNVAQL KSSEAGGVRY DTKADGSIDY SNITLGGGNG GTTRISNVSA
GVNNNDAVNY AQLKQSVQET KQYTDQRMVE MDNKLSKTES KLSGGIASAM AMTGLPQAYT
PGASMASIGG GTYNGESAVA LGVSMVSANG RWVYKLQGST NSQGEYSAAL GAGIQW