Gene EcE24377A_4892 details


Gene Information

Locus tag: EcE24377A_4892
Symbol: flu1
ID: 5587671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4879454
End bp: 4882300
Gene Length: 2847 bp
Protein Length: 948 aa
Translation table: 11
GC content: 57%
IMG OID: 640928493
Product: antigen 43
Protein accession: YP_001465820
Protein GI: 157158925
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
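
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4879454
end_bp = 4882300

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 2847 bp
print(protein_length, "aa")  # 948 aa
```

Under these assumptions the computed values match the reported gene length of 2847 bp and protein length of 948 aa.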


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG 
GTGGTGGCTT CCGAACTGGC GCGCTCACGG GGAAAACGCA CCGGTGTGGC GGTTGCGCTG
TCTCTTGCTG CTGTCACGTC AGTCCCGGTA CTGGCTGCTG ACACGGTCGT ACAGGCGGGA
GAAACCGTGA GCGGCGGAAC ACTGACAAAT CATGACAACC AGATTGTCTT CGGTACGGCC
AACGGAATGA CCATCAGTAC CGGTCTGGAG TATGGGCCGG ATAACGAGGC CAATACCGGC
GGACAATGGA TACAAAATGG CGGTATCGCC AACAACACTA CTGTCACCGG TGGTGGTCTT
CAGAGAGTGA ATGCCGGAGG AAGCGTTTCA GACACGGTTA TCAGTGCCGG AGGCGGACAG
AGCCTTCAGG GGCAGGCAGT GAACACCACT CTGAACGGCG GTGAGCAGTG GGTACATGAA
GGCGGGATTG CAACGGGTAC CGTCATTAAT GAGAAGGGCT GGCAGGCCGT CAAATCCGGC
GCAATGGCAA CCGACACGGT TGTGAATACC GGCGCGGAAG GGGGACCGGA TGCAGAAAAT
GGTGATACCG GGCAGTTTGT TCGCGGAAAT GCCGTACGTA CCACTATCAA TGAAAATGGT
CGTCAGATTG TGGCTGCTGA AGGAACAGCA AATACCACTG TGGTTTATGC CGGCGGCGAC
CAGACGGTAC ACGGGCATGC GCTGGATACC ACACTGAATG GCGGTTACCA GTATGTGCAC
AACGGAGGCA CAGCCTCTGA CACGGTTGTA AACAGTGACG GCTGGCAGAT TGTCAAGGAA
GGTGGTCTGG CGGATTTCAC CACCGTTAAC CAGAAAGGCA AACTGCAGGT GAACGCCGGT
GGTACAGCCA CGAATGTCAC CCTGAAGCAG GGAGGCGCAC TGGTCACCAG TACGGCGGCA
ACCGTCACCG GCAGCAACCG TCTGGGCAAT TTCACTGTGG AAAACGGTAA TGCTGACGGT
GTTGTTCTGG AGTCCGGTGG TCGCCTGGAT GTACTGGAGG GCCATTCAGC CTGGAAAACA
CTGGTGGATG ACGGCGGAAC CCTGGCAGTG TCTGCCGGTG GTAAGGCAAC AGATGTCACC
ATGACATCCG GTAGTGCCCT GATTGCAGAC AGTGGTGCCA CTGTTGAGGG GACCAATGCC
AGCGGTAAGT TCAGTATTGA TGGCACATCC GGTCAGGCCA GCGGACTGCT GCTGGAAAAT
GGCGGCAGCT TTACGGTTAA TGCCGGAGGA CTGGCCAGCA ACACCACTGT CGGACATCGT
GGAACACTGA CGCTGGCTGC CGGGGGAAGT CTGAGTGGCA GAACACAGCT CAGTAAAGGT
GCCAGCATGG TACTGAATGG CGATGTGGTC AGTACCGGCG ATATTGTTAA CGCAGGGGAG
ATTCGCTTTG ATAATCAGAC GACACCGGAT GCCGCGCTGA GCCGTGCTGT TGCAAAAGGC
GACTCCCCGG TAACGTTCCA TAAACTGACC ACCAGTAACC TCACCGGTCA GGGTGGCACC
ATCAATATGC GTGTTCGCCT TGATGGCAGC AATACCTCTG ACCAGCTGGT GATTAATGGT
GGTCAGGCAA CCGGCAAAAC CTGGCTTGCG TTTACAAATG TCGGAAACAG TAACCTCGGG
GTGGCAACCT CCGGACAGGG TATCCGGGTT GTGGATGCAC AGAATGGCGC CACCACAGAA
GAAGGTGCGT TTGCCCTGAG TCGCCCGCTT CAGGCCGGCG CCTTTAACTA CACCCTGAAC
CGTGACAGCG ATGAAGACTG GTACCTGCGC AGTGAAAATG CTTATCGTGC TGAAGTCCCC
CTGTATACAT CCATGCTGAC ACAGGCAATG GACTATGACC GGATTCTGGC AGGCTCCCGC
AGCCATCAGA CCGGTGTAAA CGGTGAAAAT AACAGCGTCC GTCTCAGCAT TCAGGGCGGT
CATCTCGGTC ACGATAACAA CGGCGGTATT GCCCGTGGAG CCACGCCGGA AAGCAGCGGC
AGCTATGGCT TCGTCCGTCT GGAGGGTGAC CTGCTCAGAA CAGAGGTTGC CGGTATGTCT
CTGACGACAG GGGTGCATGG TGCCGCAGGC CATTCTTCCG TTGATGTTAA GGATGATGAC
GGTTCCCGCG CCGGCACGGT CCGGGATGAT GCCGGCAGCC TGGGCGGATA CCTGAATCTG
ACACACACGT CCTCCGGCCT GTGGGCTGAC ATTGTGGCAC AGGGAACCCG CCACAGCATG
AAAGCGTCAT CGGACAATAA CGACTTCCGC GCCCGCGGCT GGGGCTGGCT GGGCTCACTG
GAAACCGGTC TGCCCTTCAG TATCACTGAC AACCTGATGC TGGAGCCACA ACTGCAGTAT
ACCTGGCAGG GACTTTCCCT GGATGACGGC CAGGATAACG CCGGTTATGT GAAGTTCGGG
CATGGCAGTG CACAACATGT GCGTGCCGGT TTCCGTCTGG GCAGCCACAA CGATATGAGC
TTTGGTGAAG GCACCTCATC CCGTGACACC CTGCGCGACA GTGCAAAACA CCGTGTGCGT
GAACTGCCGG TGAACTGGTG GGTACAGCCT TCTGTTATCC GCACCTTCAG TTCCCGGGGT
GACATGAGCA TGGGGACAGC CGCCGCCGGC AGTAACATGA CGTTCTCACC GTCCCGGAAT
GGCACGTCAC TGGACCTGCA GGCCGGACTG GAAGCCCGTG TCCGGGAAAA TATCACCCTG
GGCGTTCAGG CCGGTTATGC CCACAGTGTC AGCGGCAGCA GCGCTGAAGG TTATAACGGT
CAGGCCACGC TGAATGTGAC TTTCTGA
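
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_fraction` is a hypothetical helper name. For brevity it is applied only to the first 60-base line of the sequence, so the result differs from the 57% reported for the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, pasted with its 10-base block spacing.
first_line = "ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG"
print(gc_fraction(first_line))  # 0.5
```

Passing the full 2847-base sequence to the same function would give the gene-wide value for comparison with the reported figure.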
 
Protein sequence
MKRHLNTSYR LVWNHITGTL VVASELARSR GKRTGVAVAL SLAAVTSVPV LAADTVVQAG 
ETVSGGTLTN HDNQIVFGTA NGMTISTGLE YGPDNEANTG GQWIQNGGIA NNTTVTGGGL
QRVNAGGSVS DTVISAGGGQ SLQGQAVNTT LNGGEQWVHE GGIATGTVIN EKGWQAVKSG
AMATDTVVNT GAEGGPDAEN GDTGQFVRGN AVRTTINENG RQIVAAEGTA NTTVVYAGGD
QTVHGHALDT TLNGGYQYVH NGGTASDTVV NSDGWQIVKE GGLADFTTVN QKGKLQVNAG
GTATNVTLKQ GGALVTSTAA TVTGSNRLGN FTVENGNADG VVLESGGRLD VLEGHSAWKT
LVDDGGTLAV SAGGKATDVT MTSGSALIAD SGATVEGTNA SGKFSIDGTS GQASGLLLEN
GGSFTVNAGG LASNTTVGHR GTLTLAAGGS LSGRTQLSKG ASMVLNGDVV STGDIVNAGE
IRFDNQTTPD AALSRAVAKG DSPVTFHKLT TSNLTGQGGT INMRVRLDGS NTSDQLVING
GQATGKTWLA FTNVGNSNLG VATSGQGIRV VDAQNGATTE EGAFALSRPL QAGAFNYTLN
RDSDEDWYLR SENAYRAEVP LYTSMLTQAM DYDRILAGSR SHQTGVNGEN NSVRLSIQGG
HLGHDNNGGI ARGATPESSG SYGFVRLEGD LLRTEVAGMS LTTGVHGAAG HSSVDVKDDD
GSRAGTVRDD AGSLGGYLNL THTSSGLWAD IVAQGTRHSM KASSDNNDFR ARGWGWLGSL
ETGLPFSITD NLMLEPQLQY TWQGLSLDDG QDNAGYVKFG HGSAQHVRAG FRLGSHNDMS
FGEGTSSRDT LRDSAKHRVR ELPVNWWVQP SVIRTFSSRG DMSMGTAAAG SNMTFSPSRN
GTSLDLQAGL EARVRENITL GVQAGYAHSV SGSSAEGYNG QATLNVTF
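
The protein sequence corresponds to the gene sequence translated with translation table 11, whose amino-acid assignments are the same as those of the standard genetic code. The sketch below illustrates this on the first 30 bases and on the final line of the gene sequence. The `translate` helper and the compact codon-table construction are illustrative choices, not part of the original record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Start of the gene: matches the first ten residues of the protein sequence.
print(translate("ATGAAACGAC ATCTGAACAC CAGCTACAGG"))  # MKRHLNTSYR

# Final line of the gene (in frame, since each full line holds 60 bases).
print(translate("CAGGCCACGC TGAATGTGAC TTTCTGA"))  # QATLNVTF
```

The last codon, TGA, is a stop codon, which is why the 2847-base gene yields 948 residues rather than 949.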