Gene EcSMS35_2249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2249 
Symbolflu1 
ID6143968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2267288 
End bp2270134 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content57% 
IMG OID641617125 
Productantigen 43 
Protein accessionYP_001744298 
Protein GI170680707 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG 
GTGGTGGCTT CCGAACTGGC CCGCTCACGG GGAAAACGCG CCGGTGTGGC GGTTGCGCTG
TCTCTTGCTG CTGTCACGCC AGTCCCGGCA CTGGCTGCTG ACACGGTTGT AGAGGCGGGA
GAAACCGTGA ACGGCGGAAC ACTGACAAAT CATGACAACC AGATTGTCTT CGGTACGACC
AACGGAATGA CCATCAGTAC CGGGCTGGAG TATGGGACGG ATAACGAGGC CAATACCGGC
GGGCAATGGG TACAGGATGG CGGCACAGCC AGTAATACCA CCATCAGCAG CGGCGGGCTC
CAGTTTGTAG GTGCCGGGGG AAAGGCTACA GACACAATAA TCAATGAAGG AGGCGGACAA
AGCCTGAAGG GACTGGCACT GAACACCACG CTGAATGGTG GCGAGCAGTG GATGCATGAA
GGTGCGATAG CCACAGGAAC CGTCATTAAT GATAAGGGCT GGCAGGTCGT CAAACCCGGC
GCAGTGGCAA CAGACACCGT TGTGAATACC GGCGCAGAAG GAGGACCGGA TGCAGAAAAT
GGTGATACCG GGCAGTTTGT TCGCGGAAAT GCCGTACGTA CCACCATCAA TAAAAATGGT
CGTCAGATTG TGACTGTTGA AGGAACAGCA AATACCACTG TGGTTTATGC CGGCGGCGAC
CAGACGGTAC ATGGCCACGC ACTGGATACC ACGCTGAATG GCGGTAACCA GTATGTACAC
AACGGCGGTA CAACGTCTGA CACTGTTGTG AACAGTGACG GCTGGCAGAT TATCAAGGAA
GGTGGTCTGG CGGATTTCAC CACCGTTAAC CAGAAAGGCA AACTGCAGGT GAACGCCGGT
GGTACAGCCA CGAATGTCAC CCTGAAGCAG GGAGGCGCAC TGGTCACCAG TACGGCGGCA
ACCGTCACCG GCAGCAACCG TCTGGGCAAT TTCGCGGTGG AAAACGGTAA GGCTGACGGT
GTTGTTCTGG AGTCCGGCGG TCGTCTGGAT GTACTGGAGG GTCATTCAGC GCAGAAAACA
CGGGTGGATG ACGGCGGTAC CCTGGCAGTG TCTGCCGGTG GTAAGGCGAC AGGTGTCACC
ATGACATCCG GTGGTGCCCT GATTGCAGAC AGTGGTGCCA CTGTTGAGGG GACCAATGCC
AGCGGTAAGT TCAGTATTGA TGGCATATCC GGTCAGGCCA GCGGCCTGCT GCTGGAAAAT
GGCGGCAGCT TTACGGTTAA TGCCGGGGGA CAGGCTGGCA ACACCACTGT CGGACATCGT
GGAACACTGA CGCTGGCTGC CGGGGGAAGT CTGAGTGGCA GAACACAGCT CAGTAAAGGC
GCCAGCATGG TACTGAATGG TGATGTGGTC AGTACCGGCG ATATTGTTAA CGCAGGGGAG
ATTCGCTTTG ATAATCAGAC GACACAGGAT GCCGTCCTGA GCCGTGCTGT TGCAAAAGGC
GACGCCCCGG TAACGTTCCA TAAACTGACC ACCAGTAACC TCACCGGTCA GGGTGGCACC
ATCAATATGC GTGTTCGCCT TGATGGCAGC AATGCCTCTG ACCAGCTGGT GATTAATGGT
GGTCAGGCAA CCGGCAAAAC CTGGCTTGCG TTCACAAATG TCGGAAACAG TAACCTCGGG
GTGGCAACTT CCGGACAGGG TATCCGGGTT GTGGATGCAC AGAATGGTGC CACCACAGAA
GAAGGTGCGT TTGCCCTGAG TCGCCCGCTT CAGGCCGGTG CCTTTAACTA CACCCTGAAC
CGTGACAGCG ATGAAGACTG GTACCTGCGC AGTGAAAATG CTTATCGTGC TGAAGTCCCC
CTGTATGCCT CCATGCTGAC ACAGGCAATG GACTATGACC GGATTCTGGC AGGCTCCCGC
AGCCATCAGA CCGGTGTAAA CGGTGAAAAT AACAGCGTCC GTCTCAGCAT TCAGGGCGGT
CATCTCGGTC ACGATAACAA CGGCGGTATT GCCCGTGGGG CCACGCCGGA AAGCAGCGGC
AGCTATGGCT TCGTCCGTCT GGAGAGTGAC CTGCTGAGAA CAGAGGTTGC CGGTATGTCT
GTGACCGCGG GGGTATATAG TGCTGCTGGC CATTCTTCCG TTGATGTTAA GGATGATGAC
GGCTCCCGCG CCGGCACGGT CCGGGATGAT GCCGGCAGCC TGGGCGGATA CCTGAATCTG
GTACACACGT CCTCCGGCCT GTGGGCTGAC ATTATGGCAC AGGGAACCCG CCACAGCATG
AAAGCGTCAT CGGACAATAA CGACTTCCGC GCCCGGGGCT GGGGCTGGCT GGGCTCACTG
GAAACCGGTC TGCCCTTCAG TATCACTGAC AACCTGATGC TGGAACCACA ACTGCAGTAC
ACCTGGCAGG GACTTTCCCT GGATGACGGC CAGGATAACG CCGGTTATGT GAAGTTCGGG
CATGGCAGTG CACAACATAT GCGTGCCGGC TTCCGTCTGG GCAGCCACAA CGATATGAGC
TTTGGTGAAG GCACCTCATC CCGTGACACC CTGCGCGACA GTGCAAAACA CCGTGTGCGT
GAACTGCCGG TGAACTGGTG GGTACAGCCT TCTGTTATCC GCACCTTCAG TTCCCGGGGT
GACATGAGCA TGGGGACAGC CGCCGCCGGC AGTAACATGA CGTTCTCACC GTCCCAGAAT
GGCACGTCAC TGGACCTGCA GGCCGGACTG GAAGCCCGTG TCCGGGAAAA TATCACCCTG
GGCGTTCAGG CCGGTTATGC CCACAGCGTC AGCGGCAGCA GCGCCGAAGG CTATAACGGT
CAGGCCACGC TGAATGTGAC TTTCTGA
 
Protein sequence
MKRHLNTSYR LVWNHITGTL VVASELARSR GKRAGVAVAL SLAAVTPVPA LAADTVVEAG 
ETVNGGTLTN HDNQIVFGTT NGMTISTGLE YGTDNEANTG GQWVQDGGTA SNTTISSGGL
QFVGAGGKAT DTIINEGGGQ SLKGLALNTT LNGGEQWMHE GAIATGTVIN DKGWQVVKPG
AVATDTVVNT GAEGGPDAEN GDTGQFVRGN AVRTTINKNG RQIVTVEGTA NTTVVYAGGD
QTVHGHALDT TLNGGNQYVH NGGTTSDTVV NSDGWQIIKE GGLADFTTVN QKGKLQVNAG
GTATNVTLKQ GGALVTSTAA TVTGSNRLGN FAVENGKADG VVLESGGRLD VLEGHSAQKT
RVDDGGTLAV SAGGKATGVT MTSGGALIAD SGATVEGTNA SGKFSIDGIS GQASGLLLEN
GGSFTVNAGG QAGNTTVGHR GTLTLAAGGS LSGRTQLSKG ASMVLNGDVV STGDIVNAGE
IRFDNQTTQD AVLSRAVAKG DAPVTFHKLT TSNLTGQGGT INMRVRLDGS NASDQLVING
GQATGKTWLA FTNVGNSNLG VATSGQGIRV VDAQNGATTE EGAFALSRPL QAGAFNYTLN
RDSDEDWYLR SENAYRAEVP LYASMLTQAM DYDRILAGSR SHQTGVNGEN NSVRLSIQGG
HLGHDNNGGI ARGATPESSG SYGFVRLESD LLRTEVAGMS VTAGVYSAAG HSSVDVKDDD
GSRAGTVRDD AGSLGGYLNL VHTSSGLWAD IMAQGTRHSM KASSDNNDFR ARGWGWLGSL
ETGLPFSITD NLMLEPQLQY TWQGLSLDDG QDNAGYVKFG HGSAQHMRAG FRLGSHNDMS
FGEGTSSRDT LRDSAKHRVR ELPVNWWVQP SVIRTFSSRG DMSMGTAAAG SNMTFSPSQN
GTSLDLQAGL EARVRENITL GVQAGYAHSV SGSSAEGYNG QATLNVTF