Gene EcSMS35_2179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2179 
Symbol 
ID6143011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2185632 
End bp2188232 
Gene Length2601 bp 
Protein Length866 aa 
Translation table11 
GC content48% 
IMG OID641617055 
Productouter membrane usher protein fimD-like protein 
Protein accessionYP_001744229 
Protein GI170682116 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.932643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.834799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATAGAA CTCAACGACA ACACAGCCTG TTAAGCTCTG GTGGAGTGCC ATCGTTTATT 
GGTGGGCTGG TGGTGTTTGT GTCGGCAGCG TTCAATGCAC AAGCTGAAAC CTGGTTCGAT
CCAGCCTTTT TCAAAGATGA TCCCTCAATG GTGGCTGATT TGTCTCGTTT CGAAAAAGGG
CAAAAAATAA CGCCAGGGGT TTATCGAGTC GATATTGTTC TGAATCAGAC AATTGTAGAT
ACGCGCAACG TCAATTTTGT TGAGATAACG CCAGAGAAGG GGATTGCCGC CTGTTTGACG
ACTGAAAGCC TGGATGCAAT GGGCGTGAAT ACTGATGCGT TTCCGGCTTT TAAACAACTG
GATAAACAAG CGTGTGCGCC ATTGGCGGAA ATTATTCCGG ATGCCAGCGT AACTTTTAAT
GTGAATAAAC TCCGTCTGGA AATTTCAGTA CCGCAAATCG CCATCAAAAG TAACGCTCGT
GGTTATGTCC CCCCTGAACG TTGGGATGAA GGGATCAACG CGCTATTACT GGGATATTCA
TTTAGCGGGG CTAACAGTAT TCATAGCAGC GCAGGTAGTG ATTCTGGCGA CAGCTATTTT
CTGAATTTAA ACAGTGGCGT TAATTTAGGC CCATGGAGAT TGCGCAACAA TTCAACATGG
AGTCGCAGTA GTGGCCAAAC CGCAGAATGG AAGAATCTCA GCAGCTATTT GCAGCGGGCG
GTGATTCCAT TAAAAGGCGA ACTGACCGTC GGTGATGATT ATACGGCAGG CGATTTTTTT
GATAGCGTCA GCTTTCGTGG CGTGCAGCTG GCGTCAGATG ACAACATGCT GCCAGACAGC
TTGAAAGGGT TTGCGCCAGT GGTGCGTGGT ATCGCCAAAA GCAATGCACA GGTAACGATT
AAGCAAAATG GTTACACCAT CTATCAAACT TATGTTTCGC CTGGCGCTTT TGAGATTAGT
GATCTCTACT CTACGTCGTC GAGTGGTGAT TTGTTGGTTG AAATCAAAGA AGCTGACGGT
AGTGTCAATA GTTACAGTGT GCCCTTTTCC AGCGTGCCAT TACTCCAGCG TCAGGGACGC
ATCAAATATG CGGTGACGCT GGCGAAATAC AGAACCAATA GTAATGACCA GCAAGAAAGT
AAATTTGCTC AGGCCACGCT GCAATGGGGT GGGCCGAGGG GAACGACCTG GTATGGCGGA
GGGCAATATG CTGAATATTA CCGTGCCGCT ATGTTTGGTC TGGGTTTTAA CCTCGGCGAT
TTCGGAGCAA TTTCGTTCGA CGCGACCCAG GCTAAAAGTA CGCTGGCAGA CCAAAGTGAA
CATAAAGGGC AGTCATATCG TTTTCTGTAT GCCAAAACGC TCAACCAATT GGGCACCAAC
TTTCAATTGA TGGGCTACCG CTATTCGACG TCGGGTTTCT ACACCCTTTC CGACACTATG
TATAAACACA TGGATGGCTA CGAATTTAAT GACGGTGATG ATGAAGATAC GCCAATGTGG
TCGCGTTATT ACAATTTGTT TTACACAAAA CGTGGCAAAC TGCAGGTCAA CATCTCCCAG
CAATTAGGCG AGTACGGTTC GTTTTATTTA AGCGGTAGCC AGCAAACTTA CTGGCATACC
GATCAACAGG ATCGGCTATT ACAGTTTGGC TACAACACGC AAATTAAAGA TCTCTCGCTG
GGGGTTTCCT GGAACTACAG TAAGTCCCGT GGTCAACCTG ACGCTGACCA GGTGTTTGCA
CTTAATTTTT CCCTGCCGCT CAATCTGTTG CTCCCCAAAA GTAATGATAG CTATACCAGG
AAAAAAAATT ACGCCTGGAT GACCTCTAAT ACCAGTATCG ATAACGAAGG GCACACTACA
CAAAACCTGG GTTTAACGGA GACACTACTC GATGACGGTA ATCTGAGCTA CAGCGTGCAA
CAGGGATATA ACAGCGAGGG GAAAACGGCT AATGGTAGCG CCAGCATGGA CTACAAAGGG
GCGTTTGCAG ATGCTCGAGT GGGCTACAAC TACAGCGATA ACGGCAGTCA ACAACAACTG
AACTACGCTC TTTCAGGCAG TTTAGTTGCC CATTCGCAGG GTATTACGTT GGGTCAATCA
TTGGGTGAAA CTAATGTCCT GATTGCCGCG CCAGGCGCCG AAAATACTCG TGTGGCGAAC
AGCACCGGGC TGAAAACTGA CTGGCGTGGA TATACCGTTG TGCCTTATGC CACTTCTTAT
CGGGAAAATC GAATTGCACT TGATGCGGCG TCGTTAAAAC GTAACGTGGA TCTTGAAAAT
GCCGTAGTAA ACGTGGTTCC CACCAAAGGG GCATTGGTTC TGGCGGAGTT CAATGCCCAT
GCGGGGGCTA GGGTATTAAT GAAAACATCA AAGCAGGGTA TGCCGCTGAG ATTTGGTGCA
ATGGCAACAC TGGATGGCGC ACAAACAATT AGCGGTATCA TTGATGATGA TGGTTCGCTC
TATATGTCTG GTTTGCCGGC GAAGGGAACG ATAACTGTAC GCTGGGGCGA CGCTCCCGAT
CAAATTTGTC ATATCAGTTA CGAGCTTACC GAACAACAAA TTAACGCTGC GATTACGCGG
ATGGATTCAG TATGCGAATA A
 
Protein sequence
MYRTQRQHSL LSSGGVPSFI GGLVVFVSAA FNAQAETWFD PAFFKDDPSM VADLSRFEKG 
QKITPGVYRV DIVLNQTIVD TRNVNFVEIT PEKGIAACLT TESLDAMGVN TDAFPAFKQL
DKQACAPLAE IIPDASVTFN VNKLRLEISV PQIAIKSNAR GYVPPERWDE GINALLLGYS
FSGANSIHSS AGSDSGDSYF LNLNSGVNLG PWRLRNNSTW SRSSGQTAEW KNLSSYLQRA
VIPLKGELTV GDDYTAGDFF DSVSFRGVQL ASDDNMLPDS LKGFAPVVRG IAKSNAQVTI
KQNGYTIYQT YVSPGAFEIS DLYSTSSSGD LLVEIKEADG SVNSYSVPFS SVPLLQRQGR
IKYAVTLAKY RTNSNDQQES KFAQATLQWG GPRGTTWYGG GQYAEYYRAA MFGLGFNLGD
FGAISFDATQ AKSTLADQSE HKGQSYRFLY AKTLNQLGTN FQLMGYRYST SGFYTLSDTM
YKHMDGYEFN DGDDEDTPMW SRYYNLFYTK RGKLQVNISQ QLGEYGSFYL SGSQQTYWHT
DQQDRLLQFG YNTQIKDLSL GVSWNYSKSR GQPDADQVFA LNFSLPLNLL LPKSNDSYTR
KKNYAWMTSN TSIDNEGHTT QNLGLTETLL DDGNLSYSVQ QGYNSEGKTA NGSASMDYKG
AFADARVGYN YSDNGSQQQL NYALSGSLVA HSQGITLGQS LGETNVLIAA PGAENTRVAN
STGLKTDWRG YTVVPYATSY RENRIALDAA SLKRNVDLEN AVVNVVPTKG ALVLAEFNAH
AGARVLMKTS KQGMPLRFGA MATLDGAQTI SGIIDDDGSL YMSGLPAKGT ITVRWGDAPD
QICHISYELT EQQINAAITR MDSVCE