Gene EcSMS35_4844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4844 
SymbolfimD1 
ID6142689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4949785 
End bp4952355 
Gene Length2571 bp 
Protein Length856 aa 
Translation table11 
GC content49% 
IMG OID641619648 
Productouter membrane usher protein fimD 
Protein accessionYP_001746755 
Protein GI170679720 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTGGTT TTTTTGTCCG GCTCTTTGTT GCCTGTGCTT TTGCCGCACA GGCACCTTTG 
TCATCTGCCG AACTCTATTT TAACCCGCGT TTTTTAGCGG ATGATCCCCA GGCTGTGGCT
GATTTATCGC GTTTTGAGAA TGGGCAAGAA TTACCGCCAG GGACGTATCG TGTCGATATC
TATTTGAATA ATGGTTATAT GGCAACGCGT GATGTCACAT TTAATACGGG CGACAGTGAA
CAAGGGATTG TTCCCTGCCT GACACGCGCG CAACTTGCCA GTATGGGGCT GAATACGGCT
TCTGTCTCCG GTATGAATCT GCTGGCGGAT GATGCCTGCG TGCCATTAAC CTCAATGATA
CATGACGCTA CTGCGCAACT GGATGTGGGT CAGCAGCGAC TCAACCTGAC GATCCCTCAG
GCATTTATGA GTAATCGCGC GCGTGGTTAT ATTCCTCCTG AGTTATGGGA TCCCGGTATT
AATGCCGGAT TGCTCAATTA TAATTTCAGC GGAAATAGTG TACAGAATCG AATTGGGGGT
AACAGCCATT ATGCATATTT AAACCTACAG AGTGGGTTAA ATATTGGTGC GTGGCGTTTA
CGCGACAATA CCACCTGGAG TTATAACAGT AGCGACAGTT CATCAGGTAG CAAAAATAAA
TGGCAGCATA TCAATACCTG GCTTGAGCGA GACATAATAC CGTTACGTTC CCGGCTGACG
CTGGGTGATG GTTATACTCA GGGCGATATT TTCGATGGTA TTAACTTTCG CGGCGCACAA
TTGGCCTCAG ATGACAATAT GTTACCCGAT AGCCAAAGAG GATTTGCCCC GGTGATCCAC
GGTATTGCTC GTGGTACTGC ACAGGTCACT ATTAAACAAA ATGGGTATGA CATTTATAAT
AGTACGGTGC CACCGGGGCC TTTTACCATC AACGATATCT ATGCCGCAGG TAATAGTGGT
GACTTGCAGG TAACGATCAA AGAGGCTGAC GGCAGCACGC AGATTTTTAC CGTACCCTAT
TCGTCAGTCC CGCTTTTGCA ACGTGAAGGG CATACTCGTT ATTCCATTAC GGCAGGAGAA
TACCGTAGTG GAAATGCGCA ACAGGAAAAA CCCCGCTTTT TCCAGAGTAC ATTACTCCAC
GGCCTTCCGG CTGGCTGGAC AATATATGGT GGAACGCAAC TGGCGGATCG TTATCGTGCT
TTTAATTTCG GTATCGGGAA AAACATGGGG GCACTGGGTG CTCTGTCTGT GGATATGACG
CAGGCTAATT CCACACTTCC CGATGACAGT CAGCATGACG GACAATCGGT GCGTTTTCTC
TATAACAAAT CGCTCAATGA GTCAGGCACG AATATTCAGT TAGTGGGTTA CCGTTATTCG
ACCAGCGGAT ATTTTAATTT CGCTGATACA ACATACAGTC GAATGAATGG CTACAACATC
GAAACACAGG ACGGAGTTAT TCAGGTTAAG CCGAAATTCA CCGACTATTA CAACCTCGCT
TATAACAAAC GCGGGAAATT ACAGCTCACC GTTACTCAGC AACTCGGGCG CTCATCAACA
CTGTATTTGA GTGGTAGCCA TCAAACTTAT TGGGGAACGA ATAATGTCGA TGAGCAATTC
CAGGCTGGTT TAAATACTGC ATTCGAAGAT ATCAACTGGA CGCTCAGCTA TAGCCTGACG
AAAAACGCCT GGCAAAAAGG ACGGGATCAG ATGTTAGCGC TTAACGTCAA TATTCCTTTC
AGCCACTGGC TGCGTTCTGA CAGTAAATCT CAGTGGCGAC ATGCCAGTGC CAGTTACAGC
ATGTCACACG ATCTCAACGG TCGGATGACC AATCTGGCCG GTGTATACGG TACGTTGCTG
GAAGACAACA ACCTCAGCTA TAGCGTGCAA ACCGGTTACG CCGGGGGAGG TGATGGTAAT
AGCGGAAGTA CAGGCTACGC CACGCTGAAT TATCGCGGTG GTTACGGCAA TGCCAATATC
GGTTACAGCC ATAGCGATGA TATTAAGCAG CTCTATTACG GAGTCAGCGG TGGGGTACTG
GCTCATGCCA ATGGCGTAAC GCTGGGGCAG CCGTTAAACG ATACGGTGGT GCTTGTTAAA
GCGCCTGGCG CAAAAGATGC AAAAGTCGAA AACCAGACAG GAGTGCGTAC CGACTGGCGG
GGTTATGCCG TGCTGCCATA TGCCACTGAA TATCGGGAAA ACAGAGTGGC GCTGGATACC
AATACCCTGG CTGATAACGT CGATTTAGAT AACTCGGTCG CTAACGTTGT TCCCACTCGT
GGGGCGATCG TGCGAGCAGA GTTTAAAGCG CGCGTTGGGA TAAAACTGCT CATGACGCTA
ACCCACAATA ATAAGCCGCT GCCGTTTGGG GCGATGGTGA CATCAGAGAG TAGCCAGAGT
AGCGGCATTG TTGCGGATAA TGGTCAGGTT TACCTCAGCG GAATGCCTTT AGCGGGAAAA
GTTCAGGTGA AATGGGGAGA AGAGGAAAAT GCTCACTGTG TCGCCAATTA TCAACTGCCA
CCAGAGAGTC AGCAGCAGTT ATTAACCCAG CTATCAGCTG AATGTCGTTA A
 
Protein sequence
MAGFFVRLFV ACAFAAQAPL SSAELYFNPR FLADDPQAVA DLSRFENGQE LPPGTYRVDI 
YLNNGYMATR DVTFNTGDSE QGIVPCLTRA QLASMGLNTA SVSGMNLLAD DACVPLTSMI
HDATAQLDVG QQRLNLTIPQ AFMSNRARGY IPPELWDPGI NAGLLNYNFS GNSVQNRIGG
NSHYAYLNLQ SGLNIGAWRL RDNTTWSYNS SDSSSGSKNK WQHINTWLER DIIPLRSRLT
LGDGYTQGDI FDGINFRGAQ LASDDNMLPD SQRGFAPVIH GIARGTAQVT IKQNGYDIYN
STVPPGPFTI NDIYAAGNSG DLQVTIKEAD GSTQIFTVPY SSVPLLQREG HTRYSITAGE
YRSGNAQQEK PRFFQSTLLH GLPAGWTIYG GTQLADRYRA FNFGIGKNMG ALGALSVDMT
QANSTLPDDS QHDGQSVRFL YNKSLNESGT NIQLVGYRYS TSGYFNFADT TYSRMNGYNI
ETQDGVIQVK PKFTDYYNLA YNKRGKLQLT VTQQLGRSST LYLSGSHQTY WGTNNVDEQF
QAGLNTAFED INWTLSYSLT KNAWQKGRDQ MLALNVNIPF SHWLRSDSKS QWRHASASYS
MSHDLNGRMT NLAGVYGTLL EDNNLSYSVQ TGYAGGGDGN SGSTGYATLN YRGGYGNANI
GYSHSDDIKQ LYYGVSGGVL AHANGVTLGQ PLNDTVVLVK APGAKDAKVE NQTGVRTDWR
GYAVLPYATE YRENRVALDT NTLADNVDLD NSVANVVPTR GAIVRAEFKA RVGIKLLMTL
THNNKPLPFG AMVTSESSQS SGIVADNGQV YLSGMPLAGK VQVKWGEEEN AHCVANYQLP
PESQQQLLTQ LSAECR