Gene ECH74115_5823 details


Gene Information

Locus tag: ECH74115_5823
Symbol: fimD1
ID: 6970739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5473635
End bp: 5476271
Gene Length: 2637 bp
Protein Length: 878 aa
Translation table: 11
GC content: 48%
IMG OID: 643389450
Product: outer membrane usher protein fimD
Protein accession: YP_002273842
Protein GI: 209397655
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
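
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which the short check below illustrates. It is a minimal sketch and assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5473635
end_bp = 5476271

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # length in bp
print(protein_length)   # length in aa
```

Under these assumptions the computed values agree with the listed Gene Length (2637 bp) and Protein Length (878 aa).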


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.50645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.941906
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATATC TGAATTTAAG ACTTTACCAG CGAAACACAC AATGCTTGCA TATTCGTAAG 
CATCGTTTGG CTGGTTTTTT TGTCCGGCTC TTTGTCGCCT GTGCTTTTGC CGTACAGGCA
CCTTTGTCAT CTGCCGAACT CTATTTTAAT CCGCGCTTTT TAGCGGATGA TCCCCAGGCT
GTGGCCGATT TATCGCGTTT TGAAAATGGG CAAGAATTAC CGCCAGGGAC GTATCGCGTC
GATATCTATT TGAATAATGG TTATATGGCA ACGCGTGATG TCACATTTAA TACGGGCGAC
AGTGAACAAG GGATTGTTCC CTGCCTGACA CGCGCGCAAC TCGCCAGTAT GGGGCTGAAT
ACGGCTTCTG TCGCCGGTAT GAATCTGCTG GCGGATGATG CCTGTGTGCC ATTAACCACA
ATGGTCCAGG ACGCTACTGC GCATTTAGAT GTTGGTCAGC AGCGACTGAA CCTGACGATC
CCTCAGGCAT TTATGAGTAA TCGCGCGCGT GGTTATATTC CTCCTGAGTT ATGGGATCCC
GGTATTAATG CCGGATTGCT CAATTATAAT TTCAGCGGAA ATAGTGTACA GAATCGGATT
GGGGGTAACA GCCATTATGC ATATTTAAAC CTACAGAGTG GGTTAAATAT TGGTGCGTGG
CGTTTACGCG ACAATACCAC CTGGAGTTAT AACAGTAGCG ACAGATCATC AGGTAGCAAA
AATAAATGGC AGCATATCAA TACCTGGCTT GAGCGAGACA TAATACCGTT ACGTTCCCGG
CTGACGCTGG GTGATGGTTA TACTCAGGGT GATATTTTCG ATGGTATTAA CTTTCGCGGC
GCACAATTGG CCTCAGATGA CAATATGTTA CCCGATAGCC AAAGAGGATT TGCCCCGGTG
ATCCACGGTA TTGCTCGTGG TACTGCACAG GTCACTATTA AACAAAATGG GTATGACATT
TATAATAGTA CGGTGCCGCC GGGGCCTTTT ACCATCAACG ATATCTATGC CGCAGGTAAT
AGTGGTGACT TGCAGGTAAC GATTAAAGAG GCTGACGGCA GCACGCAGAT TTTTACCGTA
CCCTATTCGT CAGTCCCGCT TTTGCAACGT GAAGGGCATA CTCGTTATTC CATTACGGCA
GGAGAATACC GTAGTGGAAA TGCGCAACAG GAAAAACCCC GCTTTTTCCA AAGTACATTA
CTCCACGGCC TTCCAGCTGG CTGGACAATA TATGGTGGAA CGCAACTGGC AGATCGTTAT
CGTGCTTTTA ATTTTGGTAT CGGGAAAAAT ATGGGGGCAC TGGGCGCTCT GTCTGTGGAT
ATGACTCAGG CTAATTCCAC ACTTCCCGAT GACAGTCAGC ATGACGGACA ATCGGTGCGT
TTTCTCTATA ACAAATCGCT CAATGAGTCA GGCACGAATA TTCAGTTAGT GGGTTACCGT
TATTCGACCA GCGGATATTT TAATTTCGCT GATACAACAT ACAGTCGAAT GAATGGCTAC
AACATCGAAA CACAGGACGG AGTTATTCAG GTTAAGCCGA AATTCACCGA CTATTACAAC
CTCGCTTATA ACAAACGCGG GAAATTACAA CTCACCGTTA CTCAGCAACT CGGGCGCTCA
TCAACACTGT ATTTGAGTGG TAGCCATCAA ACTTATTGGG GAACGAGTAA TGTCGATGAG
CAATTCCAGG CTGGATTAAA TACTGCGTTC GAAGATATCA ACTGGACGCT CAGCTATAGC
CTGACGAAAA ACGCCTGGCA AAAAGGACGT GATCAGATGT TAGCGCGTAA CGTCAATATT
CCTTTCAGCC ACTGGCTGCG TTCTGACAGT AAATCTCAGT GGCGACATGC CAGTGCCAGC
TACAGCATGT CACACGATCT CAACGGTCGG ATGACCAATC TGGCTGGTGT ATACGGTACG
TTGCTGGAAG ACAACAACCT CAGCTATAGC GTGCAAACCG GCTATGCCGG GGGAGGCGAT
GGTAATAGCG GAAGCACAGG CTACGCCACG CTGAATTATC GCGGTGGTTA CGGCAATGCC
AATATCGGTT ACAGCCATAG CGATGATATT AAGCAGCTCT ATTACGGAGT CAGCGGTGGG
GTACTGGCTC ATGCCAATGG CGTAACGCTG GGGCAGCCGT TAAACGATAC GGTGGTGCTT
GTTAAAGCGC CTGGCGCAAA AGATGCAAAA GTCGAAAACC AGACGGGGGT GCGTACCGAC
TGGCGCGGTT ATGCCGTGCT GCCTTATGCC ACTGAATATC GGGAAAATAG AGTGGCGCTG
GATACCAATA CCCTGGCTGA TAACGTCGAT TTAGATAACG CGGTCGCTAA CGTTGTTCCC
ACTCGTGGGG CGATCGTGCG AGCAGAGTTT AAAGCGCGCG TTGGGATAAA ACTGCTCATG
ACGCTAACCC ACAATAATAA GCCGCTGCCG TTTGGGGCGA TGGTGACATC AGAGAGTAGC
CAGAGTAGCG GCATTGTTGC GGATAATGGT CAGGTTTACC TCAGCGGAAT GCCTCTAGCG
GGAAAAGTTC AGGTGAAATG GGGAGAAGAG GAAAATGCTC ATTGTGTCGC CAATTATCAA
CTGCCACCAG AGAGTCAGCA GCAGTTATTA ACCCAGCTAT CAGCTGAATG TCGTTAA
 
Protein sequence
MSYLNLRLYQ RNTQCLHIRK HRLAGFFVRL FVACAFAVQA PLSSAELYFN PRFLADDPQA 
VADLSRFENG QELPPGTYRV DIYLNNGYMA TRDVTFNTGD SEQGIVPCLT RAQLASMGLN
TASVAGMNLL ADDACVPLTT MVQDATAHLD VGQQRLNLTI PQAFMSNRAR GYIPPELWDP
GINAGLLNYN FSGNSVQNRI GGNSHYAYLN LQSGLNIGAW RLRDNTTWSY NSSDRSSGSK
NKWQHINTWL ERDIIPLRSR LTLGDGYTQG DIFDGINFRG AQLASDDNML PDSQRGFAPV
IHGIARGTAQ VTIKQNGYDI YNSTVPPGPF TINDIYAAGN SGDLQVTIKE ADGSTQIFTV
PYSSVPLLQR EGHTRYSITA GEYRSGNAQQ EKPRFFQSTL LHGLPAGWTI YGGTQLADRY
RAFNFGIGKN MGALGALSVD MTQANSTLPD DSQHDGQSVR FLYNKSLNES GTNIQLVGYR
YSTSGYFNFA DTTYSRMNGY NIETQDGVIQ VKPKFTDYYN LAYNKRGKLQ LTVTQQLGRS
STLYLSGSHQ TYWGTSNVDE QFQAGLNTAF EDINWTLSYS LTKNAWQKGR DQMLARNVNI
PFSHWLRSDS KSQWRHASAS YSMSHDLNGR MTNLAGVYGT LLEDNNLSYS VQTGYAGGGD
GNSGSTGYAT LNYRGGYGNA NIGYSHSDDI KQLYYGVSGG VLAHANGVTL GQPLNDTVVL
VKAPGAKDAK VENQTGVRTD WRGYAVLPYA TEYRENRVAL DTNTLADNVD LDNAVANVVP
TRGAIVRAEF KARVGIKLLM TLTHNNKPLP FGAMVTSESS QSSGIVADNG QVYLSGMPLA
GKVQVKWGEE ENAHCVANYQ LPPESQQQLL TQLSAECR
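
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. The helpers below (clean, gc_fraction, translate) are hypothetical names introduced for illustration and are not part of the source database. The codon table uses the standard amino-acid assignments, which translation table 11 shares with the standard code. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Illustrative helpers; names are hypothetical, not from the source database.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as listed in this record.
first_line = "ATGTCATATC TGAATTTAAG ACTTTACCAG CGAAACACAC AATGCTTGCA TATTCGTAAG"
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Translating the first 60 bases this way gives MSYLNLRLYQRNTQCLHIRK, the first 20 residues of the protein sequence listed above. The GC fraction printed here covers only that first line, so it need not match the 48% reported for the whole gene.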