Gene ECH74115_5159 details


Gene Information

Locus tag: ECH74115_5159
Symbol:
ID: 6972242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4801540
End bp: 4802622
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 44%
IMG OID: 643388827
Product: fimbrial family protein
Protein accession: YP_002273253
Protein GI: 209398343
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3539] P pilus assembly protein, pilin FimA
TIGRFAM ID:
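
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4801540
end_bp = 4802622
gene_length_bp = 1083
protein_length_aa = 360


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```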


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.592502
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAT GGCATTATAT ATTTTGCATA ATTCTCTTTC ATTTAGGGTT ACCGTGCGGG 
TATGCGGCAA ATGATGGCAC GTGTGCAACA AGAGGCGGCA CACATACATT AAGCCTTAAT
TTTCCTCTGA CAACGGTCAG TGCAGCAAAC AATGTGCCTG GAAATACATT AATAGATATT
GCTAATGCAA CATCTTCTGA AAATTATAGC GTTCTGTGTA ACTGTGATTC AAAACATAGC
AATGGCGCTT ATCACGAAAT ATATTATACC GCAGACCCTG CTCCCGGTAT GGTTTATAGC
ACCACCGCAA GTGGTCTTGC TTTTTACTAT CTTAACGAAT ATGTCGATGT GGGAACAAAA
ATATCTGTGC TAAATGCGGG GTATACGGCA GTTCCTTTTG AACATGTTTC CAACCAGGCA
ACTACAACAG ATCACACTTG TCAGGGAAAC AAAACTACAG CGGTTGGCGT GAGCCTGAAA
ACTGGAGCAG ATGCGAAGAT TTCATTTCGT ATTAAACGTT CAATAAATGG AACGGTAGTA
ATACCTATCA CCGATATTGC ATTGCTGTAT GCCAACATAT CCAGCACCAC GACCCGTGGT
GAGGCGATTG CAAAAGTTCG AATTTCAGGC AGTTTGACCG CACCACAGTC TTGTCAGATA
AATGCAGGAC AGGTGATTTA TTTTGATTTT GATACTATTC CTGCGTCCGA ATTTTCATCT
ACCGCCGGGC AAGCCATTAC TTCACGAAAA ATCACTAAAA CAGTGAGTAT TGAGTGTACG
GGGATGGGGT ATGAGCGTAC GCAGAAAGTC GATGCTTCTT TTACGGGGAC GAACCGAAGC
AGTGACGATA CGATGGTGGC GACAGACAAT GCTGATGTCG GGATCAAAAT TTACAATAAA
TCGAATGCTG AAGTTAGCGT CAACAACGGC AAGTTACCCG CAGACATGGG CAACACGACC
ATTTTTGGTC GTAAAAATGG TTCGGTAACT TTTTCGGCAG CACCTGCCAG CTTTACCGGT
GCCCGGCCTC AGCCCGGCGT TTTTAACGCT ACCGCGACCT TAACCATTGA ATTTGTAAAC
TAA
 
Protein sequence
MKKWHYIFCI ILFHLGLPCG YAANDGTCAT RGGTHTLSLN FPLTTVSAAN NVPGNTLIDI 
ANATSSENYS VLCNCDSKHS NGAYHEIYYT ADPAPGMVYS TTASGLAFYY LNEYVDVGTK
ISVLNAGYTA VPFEHVSNQA TTTDHTCQGN KTTAVGVSLK TGADAKISFR IKRSINGTVV
IPITDIALLY ANISSTTTRG EAIAKVRISG SLTAPQSCQI NAGQVIYFDF DTIPASEFSS
TAGQAITSRK ITKTVSIECT GMGYERTQKV DASFTGTNRS SDDTMVATDN ADVGIKIYNK
SNAEVSVNNG KLPADMGNTT IFGRKNGSVT FSAAPASFTG ARPQPGVFNA TATLTIEFVN
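
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 60 bases only. It assumes that the amino acid assignments of translation table 11 match the standard code (table 11 differs mainly in which start codons it permits), and it builds the codon table by hand instead of using a bioinformatics library. `translate`, `gc_percent`, and `clean` are hypothetical helper names introduced for this example.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and first 20 residues of the protein sequence.
first_60_bases = "ATGAAAAAAT GGCATTATAT ATTTTGCATA ATTCTCTTTC ATTTAGGGTT ACCGTGCGGG"
first_20_residues = clean("MKKWHYIFCI ILFHLGLPCG")

assert translate(first_60_bases) == first_20_residues
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the protein sequence and the GC content reported in the record.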