Gene ECH74115_2969 details


Gene Information

Locus tag: ECH74115_2969
Symbol:
ID: 6971548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2745717
End bp: 2746931
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 31%
IMG OID: 643386809
Product: WbdP
Protein accession: YP_002271277
Protein GI: 209398526
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
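
The start and end coordinates, gene length, and protein length in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names gene_length and protein_length are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2745717
END_BP = 2746931


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints "1215 404"
```

Both values agree with the Gene Length (1215 bp) and Protein Length (404 aa) fields of the record.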


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.640876
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000040837
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATTG CGTTGAATTC AGATGGATTT TACGAGTGGG GCGGTGGAAT TGATTTTATT 
AAATATATTC TGTCAATATT AGAAACGAAA CCAGAAATAT GTATCGATAT TCTTTTACCG
AGAAATGATA TACATTCTCT TATAAGAGAA AAAGCATTTC CTTTTAAAAG TATATTAAAA
GCAATTTTAA AGAGGGAAAG GCCTCGATGG ATTTCATTAA ATAGATTTAA TGAGCAATAC
TATAGAGATG CCTTTACACA AAATAATATA GAGACGAATC TTACCTTTAT TAAAAGTAAG
AGCTCTGCCT TTTATTCATA TTTTGATAGT AGCGATTGTG ATGTTATTCT TCCTTGCATG
CGTGTTCCTT CGGGAAATTT GAATAAAAAA GCATGGATTG GTTATATTTA TGACTTTCAA
CACTGTTACT ATCCTTCATT TTTTAGTAAG CGAGAAATAG ATCAAAGGAA TGTGTTTTTT
AAATTGATGC TCAATTGCGC TAACAATATT ATTGTTAATG CACATTCAGT TATTACCGAT
GCAAATAAAT ATGTTGGGAA TTATTCTGCA AAACTACATT CTCTTCCATT TAGTCCATGC
CCTCAATTAA AATGGTTCGC TGATTACTCT GGTAATATTG CCAAATATAA TATTGACAAG
GATTATTTTA TAATTTGCAA TCAATTTTGG AAACATAAAG ATCATGCAAC TGCTTTTAGG
GCATTTAAAA TTTATACTGA ATATAATCCT GATGTTTATT TAGTATGCAC GGGAGCTACT
CAAGATTATC GATTCCCTGG ATATTTTAAT GAATTGATGG TTTTGGCAAA AAAGCTCGGA
ATTGAATCGA AAATTAAGAT ATTAGGGCAT ATACCTAAAC TTGAACAAAT TGAATTAATC
AAAAATTGCA TTGCTGTAAT ACAACCAACC TTATTTGAAG GCGGGCCTGG AGGGGGGGTA
ACATTTGACG CTATTGCATT AGGGAAAAAA GTTATACTAT CTGACATAGA TGTCAATAAA
GAAGTTAATT GCGGTGATGT ATATTTCTTT CAGGCAAAAA ACCATTATTC ATTAAATGAC
GCGATGGTAA AAGCTGATGA ATCTAAAATT TTTTATGAAC CTACAACTCT GATAGAATTG
GGTCTCAAAA GACGCAATGC GTGTGCAGAT TTTCTTTTAG ATGTTGTGAA ACAAGAAATT
GAATCCCGAT CTTAA
 
Protein sequence
MKIALNSDGF YEWGGGIDFI KYILSILETK PEICIDILLP RNDIHSLIRE KAFPFKSILK 
AILKRERPRW ISLNRFNEQY YRDAFTQNNI ETNLTFIKSK SSAFYSYFDS SDCDVILPCM
RVPSGNLNKK AWIGYIYDFQ HCYYPSFFSK REIDQRNVFF KLMLNCANNI IVNAHSVITD
ANKYVGNYSA KLHSLPFSPC PQLKWFADYS GNIAKYNIDK DYFIICNQFW KHKDHATAFR
AFKIYTEYNP DVYLVCTGAT QDYRFPGYFN ELMVLAKKLG IESKIKILGH IPKLEQIELI
KNCIAVIQPT LFEGGPGGGV TFDAIALGKK VILSDIDVNK EVNCGDVYFF QAKNHYSLND
AMVKADESKI FYEPTTLIEL GLKRRNACAD FLLDVVKQEI ESRS
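
The protein sequence can be reproduced from the gene sequence using translation table 11, as named in the record. The sketch below is a hedged illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed. It does not handle alternative start codons, and it reports any codon containing ambiguity codes as X. The names CODON_TABLE, clean, and translate are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position
# varying fastest. Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence shown in the record.
    print(translate("ATGAAAATTG CGTTGAATTC AGATGGATTT"))  # prints "MKIALNSDGF"
```

Passing the full gene sequence to translate and comparing the result with the cleaned protein sequence gives a simple consistency check between the two blocks.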