Gene ECH74115_5282 details

Gene Information

Locus tag: ECH74115_5282
Symbol: ubiD
ID: 6967477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4924412
End bp: 4925905
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 52%
IMG OID: 643388946
Product: 3-octaprenyl-4-hydroxybenzoate decarboxylase
Protein accession: YP_002273360
Protein GI: 209397095
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
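
The start, end, gene length and protein length values above are consistent with one another if the coordinates are read as 1-based and inclusive, and if the last codon of the CDS is a stop codon. Both readings are assumptions, not statements from the record. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4924412, 4925905)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```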


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.064348
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCCA TGAAATATAA CGATTTACGC GACTTCTTGA CGTTGCTTGA ACAGCAGGGT 
GAGCTAAAAC GTATCACGCT CCCGGTGGAC CCGCATCTGG AAATCACTGA AATTGCTGAC
CGCACGCTGC GTGCTGGTGG GCCTGCGCTG TTGTTTGAAA ACCCTAAAGG GTACTCAATG
CCGGTGCTGT GCAACTTGTT CGGTACGCCA AAGCGCGTAG CGATGGGTAT GGGCCAGGAA
GATGTTTCAG CACTGCGTGA AGTCGGTAAA TTATTAGCAT TTCTGAAAGA ACCAGAGCCG
CCAAAAGGTT TTCGCGATCT GTTTGATAAG CTGCCGCAGT TTAAGCAGGT GTTAAACATG
CCGACAAAGC GACTGCGCGG TGCACCCTGC CAACAAAAAA TCGTCTCTGG CGATGACGTC
GATCTCAACC GTATTCCCAT TATGACCTGT TGGCCGGAAG ATGCCGCGCC GCTGATTACA
TGGGGGCTAA CCGTTACACG TGGCCCTCAT AAAGAGCGAC AGAATCTGGG CATTTATCGC
CAGCAACTGA TTGGTAAAAA CAAGCTGATT ATGCGTTGGC TGTCGCATCG CGGCGGCGCG
CTGGATTATC AGGAGTGGTG TGCGGCGCAT CCAGGTGAAC GTTTCCCGAT CTCTGTGGCG
TTGGGCGCTG ATCCGGCAAC CATTCTCGGT GCAGTCACAC CAGTACCAGA TACTTTGTCG
GAATACGCCT TTGCCGGATT GCTACGTGGC ACCAAAACCG AAGTAGTGAA GTGTATTTCC
AATGATCTCG AAGTGCCCGC CAGTGCGGAG ATTGTGCTGG AAGGGTATAT CGAACAAGGC
GAAATGGCGC CAGAAGGACC GTATGGTGAC CACACTGGTT ACTATAACGA AGTCGATAGT
TTCCCGGTAT TTACCGTGAC GCATATTACC CAGCGTGAAG ATGCGATTTA CCATTCCACC
TATACCGGGC GTCCGCCAGA TGAACCCGCG GTACTGGGAG TGGCGTTGAA CGAAGTATTT
GTTCCCATTC TGCAAAAGCA GTTCCCGGAA ATTGTCGATT TTTACCTGCC GCCGGAAGGC
TGCTCTTATC GCCTGGCGGT AGTGACAATC AAAAAACAGT ACGCCGGACA CGCGAAGCGC
GTCATGATGG GCGTCTGGTC GTTCTTACGC CAGTTTATGT ACACTAAATT TGTGATCGTT
TGCGATGATG ACGTTAACGC ACGCGACTGG AACGATGTGA TTTGGGCGAT TACCACCCGT
ATGGACCCAG CGCGGGATAC TGTTCTGGTA GAAAATACGC CTATTGATTA TCTGGATTTT
GCCTCGCCTG TCTCCGGGCT GGGTTCAAAA ATGGGGCTGG ATGCCACGAA TAAATGGCCG
GGGGAAACCC AGCGTGAATG GGGACGTCCC ATCAAAAAAG ATCCAGATGT TGTCGCACAT
ATTGACGCCA TCTGGGATGA ACTGGCTATT TTTAACAACG GTAAAAGCGC CTGA
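
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing statistics such as the GC content listed in the gene information. The Python sketch below shows one way to do that. The helper names are hypothetical, and the GC definition used here (G plus C divided by total length) is an assumption, since the record does not say how its figure was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, pasted with its original spacing.
first_line = "ATGGACGCCA TGAAATATAA CGATTTACGC GACTTCTTGA CGTTGCTTGA ACAGCAGGGT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Pasting the full gene sequence in place of `first_line` gives the value for the whole gene, which can then be compared with the figure in the record.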
 
Protein sequence
MDAMKYNDLR DFLTLLEQQG ELKRITLPVD PHLEITEIAD RTLRAGGPAL LFENPKGYSM 
PVLCNLFGTP KRVAMGMGQE DVSALREVGK LLAFLKEPEP PKGFRDLFDK LPQFKQVLNM
PTKRLRGAPC QQKIVSGDDV DLNRIPIMTC WPEDAAPLIT WGLTVTRGPH KERQNLGIYR
QQLIGKNKLI MRWLSHRGGA LDYQEWCAAH PGERFPISVA LGADPATILG AVTPVPDTLS
EYAFAGLLRG TKTEVVKCIS NDLEVPASAE IVLEGYIEQG EMAPEGPYGD HTGYYNEVDS
FPVFTVTHIT QREDAIYHST YTGRPPDEPA VLGVALNEVF VPILQKQFPE IVDFYLPPEG
CSYRLAVVTI KKQYAGHAKR VMMGVWSFLR QFMYTKFVIV CDDDVNARDW NDVIWAITTR
MDPARDTVLV ENTPIDYLDF ASPVSGLGSK MGLDATNKWP GETQREWGRP IKKDPDVVAH
IDAIWDELAI FNNGKSA
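
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so the sketch below uses the standard codon assignments. It is a simplified illustration with hypothetical function names: it stops at the first stop codon and does not handle ambiguous bases or alternative start codons.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map every codon (in TCAG order) to its one-letter amino acid; '*' is stop.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and the last fifteen bases of the gene sequence.
print(translate("ATGGACGCCA TGAAATATAA CGATTTACGC"))  # MDAMKYNDLR
print(translate("GGTAAAAGCGCCTGA"))                   # GKSA
```

The two outputs match the first ten and last four residues of the protein sequence listed above.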