Gene EcSMS35_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4224 
SymbolubiD 
ID6146624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4320224 
End bp4321717 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content54% 
IMG OID641619047 
Product3-octaprenyl-4-hydroxybenzoate decarboxylase 
Protein accessionYP_001746175 
Protein GI170682759 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.514143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00200609 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGCCA TGAAATATAA CGATTTACGC GACTTCTTGA CGCTGCTTGA ACAGCAGGGT 
GAGCTAAAAC GTATCACGCT CCCGGTGGAT CCGCATCTGG AAATCACTGA AATTGCTGAC
CGCACTTTGC GTGCCGGTGG GCCTGCGCTG TTGTTCGAAA ACCCTAAAGG CTACTCAATG
CCGGTGCTGT GCAACCTGTT CGGTACGCCA AAGCGCGTGG CGATGGGCAT GGGGCAGGAA
GATGTTTCGG CGCTGCGTGA AGTTGGTAAA TTATTGGCGT TTCTGAAAGA GCCGGAGCCG
CCAAAAGGTT TCCGCGACCT GTTTGATAAA CTGCCGCAGT TTAAGCAAGT ATTGAACATG
CCGACAAAGC GGCTGCGTGG TGCGCCTTGC CAACAAAAAA TCGTCTCTGG CGATGACGTC
GATCTCAATC GCATTCCCAT TATGACCTGC TGGCCGGAAG ATGCCGCGCC GCTGATTACC
TGGGGGCTGA CCGTGACGCG CGGCCCGCAT AAAGAGCGGC AGAATCTGGG CATTTATCGC
CAGCAGCTGA TTGGTAAAAA CAAACTGATT ATGCGCTGGC TGTCGCATCG CGGCGGCGCG
CTGGATTATC AGGAGTGGTG TGTGGCGCAT CCGGGCGAAC GTTTCCCGGT TTCTGTGGCG
CTGGGTGCCG ATCCCGCCAC GATTCTCGGT GCAGTCACCC CCGTTCCGGA TACGCTTTCA
GAGTATGCGT TTGCCGGATT GCTACGCGGC ACCAAAACCG AAGTAGTAAA GTGTATTTCC
AATGATCTCG AAGTGCCCGC CAGTGCGGAG ATTGTGCTGG AAGGGTATAT CGAACAAGGC
GAAACTGCGC CGGAAGGGCC GTATGGCGAC CACACCGGTT ACTATAACGA AGTCGATAGT
TTCCCGGTAT TTACCGTGAC GCATATTACC CAGCGTGAAG ATGCGATTTA TCATTCCACC
TATACCGGGC GTCCGCCGGA TGAACCTGCG GTGCTGGGTG TCGCACTGAA CGAAGTGTTT
GTGCCGATTC TGCAAAAACA GTTCCCGGAA ATTGTCGATT TTTACCTGCC GCCGGAAGGC
TGCTCTTATC GCCTGGCGGT AGTGACGATC AAAAAACAGT ACGCCGGACA CGCGAAGCGC
GTCATGATGG GCGTCTGGTC GTTCTTACGC CAGTTTATGT ACACTAAATT TGTGATCGTT
TGCGATGATG ACGTCAACGC ACGCGACTGG AACGATGTGA TTTGGGCGAT TACCACCCGT
ATGGACCCGG CGCGGGACAC GGTGTTGGTC GAAAATACGC CTATTGATTA TCTGGATTTT
GCCTCGCCTG TGTCCGGGCT GGGTTCAAAA ATGGGGCTGG ATGCCACGAA TAAATGGCCA
GGGGAAACCC AGCGTGAATG GGGACGTCCC ATCAAAAAAG ATCCGGATGT TGTCGCGCAT
ATTGACGCCA TCTGGGATGA ACTGGCTATT TTTAACAACG GTAAAAGCGC CTGA
 
Protein sequence
MDAMKYNDLR DFLTLLEQQG ELKRITLPVD PHLEITEIAD RTLRAGGPAL LFENPKGYSM 
PVLCNLFGTP KRVAMGMGQE DVSALREVGK LLAFLKEPEP PKGFRDLFDK LPQFKQVLNM
PTKRLRGAPC QQKIVSGDDV DLNRIPIMTC WPEDAAPLIT WGLTVTRGPH KERQNLGIYR
QQLIGKNKLI MRWLSHRGGA LDYQEWCVAH PGERFPVSVA LGADPATILG AVTPVPDTLS
EYAFAGLLRG TKTEVVKCIS NDLEVPASAE IVLEGYIEQG ETAPEGPYGD HTGYYNEVDS
FPVFTVTHIT QREDAIYHST YTGRPPDEPA VLGVALNEVF VPILQKQFPE IVDFYLPPEG
CSYRLAVVTI KKQYAGHAKR VMMGVWSFLR QFMYTKFVIV CDDDVNARDW NDVIWAITTR
MDPARDTVLV ENTPIDYLDF ASPVSGLGSK MGLDATNKWP GETQREWGRP IKKDPDVVAH
IDAIWDELAI FNNGKSA