Gene EcSMS35_2863 details

Gene Information

Locus tag: EcSMS35_2863
Symbol: kpdC
ID: 6145321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2937063
End bp: 2938490
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 58%
IMG OID: 641617732
Product: 4-hydroxybenzoate decarboxylase, subunit C
Protein accession: YP_001744887
Protein GI: 170681645
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
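
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene information fields above.
START_BP = 2937063
END_BP = 2938490


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1428 475
```

Both results match the listed values of 1428 bp and 475 aa.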


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.967142
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTTG ATGATTTACG CAGCTTTTTA CAGGCGCTCG ACGACCATGG GCAATTGCTG 
AAAATCAGCG AAGAGGTGAA CGCAGAACCG GATCTGGCAG CCGCAGCCAA CGCCACCGGG
CGTATCGGCG ACGGCGCGCC CGCGCTGTGG TTTGACAATA TTCGCGGCTT TACCGATGCC
CGCGTGGCGA TGAACACCAT CGGTTCCTGG CAGAACCACG CGATTTCCCT CGGCCTGCCG
CCAAATACCC CGGTTAAAAA ACAGATCGAT GAGTTTATCC GCCGCTGGGA CAACTTCCCG
ATAGCCCCGG AACGCCGAGC CAATCCAGCC TGGGCGCAGA ACACCGTCGA TGGCGAAAAG
ATTAACCTGT TCGATATCCT GCCGCTGTTT CGTTTAAACG ACGGTGATGG CGGTTTCTAT
CTCGACAAAG CGTGCGTGGT TTCCCGCGAT CCGCTCGACC CGGATAACTT CGGCAAGCAG
AACGTCGGTA TCTACCGCAT GGAAGTGAAG GGCAAACGTA AGCTCGGCCT GCAACCGGTG
CCGATGCACG ATATCGCGCT GCATCTGCAT AAAGCGGAAG AGCGCGGTGA AGATCTGCCG
ATTGCCATCA CCCTCGGTAA CGATCCGATC ATCACCCTGA TGGGGGCCAC GCCGCTGAAA
TATGATCAGT CCGAGTACGA AATGGCGGGC GCGCTGCGCG AAAGTCCGTA CCCGATTGCC
ACCGCCCCAC TGACCGGTTT TGATGTACCC TGGGGCTCGG AAGTGATCCT CGAAGGGGTT
ATCGAAAGTC GTAAACGTGA AATCGAAGGG CCGTTCGGTG AGTTTACCGG GCACTACTCC
GGCGGGCGCA ACATGACCGT GGTGCGCATC GATAACGTCT CTTATCGCAG CAAACCCATT
TTTGAATCGC TCTATCTCGG GATGCCGTGG ACGGAAATCG ACTACCTGAT GGGGCCAGCC
ACCTGCGTGC CGCTGTATCA GCAACTGAAA GCCGAGTTCC CGGAAGTGCA GGCGGTAAAC
GCCATGTACA CCCACGGCCT GCTGGCGATT ATCTCCACCA AAAAACGCTA CGGTGGCTTT
GCCCGCGCGG TGGGCCTCCG TGCGATGACC ACGCCACACG GTCTGGGCTA TGTGAAGATG
GTGATTATGG TCGATGAAGA CGTTGATCCG TTCAACCTGC CGCAGGTGAT GTGGGCGCTC
TCTTCGAAAG TGAACCCGGC AGGGGATTTG GTGCAGTTGC CGAATATGTC CGTGCTGGAA
CTCGACCCTG GCTCAAGTCC GGCGGGGATC ACCGACAAGC TGATTATCGA CGCCACCACG
CCTGTCGCCC CGGACAACCG TGGTCACTAC AGTCAGCCGG TGGTGGATTT ACCGGAAACC
AAAGCCTGGG CTGAAAAACT GACCGCTATG CTGGCCGCAC GTCAATAA
 
Protein sequence
MAFDDLRSFL QALDDHGQLL KISEEVNAEP DLAAAANATG RIGDGAPALW FDNIRGFTDA 
RVAMNTIGSW QNHAISLGLP PNTPVKKQID EFIRRWDNFP IAPERRANPA WAQNTVDGEK
INLFDILPLF RLNDGDGGFY LDKACVVSRD PLDPDNFGKQ NVGIYRMEVK GKRKLGLQPV
PMHDIALHLH KAEERGEDLP IAITLGNDPI ITLMGATPLK YDQSEYEMAG ALRESPYPIA
TAPLTGFDVP WGSEVILEGV IESRKREIEG PFGEFTGHYS GGRNMTVVRI DNVSYRSKPI
FESLYLGMPW TEIDYLMGPA TCVPLYQQLK AEFPEVQAVN AMYTHGLLAI ISTKKRYGGF
ARAVGLRAMT TPHGLGYVKM VIMVDEDVDP FNLPQVMWAL SSKVNPAGDL VQLPNMSVLE
LDPGSSPAGI TDKLIIDATT PVAPDNRGHY SQPVVDLPET KAWAEKLTAM LAARQ
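
The protein sequence follows from the gene sequence by codon-by-codon translation. A minimal sketch of that step is shown below, assuming the sequence is read 5' to 3' in the printed orientation and using the standard codon assignments (to my knowledge, translation table 11 shares these assignments and differs only in which start codons are permitted). The helpers `clean`, `gc_content`, and `translate` are hypothetical names introduced here for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character grouping spaces and line breaks used on this page."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGCATTTG ATGATTTACG CAGCTTTTTA CAGGCGCTCG ACGACCATGG GCAATTGCTG"
print(translate(first_line))  # MAFDDLRSFLQALDDHGQLL
```

The output corresponds to the first two 10-residue groups of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.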