Gene EcE24377A_3039 details


Gene Information

Locus tag: EcE24377A_3039
Symbol: kpdC
ID: 5590877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3042634
End bp: 3044061
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 58%
IMG OID: 640926685
Product: 4-hydroxybenzoate decarboxylase, subunit C
Protein accession: YP_001464061
Protein GI: 157156336
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
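
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 3042634
end_bp = 3044061
gene_length_bp = 1428
protein_length_aa = 475

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS ends in one untranslated stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1428 0 475
```

Under these assumptions the reported gene length and protein length agree with the coordinates.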


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTTG ATGATTTACG CAGCTTTTTA CAGGCGCTTG ATGACCACGG CCAGTTACTG 
AAAATCAGCG AAGAGGTGAA CGCTGAGCCG GATCTGGCAG CAGCAGCTAA CGCCACCGGG
CGTATCGGCG ACGGCGCGCC CGCGCTGTGG TTTGATAATA TTCGCGGCTT TACCGATGCC
CGCGTGGCGA TGAACACCAT CGGTTCCTGG CAGAACCACG CGATTTCCCT CGGCCTGCCG
CCAAATGCCC CGGTTAAAAA GCAGATTGAT GAGTTTATCC GCCGCTGGGA TAACTTCCCG
ATTGCCCCGG AGCGCCGCGC CAATCCAGCC TGGGCGCAGA ACACCGTTGA TGGCGACGAG
ATCAACCTGT TCGATATCCT GCCGCTGTTT CGTTTAAACG ATGGCGATGG CGGTTTCTAT
CTCGACAAAG CGTGCGTGGT TTCCCGCGAT CCGCTCGACC CGGATAACTT CGGCAAGCAG
AACGTCGGCA TCTACCGCAT GGAAGTGAAG GGCAAGCGTA AGCTCGGCCT GCAACCGGTG
CCGATGCACG ATATCGCCCT GCATCTGCAT AAAGCAGAAG AGCGCGGTGA AGATCTGCCG
ATTGCGATCA CGCTCGGTAA CGATCCGATC ATCACGCTGA TGGGGGCCAC GCCGCTGAAA
TATGATCAGT CCGAGTACGA AATGGCAGGC GCGCTGCGTG AAAGCCCGTA CCCGATCGCC
ACCGCCCCGC TGACCGGTTT TGATGTGCCG TGGGGTTCAG AAGTGATCCT CGAAGGGGTT
ATCGAAAGCC GTAAACGCGA AATCGAAGGG CCGTTCGGTG AGTTTACCGG GCACTACTCC
GGTGGGCGCA ACATGACCGT GGTGCGCATC GATAAAGTCT CTTATCGCAG CAAACCGATT
TTCGAATCAC TCTATCTCGG TATGCCGTGG ACCGAAATCG ACTACCTGAT GGGGCCAGCC
ACCTGCGTAC CGCTGTATCA ACAACTGAAA GCCGAGTTCC CGGAAGTGCA GGCGGTAAAT
GCCATGTACA CCCACGGCCT GCTGGCGATT ATCTCCACCA AAAAACGCTA CGGCGGCTTT
GCCCGCGCGG TGGGCCTGCG TGCGATGACC ACGCCGCACG GTCTGGGCTA CGTAAAGATG
GTGATTATGG TCGATGAAGA CGTTGACCCG TTCAACCTGC CGCAGGTGAT GTGGGCGCTC
TCCTCGAAAG TAAACCCGGC AGGGGATTTG GTGCAGTTGC CGAATATGTC CGTGCTGGAA
CTCGACCCTG GCTCAAGCCC GGCGGGGATC ACCGACAAGC TGATTATCGA CGCTACCACG
CCTGTCGCCC CGGACAACCG TGGTCACTAC AGCCAGCCGG TGGTGGATTT GCCGGAAACC
AAAGCCTGGG CTGAAAAACT GACCACTATG CTGGCTGCAC GTAAATAA
 
Protein sequence
MAFDDLRSFL QALDDHGQLL KISEEVNAEP DLAAAANATG RIGDGAPALW FDNIRGFTDA 
RVAMNTIGSW QNHAISLGLP PNAPVKKQID EFIRRWDNFP IAPERRANPA WAQNTVDGDE
INLFDILPLF RLNDGDGGFY LDKACVVSRD PLDPDNFGKQ NVGIYRMEVK GKRKLGLQPV
PMHDIALHLH KAEERGEDLP IAITLGNDPI ITLMGATPLK YDQSEYEMAG ALRESPYPIA
TAPLTGFDVP WGSEVILEGV IESRKREIEG PFGEFTGHYS GGRNMTVVRI DKVSYRSKPI
FESLYLGMPW TEIDYLMGPA TCVPLYQQLK AEFPEVQAVN AMYTHGLLAI ISTKKRYGGF
ARAVGLRAMT TPHGLGYVKM VIMVDEDVDP FNLPQVMWAL SSKVNPAGDL VQLPNMSVLE
LDPGSSPAGI TDKLIIDATT PVAPDNRGHY SQPVVDLPET KAWAEKLTTM LAARK
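
As a rough consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal, hand-rolled translator, not the database's own method. It assumes the amino-acid assignments of translation table 11 (which match the standard code; the tables differ in their permitted start codons), and the helper name `translate` is hypothetical. Only the first and last blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, as printed in the record.
first_block = "ATGGCATTTG ATGATTTACG CAGCTTTTTA CAGGCGCTTG ATGACCACGG CCAGTTACTG"
# Final 18 bases of the gene sequence, including the stop codon.
last_block = "CTGGCTGCAC GTAAATAA"

print(translate(first_block))  # MAFDDLRSFLQALDDHGQLL
print(translate(last_block))   # LAARK
```

Both results match the corresponding ends of the protein sequence. Passing the full gene sequence to the same function would allow a complete comparison.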