Gene ECH74115_3989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3989 
SymbolkpdC 
ID6967454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3688833 
End bp3690260 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content58% 
IMG OID643387758 
Product4-hydroxybenzoate decarboxylase, subunit C 
Protein accessionYP_002272201 
Protein GI209395965 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTG ATGATTTACG CAGCTTTTTA CAGGCGCTTG ATGACCACGG CCAGTTACTG 
AAAATCAGCG AAGAGGTGAA CGCGGAACCG GATCTGGCTG CCGCTGCCAA CGCCACCGGG
CGTATCGGCG ATGGTGCACC GGCGCTGTGG TTTGATAATA TTCGCGGTTT TACCGATGCC
CGCGTGGCGA TGAATACCAT CGGTTCCTGG CAGAACCACG CGATTTCCTT GGGCCTGCCG
CCAAACACCC CGGTTAAAAA ACAGATTGAT GAGTTTATCC GCCGCTGGGA TAACTTCCCG
ATCGCCCCGG AGCGCCGCGC CAATCCAGCC TGGGCGCAGA ACACCGTCGA TGGTGACGAG
ATTAATCTGT TCGATATCCT GCCGCTGTTT CGTTTAAACG ACGGTGATGG CGGTTTCTAT
CTCGACAAAG CGTGCGTGGT TTCCCGCGAT CCGCTCGACC CGGATAACTT CGGCAAGCAG
AACGTCGGTA TCTACCGCAT GGAAGTGAAG GGCAAGCGTA AGCTCGGCCT GCAACCGGTG
CCGATGCACG ATATCGCCCT GCATCTGCAT AAAGCGGAAG AGCGCGGTGA AGATCTGCCG
ATTGCCATCA CCCTGGGTAA CGATCCGATC ATCACCCTTA TGGGCGCCAC GCCGCTGAAA
TACGATCAGT CTGAGTATGA AATGGCAGGC GCGCTGCGCG AAAGCCCGTA CCCGATCGCC
ACCGCGCCAT TGACTGGTTT TGATGTGCCG TGGGGTTCAG AAGTGATCCT CGAAGGGGTT
ATCGAAAGCC GCAAACGCGA AATCGAAGGG CCGTTCGGTG AGTTTACCGG GCACTACTCC
GGCGGGCGTA ACATGACCGT GGTGCGTATC GATAAAGTCT CTTACCGCAC CAGGCCGATT
TTCGAATCGC TGTACCTCGG CATGCCGTGG ACCGAAATCG ACTACCTGAT GGGGCCAGCC
ACCTGTGTGC CGCTGTATCA ACAACTGAAA GCCGAGTTCC CGGAAGTGCA GGCGGTAAAC
GCCATGTACA CCCACGGCCT GCTGGCGATT ATCTCCACCA AAAAACGCTA CGGCGGCTTT
GCCCGCGCGG TGGGCCTGCG TGCGATGACC ACGCCGCACG GTCTGGGCTA CGTAAAGATG
GTGATTATGG TCGATGAAGA CGTTGACCCG TTCAACCTGC CGCAGGTGAT GTGGGCGCTC
TCCTCGAAAG TAAATCCGGC AGGGGATTTG GTGCAGTTGC CGAATATGTC CGTGCTGGAA
CTCGACCCTG GCTCAAGCCC GGCGGGGATC ACCGACAAGC TGATTATCGA CGCCACCACG
CCTGTCGCCC CGGACAACCG TGGTCACTAC AGCCAGCCGG TGGTGGATTT GCCGGAAACC
AAAGCCTGGG CTGAAAAACT GACCGCTATG CTGGCCGCAC GTAAATAA
 
Protein sequence
MAFDDLRSFL QALDDHGQLL KISEEVNAEP DLAAAANATG RIGDGAPALW FDNIRGFTDA 
RVAMNTIGSW QNHAISLGLP PNTPVKKQID EFIRRWDNFP IAPERRANPA WAQNTVDGDE
INLFDILPLF RLNDGDGGFY LDKACVVSRD PLDPDNFGKQ NVGIYRMEVK GKRKLGLQPV
PMHDIALHLH KAEERGEDLP IAITLGNDPI ITLMGATPLK YDQSEYEMAG ALRESPYPIA
TAPLTGFDVP WGSEVILEGV IESRKREIEG PFGEFTGHYS GGRNMTVVRI DKVSYRTRPI
FESLYLGMPW TEIDYLMGPA TCVPLYQQLK AEFPEVQAVN AMYTHGLLAI ISTKKRYGGF
ARAVGLRAMT TPHGLGYVKM VIMVDEDVDP FNLPQVMWAL SSKVNPAGDL VQLPNMSVLE
LDPGSSPAGI TDKLIIDATT PVAPDNRGHY SQPVVDLPET KAWAEKLTAM LAARK