Gene SeD_A3231 details


Gene Information

Locus tag: SeD_A3231
Symbol: kpdC
ID: 6872100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3108265
End bp: 3109692
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 56%
IMG OID: 642786247
Product: 4-hydroxybenzoate decarboxylase, subunit C
Protein accession: YP_002216888
Protein GI: 198244657
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
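
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3108265, 3109692)
print(length, protein_length(length))
```

With the values in this record, the sketch reproduces the listed gene length (1428 bp) and protein length (475 aa).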


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.632709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTTG ATGATTTGCG CAGTTTTTTA CATGCGCTGG ATCAGCAGGG ACAACTGCTG 
AAAATCAGCG AGGAAGTGAA TGCGGAGCCG GATCTCGCCG CCGCTGCCAA CGCAACGGGA
CGCATCGGCG ATGGCGCGCC CGCGCTGTGG TTTGACAATA TTCGTGGTTT TACCGACGCC
CGCGTGGCGA TGAATACCAT CGGTTCCTGG CAGAACCATG CCATCTCACT GGGTTTGCCG
CCTAACACGC CGGTTAAAAA ACAGATCGAT GAATTTATTC GCCGCTGGGA CAATTTTCCG
GTAGCGCCGG AACGACGGGC GAATCCGGGC TGGGCGGAAA ATACCGTCGA CGGCGACGCA
ATCAACCTGT TTGATATCCT GCCGCTGTTT CGTCTGAACG ACGGCGACGG CGGATTCTAT
CTGGATAAAG CCTGTGTGGT ATCACGCGAT CCGCTCGATC CCGATAACTT CGGCAAGCAG
AATGTCGGCA TCTACCGCAT GGAAGTTAAA GGCAAGCGTA AGCTGGGTCT GCAACCAGTG
CCGATGCATG ATATCGCATT GCACCTGCAC AAAGCTGAAG AGCGCGGGGA AGATCTGCCG
ATTGCGATCA CGTTGGGTAA CGATCCGATC ATTACCCTGA TGGGCGCAAC GCCGCTGAAA
TACGATCAGT CAGAGTATGA AATGGCAGGC GCGCTGCGTG AAAGCCCCTA TCCTATCGCC
ACCGCGCCGC TGACCGGTTT TGATGTGCCG TGGGGATCTG AAGTCATTCT TGAAGGGGTC
ATCGAAAGCC GTAAACGTGA AATTGAAGGA CCGTTCGGCG AATTTACCGG CCACTATTCC
GGCGGTCGCA ACATGACCGT GGTGCGTATC GATAAGGTCT CTTACCGCAG CAAACCCATT
TTTGAATCGC TCTATTTGGG GATGCCGTGG ACGGAAATCG ACTACCTGAT GGGGCCGGCG
ACCTGCGTGC CACTGTATCA ACAGCTGAAA GCCGAATTTC CGGAAGTGCA GGCGGTAAAC
GCCATGTACA CCCACGGCTT GCTGGCCATC ATCTCGACCA AAAAACGCTA CGGCGGCTTT
GCCCGTGCGG TGGGCCTGCG AGCGATGACG ACGCCGCACG GTCTGGGATA TGTGAAGATG
GTGATCATGG TCGATGAAGA TGTCGATCCA TTCAATCTGC CACAGGTGAT GTGGGCGTTG
TCGTCGAAAG TGAATCCGGC AGGCGATCTG GTACAGCTAC CGAATATGTC CGTCCTGGAA
CTGGACCCGG GCTCAAGCCC GGCGGGGATT ACCGACAAGC TGATCATTGA CGCCACCACG
CCGGTTGCGC CAGATAACCG TGGTCACTAT AGCCAGCCGG TTGTTGATTT ACCGGAAACT
AAAGCCTGGG CTGAAAAGCT GACCGCCATG CTGGCCAACC GTAAATAA
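
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is used. The following is a minimal sketch of that clean-up together with a GC-content calculation; the helper names are hypothetical, and only the first line of the sequence is used as sample input. Pasting the full sequence in its place would give the values for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as sample input only.
first_line = (
    "ATGGCATTTG ATGATTTGCG CAGTTTTTTA "
    "CATGCGCTGG ATCAGCAGGG ACAACTGCTG"
)
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```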
 
Protein sequence
MAFDDLRSFL HALDQQGQLL KISEEVNAEP DLAAAANATG RIGDGAPALW FDNIRGFTDA 
RVAMNTIGSW QNHAISLGLP PNTPVKKQID EFIRRWDNFP VAPERRANPG WAENTVDGDA
INLFDILPLF RLNDGDGGFY LDKACVVSRD PLDPDNFGKQ NVGIYRMEVK GKRKLGLQPV
PMHDIALHLH KAEERGEDLP IAITLGNDPI ITLMGATPLK YDQSEYEMAG ALRESPYPIA
TAPLTGFDVP WGSEVILEGV IESRKREIEG PFGEFTGHYS GGRNMTVVRI DKVSYRSKPI
FESLYLGMPW TEIDYLMGPA TCVPLYQQLK AEFPEVQAVN AMYTHGLLAI ISTKKRYGGF
ARAVGLRAMT TPHGLGYVKM VIMVDEDVDP FNLPQVMWAL SSKVNPAGDL VQLPNMSVLE
LDPGSSPAGI TDKLIIDATT PVAPDNRGHY SQPVVDLPET KAWAEKLTAM LANRK