Gene Pden_4791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_4791 
Symbol 
ID4583353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008688 
Strand
Start bp285774 
End bp286877 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content71% 
IMG OID639772095 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_918548 
Protein GI119387514 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.22928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.130512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCC GGTCCGGCCC TTTGCAGAAA GAAAGAGCCA TGCCCATCCA GACCGAGAAC 
CTGCATATCG CCGCATTGCG CCCCCTGCCC GCCCCGGCCG CCCTGGCCGC ATCCCTGCCG
CGGGACGAAG CGGTCTCCCG CACCGTCGCC GACAGCCGCG CCGCCATCCG CGCCATCCTG
GCCGGGCGCG ACGACCGGCT GCTGGTCGTC GCCGGCCCCT GCTCGGTCCA TGATCCGGCT
GCGGCGCTGG ATTACGCCGC GCGCCTGGCC GAGATGCGCC ACGCGCTGTC CGACCGGCTG
GAGATCGTCA TGCGGGTCTA TTTCGAAAAG CCGCGCACGA CCGTCGGCTG GAAGGGGCTG
ATCAACGATC CGCATCTGGA CGGCTCGGAC CGGATCGAGG ACGGGCTGCC CCTGGCCCGC
CGCCTGCTGC TGGAGATCAA CCGCATGGGC CTGCCGGCGG CGACCGAGTT CCTGGACCCG
ATTCTGCCGC AATACTTCGC CGACCTGATC GCCTGGGGCG CAATCGGCGC GCGCACCACG
GAAAGCCAGA TCCATCGCCA GCTGGCCTCG GGCCTGTCCT GCCCGGTGGG GTTCAAGAAC
GGCACCGACG GCGGGGTGCA GGTGGCGCTG GACGCGATCC GCTCGGCTTC GCGGCCGCAC
AGCTTTCCCG CGATCACCGC CGAAGGGCGC GCGGCCATCG CCACGACCAC CGGCAACGAT
GCCTGCCACG TCGTGCTGCG CGGCGGCCAT GGCGGGCCGA ATTACGGCGC CGACCATGTC
GCGGCAGTGG CGGCGGCTGC GGCCAAGGCG GGGATCGAGC CCGGTATCGT CATCGACGCC
AGCCACGCCA ACAGCGACAA GGATCCCGCC CGACAGCCGG AGGTGATCGC CGATGTCGCG
GCTCGGATCC GCACGGGCGA CAGCCGCATT CGCGGGGTCA TGCTGGAAAG CCATCTGGTG
GCGGGACGGC AGGATCTGCG GGACGGCCAG GTGCCGGTCT ATGGCCAGAG CATCACCGAC
GGCTGCCTGG GCTGGGAGGA CAGCCGCGCG CTGCTCCTGG ACCTTGCCGG GGCCGCGGCG
ACGCGGCTGC GCTGCGCCGC CTGA
 
Protein sequence
MRFRSGPLQK ERAMPIQTEN LHIAALRPLP APAALAASLP RDEAVSRTVA DSRAAIRAIL 
AGRDDRLLVV AGPCSVHDPA AALDYAARLA EMRHALSDRL EIVMRVYFEK PRTTVGWKGL
INDPHLDGSD RIEDGLPLAR RLLLEINRMG LPAATEFLDP ILPQYFADLI AWGAIGARTT
ESQIHRQLAS GLSCPVGFKN GTDGGVQVAL DAIRSASRPH SFPAITAEGR AAIATTTGND
ACHVVLRGGH GGPNYGADHV AAVAAAAAKA GIEPGIVIDA SHANSDKDPA RQPEVIADVA
ARIRTGDSRI RGVMLESHLV AGRQDLRDGQ VPVYGQSITD GCLGWEDSRA LLLDLAGAAA
TRLRCAA