Gene Rsph17029_2334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2334 
SymbolhemC 
ID4896430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2470072 
End bp2471040 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content69% 
IMG OID640112930 
Productporphobilinogen deaminase 
Protein accessionYP_001044208 
Protein GI126463094 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.69155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.249742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGCG CTAACCAAAT CGGCATGACA CACTCCATGC CCACCCCCGC TGAACCCCTC 
AAGATCGGCA CGCGCGGCTC GCCGCTTGCG CTGGCTCAGG CCTACGAGAC CCGCAGCCGC
CTGTCGGCCG CCTTCTCGCT GCCGGAAGAG GCATTCGAAA TCGTGGTCAT AAAAACCACA
GGTGACAAGG TCTTGGACCG TCCGCTCAAG GAGATCGGCG GCAAGGGCCT GTTCACCCGC
GAGATCGAGG AGGCCCTGCT GTCGGGGGGA ATCGACATCG CGGTCCATTC GATGAAGGAC
ATGCCGACGC TTCAGCCCGA GGGGCTGATC CTCGACACCT ACCTCCCGCG CGAGGACACG
CGGGATGCCT TCATCACCTT CGCCGAGGGG GGGCTGGCGG ATTTGCCGCA GGGGGCCACG
GTCGGCTCGT CGAGCCTGCG CCGCCGCGCG CAGCTGCTGA ACAAGCGGCC GGACCTGCAG
GTGGTCGAGT TCCGCGGGAA CCTCCAGACC CGTCTGAAGA AGCTGAACGA CGGGGTGGCG
CGGGGCACCT TCCTCGCGAT GGCCGGGCTG AACCGGCTGA AGATGAACGA GGTGCCGCGG
GTGCCGATCG AGCCCGAGGA AATGCTCTCG GCCGTGGCGC AGGGCGCCAT CGGGATCGAG
CGACGGACCG ACGATCCGCG GGCGCAGGAG ATGCTGGCGG CGATCCATGA CGTGCCCACG
GGGCACCGGC TCGCGGCCGA GCGCAGCTTC CTTCTGAAGC TCGACGGCTC GTGCGAGACG
CCGATCGCGG GGCTCGCGAT CCTCGAGGGC GATCAGCTGT GGCTGCGCGG CGAGATCCTG
CGGCCGGACG GGTCCGAGTC GATCTCGGGC GAGATCCGCG GTGCGATCGC GGATGCGGCC
GCCCTCGGGG TCGAACTGGC CTCGGAGCTT CTGGGCCGGG CGCCGGCCGA CTTCTTCAGC
TGGCGTTGA
 
Protein sequence
MDSANQIGMT HSMPTPAEPL KIGTRGSPLA LAQAYETRSR LSAAFSLPEE AFEIVVIKTT 
GDKVLDRPLK EIGGKGLFTR EIEEALLSGG IDIAVHSMKD MPTLQPEGLI LDTYLPREDT
RDAFITFAEG GLADLPQGAT VGSSSLRRRA QLLNKRPDLQ VVEFRGNLQT RLKKLNDGVA
RGTFLAMAGL NRLKMNEVPR VPIEPEEMLS AVAQGAIGIE RRTDDPRAQE MLAAIHDVPT
GHRLAAERSF LLKLDGSCET PIAGLAILEG DQLWLRGEIL RPDGSESISG EIRGAIADAA
ALGVELASEL LGRAPADFFS WR