Gene Rsph17029_1907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1907 
Symbol 
ID4895137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2019670 
End bp2020626 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content69% 
IMG OID640112501 
Productchlorophyll synthesis pathway, BchC 
Protein accessionYP_001043783 
Protein GI126462669 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.274753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.343747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAACGA CCGCCGTCAT CCTGTCGGGT CCGCGGGACC TTGGCCTTCA GACCATCCAG 
CTGAACGAGC CCGCGCCCGG CGATATCGTC GTCGAGATCA CCCATTCGGG CATTTCGACG
GGCACCGAAA AACTGTTCTA CACCGGCCAG ATGCCGCCCT TTCCGGGCAT GGGCTACCCG
CTGGTGCCGG GCTACGAGGC CGCCGGCGAA GTGGTCGAGG CCGCGCCCGA TACGGGCTTC
CGGCCGGGCG ACCGGGTCTT CGTGCCGGGC TCCAACTGTT TTGCGCCGAC CGATGCGGGG
CCGATCCGCG GCCTGTTCGG AGCGGCGACG AAGCGGCTCG TGACGCCCGC CCATCGCGCC
GTGCGCATCG ATCCTGCGCT CGAGGCCGAG GGGGCGCTTC TGGCGCTTGC CGCGACCGCG
CGCCATGCGC TGGCCGGGCT GAACCATGTG CTGCCGGACC TGATCGTGGG TCACGGCACG
CTGGGCCGGC TGCTCGCGCG TCTGACCATT GCCGCGGGCG GCGAGCCGCC GGTGGTCTGG
GAGACCAAGG CGGAACGGCG CCGCCATGCC GAGGGCTACG AGGTCATCGA CCCCGCGACC
GACCAGCGCC GCGACTACCG CTCGATCTAC GATGCGTCGG GCGATCCGAA ATTGATCGAC
AGTCTGGTGA TGCGGCTTGC CAAGGGCGGC GAGATCGTGC TGGCGGGCTT CTATACCGAA
CCCGTTGCCT TCACCTTCGT GCCCGCCTTC ATGAAGGAGG CGCGCCTGCG CATCGCTGCC
GAGTGGCAGC CCGAGGACAT GGTGGCCACC CGCGCGCTGA TCGAGAGCGG GGCGCTTTCG
CTTGCCAATC TGATCACCCA CACCCGACCG GCGTCGGAGG CGGCCGAGGC CTACGCCACG
GCCTTCAGCG ACCCCGACTG TCTCAAGATG ATCCTGGATT GGAGGGCCAC CGCATGA
 
Protein sequence
MRTTAVILSG PRDLGLQTIQ LNEPAPGDIV VEITHSGIST GTEKLFYTGQ MPPFPGMGYP 
LVPGYEAAGE VVEAAPDTGF RPGDRVFVPG SNCFAPTDAG PIRGLFGAAT KRLVTPAHRA
VRIDPALEAE GALLALAATA RHALAGLNHV LPDLIVGHGT LGRLLARLTI AAGGEPPVVW
ETKAERRRHA EGYEVIDPAT DQRRDYRSIY DASGDPKLID SLVMRLAKGG EIVLAGFYTE
PVAFTFVPAF MKEARLRIAA EWQPEDMVAT RALIESGALS LANLITHTRP ASEAAEAYAT
AFSDPDCLKM ILDWRATA