Gene Rsph17029_1317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1317 
Symbol 
ID4896652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1363193 
End bp1364329 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content68% 
IMG OID640111904 
Productputative deoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_001043199 
Protein GI126462085 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGCTC CCTTCGCCTG CCAGCCCGGC GAGAGCCGCG GCCGGCAAAA GCCCGAGAGC 
ATGTCCACGT TCCGCTCTCC CTTCCAGCGG GACCGCGACC GGATCATCCA CTCCTCCGCC
TTCCGCAGGC TGAAGCACAA GACCCAGGTC TTCGTCGAGC ACGAGGGCGA CTATTACCGC
ACGCGGCTCA CCCATTCCAT CGAAGTGGCG CAGGTCGCGC GCACGATCTC GGGCGTGCTC
GGGCTGAACA CCGATCTGGC AGAATGTATC GCGCTGGCCC ACGATCTCGG CCACACGCCC
TTCGGCCACA CCGGCGAGGA TGCGCTGGCG CGGCTCATGG AGCCCTACGG CGGCTTCGAC
CACAATGCGC AGGCCATGCG GATCGTGACC CGGCTCGAGC GCCATTACGC CGAGTTCGAC
GGGCTGAACC TCACCTGGGA GTCGCTCGAG GGCATCGCCA AGCACAACGG CCCGGTCGAG
GGGCCCTTGC CCTATGCGCT GGCCGAGGCC AATGCGCAGT GGGATCTGGA ACTCCATACC
TATGCCTCGG CCGAGGCGCA GGTGGCGGCC ATCGCCGACG ACGTGGCCTA TTCGCACCAC
GATCTGCACG ACGGGCTCCG CTCGGGTCTC TTCACCGAGG CGGATCTGAT GGAACTGCCC
GTCACCGCCC CCGCCTTCGA CGAGGTGGAC GCGCTCTATC CGGGGCTGGA GCCGATGCGG
CGGCGGCACG AGGCGCTGCG CCGGGTCTTC GGCCGGATGG TCGAGGATGT GATCGCGGTG
GCGCAGGGCC GGCTCGAGGC GGCGCAGCCG AAGTCGGTCG AGGAGATCCG CCAGATGGGC
GCCACGGTCA TCCGCTTCTC GAAGCCGCTC TATCAGGAGC TGAAGGTGAT CCGCAGTTTC
CTCTTTCACC GGATGTATCG CGCGCCCTCG GTGATGAAGG AGCGGGCCAA GGTCACGGCG
GTGGTGAACG ATCTCTTTCC GCTCTTCATG GCACGTCCCG AGCTTCTGCC GCAGGAATGG
CGGCGCGACG TGGAGGCCGC CGCCGACGAA ACCACGCTTG CCCGCATCGT GGCCGACTAT
GTCGCGGGCA TGACCGACCG CTTCGCCCTG CAGGAACACG CGCGGCTCTG CGGCTGA
 
Protein sequence
MLAPFACQPG ESRGRQKPES MSTFRSPFQR DRDRIIHSSA FRRLKHKTQV FVEHEGDYYR 
TRLTHSIEVA QVARTISGVL GLNTDLAECI ALAHDLGHTP FGHTGEDALA RLMEPYGGFD
HNAQAMRIVT RLERHYAEFD GLNLTWESLE GIAKHNGPVE GPLPYALAEA NAQWDLELHT
YASAEAQVAA IADDVAYSHH DLHDGLRSGL FTEADLMELP VTAPAFDEVD ALYPGLEPMR
RRHEALRRVF GRMVEDVIAV AQGRLEAAQP KSVEEIRQMG ATVIRFSKPL YQELKVIRSF
LFHRMYRAPS VMKERAKVTA VVNDLFPLFM ARPELLPQEW RRDVEAAADE TTLARIVADY
VAGMTDRFAL QEHARLCG