Gene Rsph17025_1197 details


Gene Information

Locus tag: Rsph17025_1197
Symbol:
ID: 5084481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1238320
End bp: 1239456
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 67%
IMG OID: 640482755
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_001167403
Protein GI: 146277244
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
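
The coordinate and length fields in this record are consistent with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1238320, 1239456)
print(length)                           # 1137
print(expected_protein_length(length))  # 378
```

Both results match the listed Gene Length (1137 bp) and Protein Length (378 aa).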


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0299714
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGCGC CCTATGCCTG CCAGCCCGGC GAAAGCCGCG GCCGGCAACA GCCCGAGAGC 
ATGTCCACCT TCCGCTCGCC GTTCCAGCGG GATCGAGACC GGATCATCCA TTCCTCGGCC
TTCCGGCGGC TGAAGCACAA GACTCAGGTC TTCGTGGAAC ATGAGGGCGA CTACTACCGC
ACGCGGCTCA CCCATTCGAT CGAAGTGGCG CAGGTCGCGC GGACCATCTC GGGCGTGCTG
GGGCTGAACA CCGATCTGGC CGAGTGCATC GCGCTGGCCC ACGATCTCGG CCACACGCCC
TTCGGCCACA CCGGCGAGGA TGCGCTGGCG AAGCTGATGG AGCCCTACGG CGGATTCGAC
CACAACGCGC AGGCCATGCG GATCGTGACC CGGCTGGAAC GCCATTACGC CGAGTTCGAC
GGGCTGAACC TCACATGGGA GTCGCTGGAA GGCATCGCCA AGCACAACGG CCCGGTCGAG
GGGCCCTTGC CCTATGCGCT GGCCGAGGCC AATGCGCAGT GGGATCTGGA ACTGCACACC
TACGCCTCGG CCGAGGCGCA GGTGGCGGCG ATCGCCGACG ACGTGGCCTA TTCGCACCAC
GACCTGCACG ACGGGCTGCG CTCTGGCCTG TTCACCGAGG ACGACCTGAT GGAGCTGCCC
GTCACCGCGC CCGCCTTTGC CGAGGTCGAT GCGCTCTATC CGGGGCTGGA GCCGATGCGC
CGGCGGCACG AGGCGCTGCG GCGCGTCTTC GGCCGCATGG TCGAGGATGT GATCGCCGTG
GCGCAGGGGC GGCTCGAGGC CGCGCAGCCG AAGTCGGTCG AGGAGATCCG CCAGATGGGC
GCGACCGTGA TCCGCTTTTC GAAACCGCTC TATCAGGAGC TGAAGGTGAT CCGCAGCTTC
CTGTTCCACC GGATGTATCG CGCGCCCTCG GTGATGAAGG AACGCGCGAA GGTGACGGCG
GTGGTGAACG ATCTCTTTCC GCTGTTCATG CGCCAGCCCG AGCTTCTGCC GCAGGAATGG
CGGCGCGATG TCGAGGCGGC CGAGGACGAG ACGACGCTCG CCCGGATCGT CGCCGATTAC
GTCGCCGGCA TGACCGACCG CTTCGCCCTG CAGGAACATG CCCGCCTCTG CGGCTGA
 
Protein sequence
MLAPYACQPG ESRGRQQPES MSTFRSPFQR DRDRIIHSSA FRRLKHKTQV FVEHEGDYYR 
TRLTHSIEVA QVARTISGVL GLNTDLAECI ALAHDLGHTP FGHTGEDALA KLMEPYGGFD
HNAQAMRIVT RLERHYAEFD GLNLTWESLE GIAKHNGPVE GPLPYALAEA NAQWDLELHT
YASAEAQVAA IADDVAYSHH DLHDGLRSGL FTEDDLMELP VTAPAFAEVD ALYPGLEPMR
RRHEALRRVF GRMVEDVIAV AQGRLEAAQP KSVEEIRQMG ATVIRFSKPL YQELKVIRSF
LFHRMYRAPS VMKERAKVTA VVNDLFPLFM RQPELLPQEW RRDVEAAEDE TTLARIVADY
VAGMTDRFAL QEHARLCG
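
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted. It therefore does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# amino acid assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGCTGGCGC CCTATGCCTG CCAGCCCGGC"))  # MLAPYACQPG
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to cross-check the Protein Length and GC content fields.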