Gene RSP_3644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3644 
Symbol 
ID3722133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp751763 
End bp753184 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content52% 
IMG OID640073320 
Productmetal dependent phosphohydrolase 
Protein accessionYP_355157 
Protein GI77465654 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.287416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTTGA ACATGTACAC ATCTGACGAT TGGCTACGCC AGAACGGCGA GGACTCGGCA 
TCTGACCCTT GGAGACCGCC CGTTGTCAGG GATTTCGGTA GGATCATCCA CAGCGCTAGC
TTCAGAAGGC TGCAAGGTAA AACGCAGGTA TTTCCGGGAC ACGAGTCCGA CTTCTTTAGA
AACCGTCTGA CGCATTCACT TGAGGTCTCA CAAATCGCCG AAGGCATCGC AGACCGTTTG
AACTATGTAT ATGCGGAAAA ACTGGGAGGT CGGCGGATCG ACAGCCGACT CTGCGCAGCG
GCAGGTCTTG TCCACGACAT CGGACATCCC CCATTCGGCC ACAATGGTGA GCGCGCACTC
AACACTAAGA TGGAGATGCG CGGCGGATTC GAGGGGAACG CTCAGACGCT GCGTATCTTA
AGCCGCTTGG AAAAGAAGGC GAAGTACAAA GCGCCTGTAG ATGGCGACGA GCGCGCCGGG
ATGAACTTGT GTTTTCGGAC ACTCGCTGCC GTGCTGAAAT ACGACAACGA GATAGATAAA
GAACGCTTCG GTAGTGATGG TCCGCAAAAA GGGTACTACG CCTCCGAAGC TAAGATTGTC
GCAGAGATAA AAAGAAGAGT TTTGGGAGGG AAAGCCCTTC CGGCTGGCGT CAAGTTTAAG
ACCGTTGAAT GTGCAATAAT GGATATTGCA GACGACATCG CTTATTCTGT TTACGATCTT
GAAGACAGTC TTAAAGCAGG TTTCCTAACG CCTGCCTCAA TTCTGGCTAC AGATGATGAT
TTGCTGAAGA GGGTTGCGGA AAAGGCAACT GAGCAGCTGG CGGAAGATTG CAGCGAGAAG
ATCACCGCAC AAGAAGTCTT GGCGACCTTG GTTGCTCTTT TCGGGGACAT CTTTGCTGTG
GACGAAAGCA GAGAGCAAGA CTTTCCGTTC GAACGACGGA ATAAGGATCT TTCAAGTTTT
ATCAACGCGA TGAAGGCGTC TCGTGCGGTG AACATGGACG CAGGGAGGAG GATGAAGCTA
TCGTCCGAAC TGGTTCACGA ATTCATGAAC GCTGTTGAAC TAGAAATAAA CGAAGAGTTT
CCTGCTCTCT CACGAGCTGC GCTAGATAAG AAGACGCGCA TTAAAGTGGA AATCCTGAAG
CAATATACTT TCCTGTCAAC TATCTACTCG AACCGGGTTA AGCTCGGCGA GTACCGTGGT
ACCGAGTTGG TGGGGGAAAT ATTCGAGGCG CTGGAAAAGA AGAGCGGACA TCTGCTTATG
CCCGACGACG TGCGCAAGCG GGTTCAGGAG GCCGGTGGGG ATAAGGATTT GCAGGCGCGT
CACATTTGCG ACTTTGTCGC AGGAATGACG GATAGGTATG CCGTAGAGTT CTGGGCTCGG
CTCAAGTCAG ATGTCGCGGA AAGCATGTTT AAGCCCATCT AG
 
Protein sequence
MMLNMYTSDD WLRQNGEDSA SDPWRPPVVR DFGRIIHSAS FRRLQGKTQV FPGHESDFFR 
NRLTHSLEVS QIAEGIADRL NYVYAEKLGG RRIDSRLCAA AGLVHDIGHP PFGHNGERAL
NTKMEMRGGF EGNAQTLRIL SRLEKKAKYK APVDGDERAG MNLCFRTLAA VLKYDNEIDK
ERFGSDGPQK GYYASEAKIV AEIKRRVLGG KALPAGVKFK TVECAIMDIA DDIAYSVYDL
EDSLKAGFLT PASILATDDD LLKRVAEKAT EQLAEDCSEK ITAQEVLATL VALFGDIFAV
DESREQDFPF ERRNKDLSSF INAMKASRAV NMDAGRRMKL SSELVHEFMN AVELEINEEF
PALSRAALDK KTRIKVEILK QYTFLSTIYS NRVKLGEYRG TELVGEIFEA LEKKSGHLLM
PDDVRKRVQE AGGDKDLQAR HICDFVAGMT DRYAVEFWAR LKSDVAESMF KPI