Gene RPB_2751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2751 
Symbol 
ID3910544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3136057 
End bp3137271 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID637884651 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_486364 
Protein GI86749868 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.276692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.329676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGTCG GAATGGCAGC TCCTCGCGCA GCTTTTAGCT GCGACCCGGA CCGCAGCCGC 
GGCCGGCAAT TCGCCGAGCC GCCGAGCAGC AACCGCAGCG CGTTCCGCCG CGATTGCGAC
CGGGTGATCC ATTCCAATGC CTTCCGTCGG CTGAAGCACA AGACCCAAGT CTTCGTGTTT
CACGAGGGCG ATCATTACCG CACCCGGCTG ACCCACAGCC TGGAAGTGGC CCAGATCGCC
CGCGCCATCG CGCGCCAGCT CGGGCTCGAC GAAGATCTGA CCGAGACGCT GGCGCTGGCG
CACGACCTCG GCCATCCACC GTTCGGCCAT GCCGGCGAGC GAGCGCTCGA CGCCTGTCTG
CGAGACCACG GCGGGTTCGA CCACAACGCG CAGACGCTGC GGGTGCTGAC GGCACTTGAG
CACCGCTATC CAGGCTTCGA CGGGCTGAAC CTGACCTGGG AAACGCTCGA AGGCGTGGTC
AAGCACAACG GCCCGCTGAC CGATCGCACC GGAGCGCCGC TGCCGCGCCA TGCCGAGCGT
GGCGTGCCGA TCGGCATTGC CGAATTCAGC CAACGCTTCG ATCTCGAAAT ATGGAGCTTC
GCCTCGCTCG AAGCCCAGGT CGCGGCGCTT GCCGACGACA TCGCCTACGA CGCCCACGAC
ATCGACGATG GTCTTCGCGC CGGGTTGTTC CGGGTCGACG ATCTGCGCGC CGTGCCGCTG
ACCGCCGCCC TCATAGATGG CATTTCGCGA CGCTATCCGG CGCTGGGCGA GAGCCGTCGC
GGCGCCGAAC TCGTCCGCGA GCTGATTTCG CATCTGATCG GCGCCGTCAC GGCAGAGACC
ATGCGCCGGC TCGGCGAGGC GGCGCCACGA TCGGTCGAGG ACGTGCGCCA CGCCAGCACG
GCGATGGTCG CGTTTCCGTC CGAAACGGCC GTCGCGGAGG CCGAGATCAA AGCCTTTCTC
TGGACCCATA TGTACCGCGC CGAGCGGGTC ATGGCGGTGA TGCGGGACGC CGAGGCGATC
GTCGCCGACC TGTTCCGGCG GTATTGCGAG CATCCCGCCG ACCTGCCGCC GGACTGGCTG
CCGGCCGATG GCCCAGTGGC CGAATGCGAG GCCGACCGCT TTCGCCGGAT CCGTAATTTC
ATCGCCGGCA TGACCGACCG CTACGCTTTG ACCGAACATC AGCGGCTTTT TGACTCGACC
CCGGATTTGC GTTAG
 
Protein sequence
MSVGMAAPRA AFSCDPDRSR GRQFAEPPSS NRSAFRRDCD RVIHSNAFRR LKHKTQVFVF 
HEGDHYRTRL THSLEVAQIA RAIARQLGLD EDLTETLALA HDLGHPPFGH AGERALDACL
RDHGGFDHNA QTLRVLTALE HRYPGFDGLN LTWETLEGVV KHNGPLTDRT GAPLPRHAER
GVPIGIAEFS QRFDLEIWSF ASLEAQVAAL ADDIAYDAHD IDDGLRAGLF RVDDLRAVPL
TAALIDGISR RYPALGESRR GAELVRELIS HLIGAVTAET MRRLGEAAPR SVEDVRHAST
AMVAFPSETA VAEAEIKAFL WTHMYRAERV MAVMRDAEAI VADLFRRYCE HPADLPPDWL
PADGPVAECE ADRFRRIRNF IAGMTDRYAL TEHQRLFDST PDLR