Gene RPD_2796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2796 
Symbol 
ID4023294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3114471 
End bp3115685 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID637962994 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_569925 
Protein GI91977266 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0211174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.07254 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGTCG GAATGGCAGC TCCTCGCGCA GCCTATGGTT GCGATCCGGA CCGCAGCCGC 
GGCCGGCAGT TCGCCGAGCC GCCGAGCAAC AACCGCAGTG CTTTTCGGCG TGATTGCGAC
CGGGTGATCC ACTCCAACGC CTTCCGCCGG CTCAAGCACA AGACCCAGGT CTTCGTGTTC
CACGAGGGCG ATCATTACCG CACCCGTCTG ACCCACAGCC TCGAAGTCGC CCAGATCGCC
CGGGCGATCG CGCGCCAGCT CGGGCTCGAC GAGGACCTGA CCGAGACGCT GGCGCTGGCG
CACGATCTCG GCCACCCGCC GTTCGGCCAT GCCGGCGAGC GCGCGCTCGA CGCCTGCCTG
CGCGCCCATG GCGGCTTCGA TCACAACGCT CAGACCCTGC GGGTGCTGAC CGCACTCGAA
CACCGCTATC CGGAATTCGA CGGGTTGAAC CTGACCTGGG AAACGCTCGA AGGCGTGGTC
AAGCATAATG GCCCGCTCAC CGATCGCGCC GGCCGGCCGC TGCCGCGCTA CGCCGAGCGC
GGCGTGCCGA TCGGGATTGT CGAGTTCAGC CAGCGCTTCG ACCTCGAGCT GTGGAGCTTT
GCCTCGCTCG AGGCCCAGGT TGCGGCGATT GCCGACGACA TCGCCTACGA CGCCCACGAC
ATCGACGACG GGCTGCGCGC CGGGCTGTTC CGGGTCGACG ATCTGCGCGC CGTGCCGCTG
ACCGCATCGA TCATCGACGG CATCGCACGG CGCTATCCGG CTCTCGACGA AAGCCGGCGC
GGCGCCGAGC TGGTGCGCGA GCTGATCTCG CATTTGATCG GCGCGGTGAC CGCCGAGACC
ATGCGGCGAC TGGGCGAGGC TGCGCCGCGC TCGGCCGAGG AGGTGCGTCA CGCCAGTTCG
GCGATGGTGG CGTTCCCGAT CGAGACGGCC GCCGCGGAAG CCGAGATCAA GGCATTCCTC
TGGACCCACA TGTATCGGGC AAACCGCGTC ATGGCGGTGA TGCGCGACGC CGAGGCGATC
GTCGCCGACC TGTTCCAGCG CTATTGCGAC CATCCGGCCG ATCTGCCGCC GGACTGGCTG
CCGACCGATG GGCCGGTCGC CGAGTGCGAA GCGGACCGGC TGCGACGGAT CCGCAATTTC
ATCGCCGGCA TGACCGACCG CTATGCGCTG ACCGAACACC AGCGGCTTTT TGACTCGACT
CCGGATTTGC GTTAG
 
Protein sequence
MSVGMAAPRA AYGCDPDRSR GRQFAEPPSN NRSAFRRDCD RVIHSNAFRR LKHKTQVFVF 
HEGDHYRTRL THSLEVAQIA RAIARQLGLD EDLTETLALA HDLGHPPFGH AGERALDACL
RAHGGFDHNA QTLRVLTALE HRYPEFDGLN LTWETLEGVV KHNGPLTDRA GRPLPRYAER
GVPIGIVEFS QRFDLELWSF ASLEAQVAAI ADDIAYDAHD IDDGLRAGLF RVDDLRAVPL
TASIIDGIAR RYPALDESRR GAELVRELIS HLIGAVTAET MRRLGEAAPR SAEEVRHASS
AMVAFPIETA AAEAEIKAFL WTHMYRANRV MAVMRDAEAI VADLFQRYCD HPADLPPDWL
PTDGPVAECE ADRLRRIRNF IAGMTDRYAL TEHQRLFDST PDLR