Gene RPC_2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2507 
Symbol 
ID3971089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2718005 
End bp2719219 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID637925615 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_532377 
Protein GI90424007 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0639278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGTCG GAATGGCAGC TCCTCGCGCG GCGTTCGCCT GCGATCCCGA CGCCAGCCGC 
GGCCGGCTGT TCGACGAACT GCCGAGCAAG ACCCGCAGTC CGTTCCGGCG CGATTGCGAC
CGGGTGATCC ATTCCACCGG GTTTCGCCGG CTGAAGCACA AGACCCAGGT GTTCGTCTAT
CACGAGGGCG ACCACTATCG CACCCGGCTG ACGCATTCGC TGGAGGTGGC GCAGATCGCC
CGCGCCATCG CCCGCCAACT CGGGCTCGAC GAAGACCTTA CCGAAGCGCT GGCGCTGGCG
CACGACCTCG GCCATCCGCC GTTCGGCCAC GCCGGCGAAC GCGCGCTCGA CGCCTGCCTG
CAGCGCTATG GCGGCTTCGA CCACAACGCC CAGAGCCTGC GCGTGGTGAC GGCGCTGGAG
CATCGCTATC CGGAGTTCAA CGGCCTCAAT CTGACTTGGG AAACGTTGGA GGGCATCGTC
AAGCACAACG GCCCGCTGAC CGACCGCAGC GGCGCGCCGC TCGGCCGCTA TCAGGCGCAT
GGCGTGCCGA CCGGCATCGT CGAATTCAAC CGCTGTTTCG ACCTGGAATT GTGGAGCCAC
GCCTCGCTCG AGGCGCAGGT CGCCGGCATC GCCGACGATA TCGCCTATGA CGCCCACGAC
ATCGACGACG GGCTGCGCGC CGGGCTGTTC GGCGTCGACG ATCTCGGCGA GATGCCGCTG
ACCGCGCAGA TGACCGCCGC GATCGACGTC CGCTATCCCG GGCTCGACCC GGCACGGCGC
GGCGCCGAAC TGGTGCGCGA GCTGATTTCG TTCTTGATCG GCGCCGCGGT GGCCGAAGCC
GAGCGGCGGT TGATCGCGGC GCAGCCCGCC TCGGTGCAGG CGGTGCGCGA GGCCGGCCAG
GATCTGATCA TGTTCGCGCC GGACGCCGCC GAAGCCGAAG CGCTGATCAA GGCGTTCCTG
AAGCGCCACA TGTATCGCCA TCCGCGGGTG ATGCGGGTGA TGGACGACGC CGAGACCGTG
GTGTTCGAGC TGTTCGCCCG CTACCGCGAC CATCCGGCGG ATCTGCCGGC GGAATGGCTG
CCGGCGAACG CCGGGCAGGG CGAAACCGAG GCGGATCGGC TGCGCCGAAT CTGCAATTTC
ATCGCCGGCA TGACCGACCG CTACGCGCTG ACCGAGCACC AACGGCTCTT TGACTTAACG
CCGGAATTGC GTTAG
 
Protein sequence
MSVGMAAPRA AFACDPDASR GRLFDELPSK TRSPFRRDCD RVIHSTGFRR LKHKTQVFVY 
HEGDHYRTRL THSLEVAQIA RAIARQLGLD EDLTEALALA HDLGHPPFGH AGERALDACL
QRYGGFDHNA QSLRVVTALE HRYPEFNGLN LTWETLEGIV KHNGPLTDRS GAPLGRYQAH
GVPTGIVEFN RCFDLELWSH ASLEAQVAGI ADDIAYDAHD IDDGLRAGLF GVDDLGEMPL
TAQMTAAIDV RYPGLDPARR GAELVRELIS FLIGAAVAEA ERRLIAAQPA SVQAVREAGQ
DLIMFAPDAA EAEALIKAFL KRHMYRHPRV MRVMDDAETV VFELFARYRD HPADLPAEWL
PANAGQGETE ADRLRRICNF IAGMTDRYAL TEHQRLFDLT PELR