Gene Rru_A1780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1780 
Symbol 
ID3835202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2071855 
End bp2073051 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content64% 
IMG OID637825877 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_426867 
Protein GI83593115 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.797351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCT GGACCCCAAA ATCCCCGAGT CTCGCGCCCT ATGCCAGCGA TCCGGCGACC 
AGCCGGGGCC GGCTTTACCC CGAAGCGTCG TCGCCGACGC GCTCGCCCCA TCAGCGCGAC
CGCGATCGCG TGCTGCATTC GGCGGCCTTC CGCCGCCTGA AATACAAGAC CCAGGTCTTC
GTCAATTCGG TCGGCGAGAA TTACCGCACC CGCCTGACCC ATAGCCTGGA AGTCTCGCAG
ATCGCCCGCT CGGTCAGCCG GGTTCTTGGT CTCAACGAGG ATCTGGCCGA GGCCCTGGCC
CTGGCCCATG ACCTGGGCCA CACCTGCTTT GGCCATGCCG GCGAGGATGC GCTGAAGGAC
TGCATGGCGG CCTATGACGG CTTTGACCAT AACGCCCAAT CGCTGCGCAT CGTCACCAAG
CTGGAGCGGC GCTATGCCGA ATTCGACGGC CTCAATCTGA CCTGGGAAAC CCTGGAGGGG
CTGGTCAAGC ACAACGGGCC GCTGATCCGC CCGGGGGAGG CGACGCTCCA GGACCTGCCC
GCGGCCATCC TCGAGTATGT CGACCGCCAT GACCTGGAGC TTTCGTCCTT CGCCGGACCC
GAGGCCCAGG TGGCGGCATT GTCCGATGAT ATCGCCTATA ACGCCCATGA TCTGGACGAT
GGCCTGCGCG CCGGTCTGTT TCCCTTGGAG GCGGTGATCG AGGTCCCGCT GGTCGGCCCC
TTGTTGCGCC ACGTGCTTGA TCGCTATCCC GGCATCGAAC CGTCGCGGGC GATCCATGAA
ACGGTGCGCC GGGTGATCAC CGCCATGGTC GATGACGTCT GCGCCGAAAG CGCCCGGCGG
CTGGAGCGGC AAAACCCCGG GTCGGCGGCC GAAGTGCGGG CGCTGGATGC GCCGGTGATT
GCCTTCAGCG AAGAAATGGC CCAAAAGGAC GCCGGATTGA AAGGTTTCCT TTTCCCCACC
CTTTATCGTC ACTACCGGGT GAATCGGATG ACCAGCAAGG CGCGGCGCGT CGTTCGCGAG
ATGTTCGGCC TGCTGGTCGA AGAGCCGATG CTGTTGCCCG ATGACTGGCG CGCCCGCACC
ACCCGCCCGC ACAGCCACAA AACCGCCCGT GTTGTTTGCG ACTACATCGC CGGCATGACC
GACCGCTTCG CGCTGGACGA ACATGCCAGA CTGTTTGATC CTTCGGTGAA ACCATGA
 
Protein sequence
MSIWTPKSPS LAPYASDPAT SRGRLYPEAS SPTRSPHQRD RDRVLHSAAF RRLKYKTQVF 
VNSVGENYRT RLTHSLEVSQ IARSVSRVLG LNEDLAEALA LAHDLGHTCF GHAGEDALKD
CMAAYDGFDH NAQSLRIVTK LERRYAEFDG LNLTWETLEG LVKHNGPLIR PGEATLQDLP
AAILEYVDRH DLELSSFAGP EAQVAALSDD IAYNAHDLDD GLRAGLFPLE AVIEVPLVGP
LLRHVLDRYP GIEPSRAIHE TVRRVITAMV DDVCAESARR LERQNPGSAA EVRALDAPVI
AFSEEMAQKD AGLKGFLFPT LYRHYRVNRM TSKARRVVRE MFGLLVEEPM LLPDDWRART
TRPHSHKTAR VVCDYIAGMT DRFALDEHAR LFDPSVKP