Gene GSU1246 details

Gene Information

Locus tag: GSU1246
Symbol:
ID: 2686634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1347775
End bp: 1348914
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 62%
IMG OID: 637125920
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: NP_952299
Protein GI: 39996348
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
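
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with exactly one stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1347775
END_BP = 1348914
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1140
print(expected_protein_length(GENE_LENGTH_BP))  # 379
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.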


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.641773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGATCCC TGGAGCGGGC TGATCTGGCC GGCTATGCGG CTCGCAGTTG CCGTTCGCGG 
GGGCGGATGC ACCCGGAAGA GTTCCGGGAC GACCGCCCCG CCTTCGAGCG GGACCGGGAC
AGGATCATCC ACTGTGCGGC GTTCAGGAGG CTGGAGTACA AAACTCAGGT CTTCGTGAAC
CATGAGGGGG ACTACTACCG CACCCGGTTG ACCCACTCCC TGGAGGTGGC CCAGATCGGC
AAGGCCATTG CCCGTCGACT CGCCCTGAAC GAGGAACTGA CCGAGGCTCT GGCCCTGGCC
CACGACCTGG GACACACCCC CTTCGGGCAC ACGGGCGAGG AGGTGCTGAA CCGTCTGATG
GAAGGCTTCG GCGGCTTCGA GCACAATCTT CAGTCGTTCA GGGTGGTGGA CCAGTTGGAG
GAGCGGTACC CCGGCTTCAA CGGGCTCAAC CTTTCCTGGG AAGTGCTGGA AGGGATCATC
AAGCATTCAT CGCCCTACGA CCGGCCGACC GGTCTGATCG AGGGATTCCT GCCCGGCGTG
GTGCCGACCA TCGAAGCTCA GATCATCAAC TTCGCCGATG AGATAGCCTA CAACAATCAC
GATATCGACG ACGGTCTCAA GTCGGGTTAC ATTACGATTG AGCAACTCAA CGGGGTTGAC
CTCTGGCGTG AGGTTTGGGA GAGGATCGAT ACCGCCCATC CCGGCCTGGA TCGGGAGCGG
AAGAAGTTCC AGACCATAAG CGCGCTGATC GGTCTCCTCA TCAGGGACCT GATTACTGCC
ACCGAGGCGA ATCTGCGTGC TTACGGCGTC TCCACCCTTG ACGACGTGCG GCGGGTCAAC
CGCCCCCTGG TGACCTTCTC GTCCGCCATG GAGGAGCGGA ACCGTTCCCT TAAGCGGTTC
CTGTTCACAA ACCTGTACCG GCACCACAAG GTGGAGCGGA TGCGGGTCAA GGCGGAGCGC
TATCTGACGC AGCTGTTCGA GAGTTACGTG AAGCACCCGA CGCTGCTCCC CCGCAAGTAC
CAGCAGAAGA TGGATACGCT GGGACGCGAG CGCGTGGTCT GCGACTACAT CGCCGGCATG
ACCGACCGCT TCGCCCTTGA TGAGTTCAAG CGTTTGTTCG AGCCTTACGA GCGCGTCTGA
 
Protein sequence
MRSLERADLA GYAARSCRSR GRMHPEEFRD DRPAFERDRD RIIHCAAFRR LEYKTQVFVN 
HEGDYYRTRL THSLEVAQIG KAIARRLALN EELTEALALA HDLGHTPFGH TGEEVLNRLM
EGFGGFEHNL QSFRVVDQLE ERYPGFNGLN LSWEVLEGII KHSSPYDRPT GLIEGFLPGV
VPTIEAQIIN FADEIAYNNH DIDDGLKSGY ITIEQLNGVD LWREVWERID TAHPGLDRER
KKFQTISALI GLLIRDLITA TEANLRAYGV STLDDVRRVN RPLVTFSSAM EERNRSLKRF
LFTNLYRHHK VERMRVKAER YLTQLFESYV KHPTLLPRKY QQKMDTLGRE RVVCDYIAGM
TDRFALDEFK RLFEPYERV
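
The sequences above are printed in space-separated groups of ten characters. As a minimal sketch of how such a block might be processed, the code below strips the whitespace, computes GC content, and translates codons to amino acids. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of any database tool. The codon lookup uses the standard amino acid assignments, which translation table 11 shares; table 11 differs in which codons may act as start codons, and this sketch does not model that. Ambiguous bases such as N are not handled and will raise `ValueError`.

```python
# Codon lookup in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons; stop codons are written as '*'."""
    s = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(s) - len(s) % 3, 3):
        c1, c2, c3 = s[i:i + 3]
        index = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First three groups of the gene sequence in this record.
first_groups = "ATGCGATCCC TGGAGCGGGC TGATCTGGCC"
print(len(clean(first_groups)))  # 30
print(translate(first_groups))   # MRSLERADLA
```

The translated prefix matches the first group of the protein sequence listed above. Applying `translate` to the full gene sequence would also emit a trailing `*` for the final TGA stop codon, which the protein sequence omits.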