Gene AFE_0731 details

Gene Information

Locus tag: AFE_0731
Symbol:
ID: 7134939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 23270
Kingdom: Bacteria
Replicon accession: NC_011761
Strand:
Start bp: 660107
End bp: 661306
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 63%
IMG OID: 643529137
Product: deoxyguanosinetriphosphate triphosphohydrolase, putative
Protein accession: YP_002425219
Protein GI: 218666601
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
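
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 660107
END_BP = 661306


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1200 399
```

Under these assumptions the computed values agree with the reported Gene Length (1200 bp) and Protein Length (399 aa).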


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCGCGGC GCAGGAATCG TCCGGGCATG CCTGATCTGG CCTCTCCGGG ACAGCCCGGG 
GGCTTGGCAC CATACGCCGC CGACCCCGCG CAGAGCCGTG GCCGCAAACA CGCAGAAGAG
CCGCCCGCCG GTCGCAGCGC CTTTCAGCGC GACCGCGACC GGGTCATCCA CTCCGGCGCA
TTCCGCCGTC TGGAATACAA GACCCAGGTC TTTGTGAATC ACGAGGGAGA TCTCTACCGC
ACCCGCCTCA CCCACAGCCT GGAAGTCGGA CAGGTGGGGC GCGCCATCGC CCGTCAGTTG
GCCTTGAACG AGGATCTGGT CGAAGCCATC GCTCTGGCCC ACGATCTGGG TCACACCCCT
TTCGGTCACG CCGGACAGGA TGCCCTGCAG GAATGCATGG CCGGCCTAGG CGGTTTCGAG
CACAACATCC AGTCCCTGCG AATCGTGGAC CATCTGGAAA AGCGCTACGG CGACTTTCCA
GGCCTGAACC TGACCTTCGA AACCCGCGAG GGCATTCTCA AGCATTGCGC CAAAAATAAA
GCCCACGACC TGGGCGACGT GGGTGAACGC TTTTTGCGGG GCTCCCAGCC CAGCCTGGAG
GCGCAGATCA CCAATCTCGC GGATGAGATC GCCTATAACA ACCATGATAT CGATGACGGC
CTGCGCGCCG GTCTGCTGAA CCGCGAAGAA CTCTGCGCCA ACACGTTATA CGGGACGATT
TTCGCCGAGG TGCAGCAACG CTTTCCCGAC GCTTCCGCGA CCATATTGCG TCACGAAACC
GGCCGCCGCC TCATCAATCT GCTCATCCAC GATCTGGTCA GTGAGACCCG GCGCCGCATC
GCGGCCGAAG GCGTCACCAC CATAGGGGAA GTGCGGGCCG CGTCGCAGCC GCTCGCCGCC
CACAGTCCGG CCATCGCCCA CGAAGTCGCC ACCCTCAAAC GCTTTCTCTT TCAGCGCATG
TATCGCCACC CCCGGGTGCA CCGTCAGGCG GAAAAAGCCA AGCGCCTGGT ACGCAAGCTG
TTCCATACCC TGCTCGAAGA TCCGCGGCTG CTGCCATTGA AATACCAGGT ACGCATCAAT
GAATGCGACG GCGCAGCGAA GCGTCAGGCG CGTGTCGTTG CCGACTATGT GGCGGGCATG
ACCGACCGCT TTGCCATCGC CGAGTATGAC CGTCTGTTTG ACCCGCACGG CGAGGCCTGA
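
The GC content field of the record can in principle be recomputed from the gene sequence above. The sketch below shows one way to do so, assuming the sequence is pasted as plain text with spaces and line breaks between blocks of ten bases. The function name `gc_content` is hypothetical, and the short input used here is only the first two blocks of the sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used as a small example.
fragment = "GTGCCGCGGC GCAGGAATCG"
print(f"{100 * gc_content(fragment):.0f}%")  # prints: 75%
```

Passing the full gene sequence instead of the fragment would give the value to compare against the reported GC content.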
 
Protein sequence
MPRRRNRPGM PDLASPGQPG GLAPYAADPA QSRGRKHAEE PPAGRSAFQR DRDRVIHSGA 
FRRLEYKTQV FVNHEGDLYR TRLTHSLEVG QVGRAIARQL ALNEDLVEAI ALAHDLGHTP
FGHAGQDALQ ECMAGLGGFE HNIQSLRIVD HLEKRYGDFP GLNLTFETRE GILKHCAKNK
AHDLGDVGER FLRGSQPSLE AQITNLADEI AYNNHDIDDG LRAGLLNREE LCANTLYGTI
FAEVQQRFPD ASATILRHET GRRLINLLIH DLVSETRRRI AAEGVTTIGE VRAASQPLAA
HSPAIAHEVA TLKRFLFQRM YRHPRVHRQA EKAKRLVRKL FHTLLEDPRL LPLKYQVRIN
ECDGAAKRQA RVVADYVAGM TDRFAIAEYD RLFDPHGEA
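
The protein sequence begins with M although the gene sequence begins with GTG rather than ATG. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this with a plain-Python translation. The names `translate_cds` and `START_CODONS` are hypothetical, and the start-codon set is an illustrative subset, not the complete list for table 11.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of the initiation codons used in table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiation codon as M and stopping at '*'."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiation codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCGCGGC GCAGGAATCG TCCGGGCATG"))  # prints: MPRRRNRPGM
```

The printed residues match the first ten residues of the protein sequence listed above.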