Gene Sfum_0190 details


Gene Information

Locus tag: Sfum_0190
Symbol:
ID: 4461540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 225125
End bp: 226159
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 60%
IMG OID: 639700945
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_844327
Protein GI: 116747640
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
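
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 225125
end_bp = 226159

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1035 bp
print(protein_length, "aa")  # 344 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.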


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.230443
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGCCACG AGTTGGAGGA ACGTGAAGCA CGGTTTCTCG ATCCCCGTGC GCAGCTCAGC 
AGGGAGACCA GAGGACGGCT GAAACCCGAG ACGGAATGCA CTCTGCGCAC GGCCTACCAG
CGGGACCGCG ATCGAATCGT TCACTGCAAG GCTTTCAGAA GGCTGAAACA CAAGACTCAG
GTCTTCCTGT CTCCGACGGG AGACCACTAC CGTACGCGGC TCACGCACAC CCTGGAAACC
TCCCAGATTG CCCGCACCAT CGGCAGGGCG CTGGCCTTGA ACGAAGATCT TATCGAGGCC
GTCGCGCTCG GACACGACTT GGGCCACACG GCGTTCGGCC ACGGGGGCGA GAGCGTGCTC
AACGATCTCG TTCCCGGAGG TTTCTTTCAC AACGAGCAGA GCCTGCGCAT TGTCGACATC
CTCGAAAAAA ACGGGGAGGG GCTCAATCTC ACCCACGAAG TGCGCGACGG CATCCTCAAA
CATTCCAAGG GGCGCGCGGA TCCGATCCTG CTCGACCCTG AAGCCAGAGC GGAAACGCTG
GAAGGTCAGG TGGTCCGGGT TGCGGACATC ACGGCTTATC TCAACCATGA CCTGGACGAC
GCCCTGAGGG CTGAAATCCT CAGTGCCGAT GCCATTCCCC CCGATATCCG GATGCACCTC
GGAGCCCGTC ATTCTCAGCG CATCCACGCG ATGGTCGAGG ATGTCATTCA CTCGACCCTG
GAGGGCGATC TCATCGAAGT GCGCATGAGC GAGGCGATGC TCGCCCGGGT TGACCAGCTC
AGGGAGTTCC TTTTCGAGCA CGTTTACGAT CTGCCTCAGG TCAGGGAAGA ATTCAGGCGC
GTCCGGAAGA TCATCGAGGA TCTCTTCGAC GTACTGATGA AGGATGATGC GGTGTTTCGG
GAAGAGATCG GCACGCCGCG CGACGGCACG CTCAAGGAGC GGCAGGTGTA CGACCATATC
GCGGGAATGA CTGACCGTTA CGCTCTCGAC CTGTACAAAA AGATCTTTCT TCCCAAGCCA
TGGATGAAAC TGTGA
 
Protein sequence
MRHELEEREA RFLDPRAQLS RETRGRLKPE TECTLRTAYQ RDRDRIVHCK AFRRLKHKTQ 
VFLSPTGDHY RTRLTHTLET SQIARTIGRA LALNEDLIEA VALGHDLGHT AFGHGGESVL
NDLVPGGFFH NEQSLRIVDI LEKNGEGLNL THEVRDGILK HSKGRADPIL LDPEARAETL
EGQVVRVADI TAYLNHDLDD ALRAEILSAD AIPPDIRMHL GARHSQRIHA MVEDVIHSTL
EGDLIEVRMS EAMLARVDQL REFLFEHVYD LPQVREEFRR VRKIIEDLFD VLMKDDAVFR
EEIGTPRDGT LKERQVYDHI AGMTDRYALD LYKKIFLPKP WMKL
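
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon assignments of translation table 11 (identical to the standard code for elongation codons) and simplifies start-codon handling by always reading the first codon as methionine, which is how the leading TTG here corresponds to M. The function name `translate_cds` is hypothetical, not part of any library.

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").replace("\n", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[index:index + 3]]
        if index == 0:
            # Simplifying assumption: the first codon is a valid initiator
            # (such as TTG) and is therefore read as methionine.
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "TTGCGCCACG AGTTGGAGGA ACGTGAAGCA CGGTTTCTCG ATCCCCGTGC GCAGCTCAGC"
print(translate_cds(first_60))  # MRHELEEREARFLDPRAQLS
```

The output matches the first 20 residues of the protein sequence. Note that the second TTG in this stretch (the fifth codon) is internal and is read as leucine, as expected.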