Gene Mext_3393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3393 
Symbol 
ID5835055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3762993 
End bp3764246 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content70% 
IMG OID641369192 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_001640850 
Protein GI163852807 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.366496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGTGC GGCAGAACGG CGAGCGCTGG CGGGCGCCCT ACGCCACCGA TCCGGCGGCG 
ACCCGCGGGC GGCTGATCCC CGAGGCGTTT TCGCCCACCC GCAGCGACTT CCAGCGCGAC
CGCGACCGGA TCATCCACTC CACCGCCTTC CGGCGGCTGA AGCACAAGAC GCAGGTCTTC
GTGCATCACG AGGGCGACCA TTACCGCACG CGGCTCACCC ACAGCCTGGA GGTGAGCCAG
ATCGCCCGGG CGCTCGCCCG CGCGCTCGGC CTCGACGAGG ATCTCGCGGA AGCGCTGGCG
CTGAGCCATG ACCTCGGCCA CACCTGCTTC GGGCATACCG GCGAGGACGC GCTGCACGCC
TGCATGGCCG AGTATGGCGG CTTCGACCAC AACGCCCAGG CTTTGCGCAT CGTCACCCGG
CTGGAGCGGC GCTATGCAGG CTTCGACGGC CTCAACCTGA CCTGGGAGAC GCTGGAGGGG
CTGGTCAAGC ATAACGGCCC ACTCCTCGAC GCCTCCGGCG CGCCGGTCCG GCGCTACGCC
GCCGACGGCA TCCCGGCGGC TGTGCTGGAA TACAACGCGA CGAACGACCT CGAACTGTCG
CGCTTCGCTG GGCCGGAGGC GCAGGGCGCC GCGCTCGCCG ACGACATCGC CTACGATGCC
CACGATCTCG ACGACGGCCT GCGAGCCGGG CTGTTCGATC TTGCCGACCT CACGGCCGTG
CCGTTCCTGG ACGGGTTGCT CGACGAGATC GACGCCCTGC ATCCCGGCCT GGAGCCGTCG
CGAAAAATCC ACGAGTTGGC GCGCCGGGTC ATCACGCGTT TCGTCGAGGA CGTGATCCGC
GAGAGCGAGT CGCGCATCGC CGCGCTGGCC CCCCGCAGCG TCGGCGACAT CCGCGCCGCG
CAGGAGCCGG TCATCGCCTT CTCGCCCGCC ATCGCCACAG CGGATGCGGA CATCAAGCGC
TTCCTGTTCG CGCGGATGTA CCGCCACCCG GAAGTGATGG CGGTGCGGGC CAAGGCCGCG
ACCATCGTCG ACGACCTATT CTCGGCCTTC TGCGCCGACC CCGCGCGGAT GCCGGCCGAA
TGGTCGGAGG GTCTGGAGAA TGCCAGCGAG GCCCGCCTCG CCCGGCGCAT CGCCGACTAC
ATCGCCGGCA TGACCGACAC CTACGCGGTG TTGGAGCACG GCCGGCTGTT TGCGGCGACG
CCCAACCTGC ACTGGAGCCC GCCGAGCCGC GGCCTGCCAC TGACGGAGCC GTGA
 
Protein sequence
MQVRQNGERW RAPYATDPAA TRGRLIPEAF SPTRSDFQRD RDRIIHSTAF RRLKHKTQVF 
VHHEGDHYRT RLTHSLEVSQ IARALARALG LDEDLAEALA LSHDLGHTCF GHTGEDALHA
CMAEYGGFDH NAQALRIVTR LERRYAGFDG LNLTWETLEG LVKHNGPLLD ASGAPVRRYA
ADGIPAAVLE YNATNDLELS RFAGPEAQGA ALADDIAYDA HDLDDGLRAG LFDLADLTAV
PFLDGLLDEI DALHPGLEPS RKIHELARRV ITRFVEDVIR ESESRIAALA PRSVGDIRAA
QEPVIAFSPA IATADADIKR FLFARMYRHP EVMAVRAKAA TIVDDLFSAF CADPARMPAE
WSEGLENASE ARLARRIADY IAGMTDTYAV LEHGRLFAAT PNLHWSPPSR GLPLTEP