Gene Mboo_1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1969 
Symbol 
ID5410261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2031608 
End bp2032786 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content53% 
IMG OID640869209 
Productmetal-dependent phosphohydrolase 
Protein accessionYP_001405126 
Protein GI154151508 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.883701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAGC AGAGAGATCA GGACCCGGAT CGCAACGCAG CAATTTTCCG GTACACACGC 
GAACGCGAGG CTCTTCTCTC TCCTCGTGCC TCGCGCAGCG ACCAGGCGCT GAGACGCAAG
AGCCGCAAGC CGGAAGATAT ACGCACCCCG TACTCCCGGG ACGCGGACCG AATCCTCCAT
ACCCGGGCGT ACACGAGGTA CATTGACAAG ACTCAGGTTT TTTACCTTGT TGAAAACGAC
CACATCACCC ACCGGGTTAT CCATGTCCAG CTCGTTTCGA AGATCGCCCG CACGATCGGC
CGGTGCCTGC GCCTCAACGA AGACTTGATC GAGGCGATCG CGCTCGGACA CGATATCGGT
CACATACCTT ATGGGCATTT CGGCGAGGCC TGCCTTTCGG ACCTCTGCCT GGAGCACGGG
ATCGGGAAAT TTGCCCATAA CGTTCAGAGC GTAAGATCCC TGGACAGGAT TGAGGATCAG
GATCTGACCA TGCAGGTACT GGACGGGATC CTTTGCCATA ACGGGGAAGC CGAGGATTTG
CGTATGTCCC CGGAATCCTG CCCGGACTGG GCAACTTTTG ATCGGAAAGT CTGCGCAAAC
GAGACAGGTG GGCGGCCTGG ATCTCCGATG ACTCTTGAGG GATGTGTGGT AAAATTTGCC
GATACGATTG CATATATTGG CCGCGATCTC CAAGATGCAC AAGAAGTCGG GCTTATTAAA
AATCCCGGTG AGATTCCGCA GGAATGCCAA GAGGTATTTG GATCAGATAA CCGCGCTATA
ATCGATACCC TGATTCGTGA TCTACTGGAG AACAGCGATG CGGATGATAA ATGTTTTATC
TCCTACAGCA GGGAGGTAGA ACATGCACTC GCTACACTCC GGGCATTCTC CCGGCACACC
ATTTACAATA ACCCGAAACT GACCGCGGAG CGGGAAAAGA TCCGAACGAT GTACCGGGTT
CTGTTCTTAA CCTATCTTTC CGATATAGAA TCCGATCGGC GCAGCTCAAA AATATTCTCT
GATTTTATTA ATGCCCCATG GGTTAATCGG GAGTACCTTC ACACGACCCC GCCTGCCGGG
CTCACCCGTG ATTTTATTTC CGGGATGACC GATCGCTATT TCCTGAAACG ATTCGAGGAT
TGTGTAATTC CCCACAGAAT CGAAGGGGCA TTTCGGTGA
 
Protein sequence
MVKQRDQDPD RNAAIFRYTR EREALLSPRA SRSDQALRRK SRKPEDIRTP YSRDADRILH 
TRAYTRYIDK TQVFYLVEND HITHRVIHVQ LVSKIARTIG RCLRLNEDLI EAIALGHDIG
HIPYGHFGEA CLSDLCLEHG IGKFAHNVQS VRSLDRIEDQ DLTMQVLDGI LCHNGEAEDL
RMSPESCPDW ATFDRKVCAN ETGGRPGSPM TLEGCVVKFA DTIAYIGRDL QDAQEVGLIK
NPGEIPQECQ EVFGSDNRAI IDTLIRDLLE NSDADDKCFI SYSREVEHAL ATLRAFSRHT
IYNNPKLTAE REKIRTMYRV LFLTYLSDIE SDRRSSKIFS DFINAPWVNR EYLHTTPPAG
LTRDFISGMT DRYFLKRFED CVIPHRIEGA FR