Gene Mboo_2077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2077 
Symbol 
ID5409786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2152073 
End bp2153188 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content57% 
IMG OID640869322 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001405234 
Protein GI154151616 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAAAG TCGCGGCAAT AGGCGGGGAC GGGATCGGCC CGGAGATAGT TGCAGAAGGA 
AAGAAGGTGC TTGAAGCCGC CGGGGAGCGG TACCGCTTTG ATATTGACTG GACGGATTTT
GATATCGGGG CGGACAGGTA TCTCGCTACA AAGAAACTCC TCACCGAAGA CGACCTTGCC
GAACTCAAAA AATTCAAGGC GATCTACTTT GGTGCCATCG GCGACCCCCG GGTTCCACCG
GGCATCCTTG AGAAGGGGAT CCTTCTTGCC CTCCGGTTCT CGTTTGATCA GTACGTGAAC
CTGCGGCCGA TCCGGCTTCT TGAGGGCGTT GAAACACCGC TTGCCAACAA GACCCCAAAG
GACATCGATT TTGTCGTTGT CCGTGAGAAT ACCGAGGATT TCTATGTCGG GATCGGTTCC
CGGTTCAAGA AACACCAGAA GACCGAGCTT GCCGTTGTCC GCGACCTCTA CAACGTGAAG
TTCGGGCTCG ACATCGAGAC CGATGCCGAG GAACTCGCGT ACCAGATCGG GGTCGTGACC
CGGGAAGGTT CGCGCCGGGT CCAGACCTAT GCCTTTGACC TCGCCACAAA ACGCCGCAAA
AAGCTCACTT CGGTTGACAA GGCAAACGTC CTCTCAGACG TATACGGGCT CTGGCGGGAC
GTGTTTACCG AGACCGCAAA GAAGTATCCC GAAGTTGTCA CCGACTTTAA CTTCGTTGAC
GCGGTTACGA TGTGGTTTGT GAAAAACCCC GAGTGGTTCG ATGTCGTGGT CACACCCAAC
ATGTTCGGCG ATATCATCAC CGATCTCGGC GCCATGATCC AGGGCGGCCT CGGCCTTGCC
CCGGGCGGGA ACATCAACCC GAAGGGCACC TCCATGTTTG AGCCGATCCA CGGATCGGCG
CCGAAGTACA AGGGCATGGA CGTTGCAAAC CCGATAGCAA CCGTCTGGGC CGGTTCGCTT
CTCCTTGATC ACTTGGGAGA GCACGCGGCG GCGGCGGCTG TTGTTTCGGC CATAGAGAGA
AGCATAAAGG ACGGTATGGT CACCAAGGAC CTCGGCGGGA CTGTGGGGAC AAAGAAGGCT
GGCAGTTATA TTGCGGATTG TGTCCGCCGT GGCTGA
 
Protein sequence
MYKVAAIGGD GIGPEIVAEG KKVLEAAGER YRFDIDWTDF DIGADRYLAT KKLLTEDDLA 
ELKKFKAIYF GAIGDPRVPP GILEKGILLA LRFSFDQYVN LRPIRLLEGV ETPLANKTPK
DIDFVVVREN TEDFYVGIGS RFKKHQKTEL AVVRDLYNVK FGLDIETDAE ELAYQIGVVT
REGSRRVQTY AFDLATKRRK KLTSVDKANV LSDVYGLWRD VFTETAKKYP EVVTDFNFVD
AVTMWFVKNP EWFDVVVTPN MFGDIITDLG AMIQGGLGLA PGGNINPKGT SMFEPIHGSA
PKYKGMDVAN PIATVWAGSL LLDHLGEHAA AAAVVSAIER SIKDGMVTKD LGGTVGTKKA
GSYIADCVRR G