Gene Mboo_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1203 
Symbol 
ID5411351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1218013 
End bp1219665 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content57% 
IMG OID640868430 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001404364 
Protein GI154150746 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0134222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATC TGCGAAGCGA CACCATACGG AAAGGGTACG AGCGGGCTCC GAATCGTTCT 
CTCCTGCGCT CGCTGGGAGT TACGGATCGG GAGATAGAAC TCCCGTTTAT CGGTATTGCC
AATGCATTCA ATACCATCGT GCCGGGTCAT ACGCACCTGC GTCAGCTTTC AGATAAGGTA
AAAGAGGGAA TTGCCGCGGC CGGGGGCGTC CCCTTTGAAT TTGGCGTGAT CGGCATTTGC
GATGGGATCG CGATGGGACA TGAAGGGATG CGGTACTCCC TTCCCTCCCG TGAAAATATC
GCTGACTCCA TTGAGCTCAT GGTACAGGCC CACCGGTTCG ACGGTCTTGT GTGTGTGGGT
ACCTGCGACA AGATTGTTCC CGGCATGCTC ATGGCTGCCG TCAGGACCAA TATTCCGACG
ATCGTTGTCA CCGGCGGAGC AATGCTTCCC GGCAGTAGCG GGGGTAAAGA TCTTTCACTC
ATTGATGTTT TCGAAGGAGT GGGAAAGGTT GCTGCCGGTA CCATGGAAGA AGATGCCCTT
AAGGAACTGG AATGCTGCGC CATGCCCGGC TGCGGGAGCT GCCAGGGGCT CTACACGGCA
AACACCATGG CCTGCATGAC CGAGACTATG GGCATGTCCC TGCCAGGCTG TGCAGCTGTT
CCTGCCGTGG AGGCGGCAAA ACTGCGGATC GCCCGGGAGA GTGGCGAAGC AATCATTCCC
CTGGTAAAGA AAAACAGTAC TGCCCGGGAT ATCGTGACCA AGAAGAGCCT GGAAAACGCA
ATCCGCGTGG ATATGGCATT AGGAGGATCA ACCAATACCG TACTGCACCT TATGGCGATT
GCAACCGAGG CTGAGATCCC TCTCTCTCTT GCAGACTTCA ACCGCATCGC AGATGAGATC
CCGCATATCT GCCACATGCT TCCGGCTGGC CCCTACTCTA TGCAGGCACT TTACAGGGCC
GGTGGTATAC CTGCTGTACT TAAGAGGCTG GAAAAACATC TTGACGACTG TCCGACCGTT
TCCGGTCTGT CTCTTTACCA GGTTGCACGG AATGCAATGA TCAAAAACGA GCAGGTAATA
AGATCCCTGG ATGCCCCGGT AAGTCCGGCC GGTGGGCTTC GCATACTCTT TGGTTCGCTT
GCTCCCGATG GCGCCGTGGT CAAATCTGCC GCTGTTCCAA AAGAGATCTG GAAACATACC
GGACCCGCCC GGGTCTTCGA GTCCGAGGAG CCTGCAATGG CAGCAATCCT TTCCCGGCAG
ATCCATGAAG GTGATGCGGT GATCATCAGG AATGAGGGGC CTCGTGGTGG GCCGGGGATG
CCCGAGATGC TTTCTGCAAC CTCGGCACTT ATGGGTGTGG GCTATAAAAA CGTAGTCCTG
ATCACTGACG GCCGGTTCTC TGGCGGAACC AGAGGGCCCT GTATCGGGCA TGTTGCACCT
GAGGCTGCTG TCGGTGGCCC GATTGCATTG GTGCAGGATG GCGACCGGAT TGCCGTGGAC
CTGTTTATGC GGACCATTGA CCTGCTGGTG GATCCAGAAG TCCTCACGTC CCGCAAGGCC
GCATGGAAAC CGGTGATGCG GCCGGTGACC GGTGTCCTTG CCCGCTATGC AAAGACCGTC
GGGCAGGCAA ACCTTGGTGC GGTGCTGAGA TAA
 
Protein sequence
MTDLRSDTIR KGYERAPNRS LLRSLGVTDR EIELPFIGIA NAFNTIVPGH THLRQLSDKV 
KEGIAAAGGV PFEFGVIGIC DGIAMGHEGM RYSLPSRENI ADSIELMVQA HRFDGLVCVG
TCDKIVPGML MAAVRTNIPT IVVTGGAMLP GSSGGKDLSL IDVFEGVGKV AAGTMEEDAL
KELECCAMPG CGSCQGLYTA NTMACMTETM GMSLPGCAAV PAVEAAKLRI ARESGEAIIP
LVKKNSTARD IVTKKSLENA IRVDMALGGS TNTVLHLMAI ATEAEIPLSL ADFNRIADEI
PHICHMLPAG PYSMQALYRA GGIPAVLKRL EKHLDDCPTV SGLSLYQVAR NAMIKNEQVI
RSLDAPVSPA GGLRILFGSL APDGAVVKSA AVPKEIWKHT GPARVFESEE PAMAAILSRQ
IHEGDAVIIR NEGPRGGPGM PEMLSATSAL MGVGYKNVVL ITDGRFSGGT RGPCIGHVAP
EAAVGGPIAL VQDGDRIAVD LFMRTIDLLV DPEVLTSRKA AWKPVMRPVT GVLARYAKTV
GQANLGAVLR