Gene Mboo_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2019 
Symbol 
ID5411824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2092440 
End bp2093582 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content59% 
IMG OID640869261 
Productaldo/keto reductase 
Protein accessionYP_001405176 
Protein GI154151558 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0479916 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTACA GGAGATTCCC TAAGGTCCAC CAGGATATAT CGATCCTCGG GTTCGGGTGT 
ATGCGCCTGC CGGTGCTCGA AAACCAGCAG ATAAACGAAC CGCTTGCAAC AGAGATGGCG
CGGTACGCCA TCGACCACGG TGTCAACTAC GTGGACACCG CGTACCCCTA CCACAACGGG
GAGAGCGAAC CATTCGTGGG CCGGGCGCTT GCCGACGGCT ACCGCGAGAA AGTCATGCTC
GCCACCAAGC TCCCGAGCTG GCTGATCACA AAACCCGGGG ACATGGACAA GTACCTCAAC
GAGCAGCTTG CCCGCCTTGC CACCGACCAT ATCGACTTCT ATCTCGTCCA CGGGCTTAAC
GCGGCCACCT GGAAGGCCAC AAGCGAAGCG GGTGTGCTCG ACTTCCTCGA CGATGCAATA
GACGACGGGC GGATCCGGTA CCCCTGTTTC TCGTTCCACG CCGCCCTCCC GCTCTTTAAG
GAGATCGTAG ATGCCTATGA CTGGACCTTT GCCCAGATCC AGTACAACTT CATGGACGAA
CAGTACCAGG CGGGAACCGA AGGCTTGCAG TATGCGGCAA AGAAAGGCAT CGGGATCGTG
GTGATGGAAC CCCTCCGGGG AGGGCTTCTC GCAAAAGAGA TCCCGGCAAC AAAGGATATC
CTTGCACATG CCCCCGTGCA GCGCACCCCT GTGGAGTGGG GCCTGCGCTG GGTCTGGAAC
CATCCTGAAG TCACTGTTGC GCTCTCGGGC ATGTCTGCGA TGGAGCAGGT GGTTGAAAAT
ATTGCCTGCG CAGAACAGGG AAAGGCCGGC TCGCTCTCAA AAGACGATCT TGCCGTTATC
GCTAATGTGA AAAAGGCGCT CGCAGAACGG GTGAAGATCC CCTGCACCGG CTGCCGGTAC
TGCACCCCCT GCGAGAACGG GGTCGGGATT CCCGAGTGCT TTGAGTTCTA CAACCAGGCG
CACATCTACG ACGCAAAGGA ACACGCCGGC GGGATCTACG GATGGGCCTT AAGCGGGATC
TTCGGGGGCA TCCCGGCATA TGCCTCCTGC TGCACCGAAT GCGGGGCCTG CGAGGAAAAG
TGCCCCCAGG GCCTCCCGAT CAGAAAGCAC CTCAAAGAGG TTGCAGAATT TTTCGGGAAA
TAA
 
Protein sequence
MLYRRFPKVH QDISILGFGC MRLPVLENQQ INEPLATEMA RYAIDHGVNY VDTAYPYHNG 
ESEPFVGRAL ADGYREKVML ATKLPSWLIT KPGDMDKYLN EQLARLATDH IDFYLVHGLN
AATWKATSEA GVLDFLDDAI DDGRIRYPCF SFHAALPLFK EIVDAYDWTF AQIQYNFMDE
QYQAGTEGLQ YAAKKGIGIV VMEPLRGGLL AKEIPATKDI LAHAPVQRTP VEWGLRWVWN
HPEVTVALSG MSAMEQVVEN IACAEQGKAG SLSKDDLAVI ANVKKALAER VKIPCTGCRY
CTPCENGVGI PECFEFYNQA HIYDAKEHAG GIYGWALSGI FGGIPAYASC CTECGACEEK
CPQGLPIRKH LKEVAEFFGK