Gene Mboo_2020 details

Gene Information

Locus tag: Mboo_2020
Symbol:
ID: 5411825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2093698
End bp: 2094675
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 62%
IMG OID: 640869262
Product: D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding
Protein accession: YP_001405177
Protein GI: 154151559
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID:
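
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes inclusive, 1-based coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2093698, 2094675)
print(length, protein_length(length))  # 978 325
```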


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0306614
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC TCTTCTGCGG CGAGGGCTTC CCCGAGGCAA GGAAGCAGCT GGCCGCACTC 
CTCCCGGACG ACGAGATCCT TGCCTTCCCG CCAGACCAGA TCGGTTCCCA TATCGCGGAT
GCAGATATTG TCGTGCCGAC CGTCAACCGG GTAGACGAGG CCCTCATGAA AAAGGGGCAT
TTTGCGTTCA TCCAGCAGTT CGGGGTGGGC CTTGAGGGAG TGGACATCGA AGCGGCCACA
AGAAACGGCA TCCGGGTGGC CCGTATCCCC AGTGAAGAGT CCGGCAATGC CGCATCGGTT
GCCGAACACG CCATCCTCTT CATGCTTATG CTCTCGCGGA ACTGGAACCG GCTTGCCCGG
GCACGGGAAG AGAACAAACC CCTCCCGTGG GGTTCCCCCG AAGGTGTGGC GCTCCGGGGA
AAAACAGTCT GCATCGTGGG CCTGGGCGGG ATCGGAAGGG AGCTGGCCCG CCGGCTCGCC
GGCTTTCAGG TCCGGATCGT GACTGCAGAC GACCATGCCG ATCGCACGGT GCCGGGCATA
GAGATCGCCC GCCGGTATAC CCTTGCCGAA CTTCCCGCAG CTGTGGCAGG GGCCGACTAC
GTGGTGCTCT CGCTCAACTA CACGCCGGAC CGGTACCACC TTATCGGCAA AGCCGAAATT
GCTGCCATGA AACGCGGGGT CTATCTCATC AATGTGGCCC GTGGCGGCCT CCTTGATGAG
CATGCCCTGC TTACAGCACT AAAAAGCGGG CAGGTGGCCG GCGCCGGTCT TGATGTTTTC
TGGGAGGAGC CGGTGGACCC GAACCATCCG ATCTTTAAAG AGAACGTGAT CGCCACTCCC
CATACCGGAG GGGTCACGGA CGTCTCGTAC GAGGGTATTT CCCGGGCCTT TGCAGAGAAT
GTGAAACGGT ATGCGGCCGG GGAAAAGCCC CGGTATCTCG CAAACGATCC TACAACGACC
CGGCGCAGGG TACCGTAG
 
Protein sequence
MKILFCGEGF PEARKQLAAL LPDDEILAFP PDQIGSHIAD ADIVVPTVNR VDEALMKKGH 
FAFIQQFGVG LEGVDIEAAT RNGIRVARIP SEESGNAASV AEHAILFMLM LSRNWNRLAR
AREENKPLPW GSPEGVALRG KTVCIVGLGG IGRELARRLA GFQVRIVTAD DHADRTVPGI
EIARRYTLAE LPAAVAGADY VVLSLNYTPD RYHLIGKAEI AAMKRGVYLI NVARGGLLDE
HALLTALKSG QVAGAGLDVF WEEPVDPNHP IFKENVIATP HTGGVTDVSY EGISRAFAEN
VKRYAAGEKP RYLANDPTTT RRRVP
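
The protein sequence can be recovered from the gene sequence by codon translation. The following is a minimal sketch, assuming the gene starts with ATG and contains only unambiguous bases (A, C, G, T); it uses the amino acid assignments that NCBI translation tables 1 and 11 share, and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with the amino acids shared by
# translation tables 1 and 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAGATCC TCTTCTGCGG CGAGGGCTTC"))  # MKILFCGEGF
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields of this record.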