Gene Mboo_1960 details


Gene Information

Locus tag: Mboo_1960
Symbol: egsA
ID: 5409997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2023825
End bp: 2024904
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 55%
IMG OID: 640869201
Product: NAD(P)-dependent glycerol-1-phosphate dehydrogenase
Protein accession: YP_001405118
Protein GI: 154151500
COG category: [C] Energy production and conversion
COG ID: [COG0371] Glycerol dehydrogenase and related enzymes
TIGRFAM ID:
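
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2023825
END_BP = 2024904
GENE_LENGTH_BP = 1080
PROTEIN_LENGTH_AA = 359


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With these values, 2024904 - 2023825 + 1 gives 1080 bp, and 1080 / 3 - 1 gives 359 aa.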


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.990143
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCCAG ATGCTATAAA ATTACTCAAA GAGAAGGTCT TTGACAAGTC CAAATGGATG 
CAGTTACCCC GTGACGTGGT GATCGGTCAC GATGTCCTCG GCCAGATCGC TCCTGTGTGC
GAAGATCTCA AACTGGGCAG GTCCGCCCTT TTGATCTCCG GCAAAAACAC GATGGACCGG
GCCGGAAAAA CGGTTCAGGA TGTGATCGGG AAGACCTGTG ACGTAATGGT CTATATCTCC
GATGAGATCA GCCCGGCTGT TATCAAGGAT GCCGAGAAGG CGGCAAAGGA CGTAGACTTT
GTGATCGGTG TCGGAGGCGG CCGGGTTATC GATACGGCAA AGATCGTATC GTACAACCTT
GATCGACAGT TCGTGTCGGT ACCAACCGCT GCCTCCCACG ATGGCATTGC ATCAGCCCGG
GCATCGGTAC CTACCGGCGA AGGTAACGTT TCCCTTGAAG CTCATCCCCC CATAGCGATC
ATTGCAGATA CCTGCATCAT TGCATCTGCC CCTCACCGTC TCCTTGCGGC CGGGTGCGCT
GATGTGATCT CCAATTACAC AGCGATCCTT GACTGGGAGA TGGCGCACCG GATCAAGGGT
GAACCCATGA GTGAATATGC AGTGGCCCTT TCCAAAATGA CTGCAGAGAT TCTGGTGAAG
AATGCCGATC TCATCCGGCC AAACCAGGAA CAATCGGCAT GGTTTGTCAC CAAGGCCCTT
GTGTCGAGCG GGGTGGCTAT GAGCATTGCC GGATCGTCCC GGCCGGCCAG CGGCGGCGAG
CATAAATTCT CCCATGCGCT CGACCGGCTT GCCCCCAATA AAGCTCTTCA CGGTGAAAGC
TGTGGGATAG GGACGATCAT ATCGATGTAT CTCCATGGGG GCGACTGGCG GGGAATCCGT
CAGTCCCTCC GGACGATCGG TGCCCCGGTA ACACCGACTG ACGTTGGCAT AGCGGATGAG
ATTGCGGTAG AGGCGCTCCT TATGGCAAAG ACTATCCGCC CGGAGCGCTT TACCATCTTT
GACATGGGTA TTACCCGGGA CTCCGCGGAG AAGCTTATCC AGATGCTTTA TGCGGATTGA
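
The GC content field in the Gene Information section can be recomputed from the gene sequence as listed here. This is a minimal sketch: the helpers `clean_sequence` and `gc_percent` are hypothetical names, and how the record rounds its percentage is not stated, so the result may differ from the listed value in the last digit.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to print the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence (all lines) into one string.
# Only the first line of the record is shown here for brevity.
first_line = "ATGAGCCCAG ATGCTATAAA ATTACTCAAA GAGAAGGTCT TTGACAAGTC CAAATGGATG"
print(len(clean_sequence(first_line)))  # 60 bases per printed line
print(round(gc_percent(first_line), 1))
```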
 
Protein sequence
MSPDAIKLLK EKVFDKSKWM QLPRDVVIGH DVLGQIAPVC EDLKLGRSAL LISGKNTMDR 
AGKTVQDVIG KTCDVMVYIS DEISPAVIKD AEKAAKDVDF VIGVGGGRVI DTAKIVSYNL
DRQFVSVPTA ASHDGIASAR ASVPTGEGNV SLEAHPPIAI IADTCIIASA PHRLLAAGCA
DVISNYTAIL DWEMAHRIKG EPMSEYAVAL SKMTAEILVK NADLIRPNQE QSAWFVTKAL
VSSGVAMSIA GSSRPASGGE HKFSHALDRL APNKALHGES CGIGTIISMY LHGGDWRGIR
QSLRTIGAPV TPTDVGIADE IAVEALLMAK TIRPERFTIF DMGITRDSAE KLIQMLYAD
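
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the usual TCAG-ordered amino acid string; for sense and stop codons, translation table 11 assigns the same amino acids as the standard code, and the two differ only in which codons may act as start codons. The function name `translate` is illustrative, and the sketch assumes that the input is already in the coding orientation and in frame.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGCCCAG ATGCTATAAA ATTACTCAAA"))  # MSPDAIKLLK
print(translate("TATGCGGATTGA"))                      # YAD*
```

The two fragments reproduce the first ten residues (MSPDAIKLLK) and the last three residues (YAD) of the listed protein, followed by the TGA stop codon.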