Gene Mboo_1749 details


Gene Information

Locus tag: Mboo_1749
Symbol:
ID: 5409910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1829257
End bp: 1830321
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 47%
IMG OID: 640868984
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_001404909
Protein GI: 154151291
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
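
The two length fields in this record follow from the coordinates, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the values listed, not something the record states). A minimal Python sketch of that arithmetic; the function names are illustrative only:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1829257, 1830321)
print(length_bp)                  # 1065
print(protein_length(length_bp))  # 354
```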


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCC TTGTGACCGG CGCGGCCGGG TTTATCGGGA GCAATTTTGT TTATTATTAT 
CTTTCCCGCT ATCCGGAGAG AACGATAATC GGTCTTGATA ATCTTTCGTA TGCGGGAAAT
CTGGAAAACC TTTCTTTGTT ATCCCCTGAT CAAAAGGCGA GATTCGTCTT TGAGAAGGCC
GATATTACCG ACACCGTTCA GATAAAAAAG ATCCTTTCAA AATACCCTGT TGACGGTATC
ATAAATTTCG CTGCTGAGAC CCATGTGGAC AGGTCAATTC ATGACCCGCA GGTGTTCCTG
AAAACAAACA TCCTTGGCAC TCATGTTCTC CTGGATGCAG CAAAAACTAT CTGGCACACA
AAAGAAGGCG GCTGGGAAGA CGGGAAAAAA TTCCTGCAGG TGTCCACGGA TGAAGTGTAC
GGGACACTTG GGCCATCGGG GTACTTTACC GAAACAACGC CCCTGGACCC TCACAGTCCC
TATTCCGCGA GTAAAGCCTC TGCCGACCTC GTGGTAAAAG CGTATCACGA CACCTATGGA
ATGCCGGTGA ATATTACCCG GTGTTCCAAT AATTACGGGC CCTGGCAGTT CCCCGAGAAA
TTAATCCCTC TCCTGATACA GAACGCCCTT TTGCACCGGG AAATTCCCGT TTATGGCGAT
GGAAAACAGA TCCGTGACTG GCTCTATGTC GGAGATCACT GCCGGGCAAT TGATCTGGTA
TACGAGTCCG GTAAGACCGG GGAGACCTAT AATATCGGGG GCAATAACGA ACGGGAAAAT
ATTGTCATTA TAAAAAAGAT TCTTGTCCTG TTACAGGATA TGACCGGAGA CCCGCACATT
AATGATAACT TGATTTCTTA TGTGAAAGAT CGGCTGGGAC ATGACCGGAG ATACGCGATT
GACGCTTCGA AAATCAAAAG GGATCTTCAC TGGGAGCATA AGGTTCCCTT CGATGAGGGG
ATCGAGCGAA CGGTCCGGTG GTATCTTGAT CACCGGGAAT GGATGGCCAA TGTCATCTCC
GGGGAATACA CGAAATTTTA TGAGAAAAAT TATGAGGATC GTTAA
 
Protein sequence
MKILVTGAAG FIGSNFVYYY LSRYPERTII GLDNLSYAGN LENLSLLSPD QKARFVFEKA 
DITDTVQIKK ILSKYPVDGI INFAAETHVD RSIHDPQVFL KTNILGTHVL LDAAKTIWHT
KEGGWEDGKK FLQVSTDEVY GTLGPSGYFT ETTPLDPHSP YSASKASADL VVKAYHDTYG
MPVNITRCSN NYGPWQFPEK LIPLLIQNAL LHREIPVYGD GKQIRDWLYV GDHCRAIDLV
YESGKTGETY NIGGNNEREN IVIIKKILVL LQDMTGDPHI NDNLISYVKD RLGHDRRYAI
DASKIKRDLH WEHKVPFDEG IERTVRWYLD HREWMANVIS GEYTKFYEKN YEDR
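
The sequences above are displayed in space-separated blocks of ten characters, so the grouping has to be stripped before the gene sequence can be compared with the protein sequence. The sketch below is one hedged way to do that in Python. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (the tables differ in which start codons are allowed), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical:

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAATCC TTGTGACCGG CGCGGCCGGG TTTATCGGGA GCAATTTTGT TTATTATTAT"
print(translate(first_line))  # MKILVTGAAGFIGSNFVYYY
```

The printed residues match the first two blocks of the protein sequence (`MKILVTGAAG FIGSNFVYYY`). Passing the full gene sequence to `translate` and `gc_percent` gives a way to check the Protein Length and GC content fields against the sequence itself.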