Gene Mlab_1346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1346 
Symbol 
ID4794398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1368996 
End bp1370372 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content62% 
IMG OID640100028 
Producthypothetical protein 
Protein accessionYP_001030780 
Protein GI124486164 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.212301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000869519 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGCGGG ATCTGGTTCA GTTCGAGTTG TCCGGGGTTG TTTCGCCGGA GACGATGCGG 
GTTGTTGATA ATAACGCGGA TGATTACGGA ATCTCCGCAG CCCAGCGCAT GGAAAGCGCC
GGGTCCGTAC TCGCCGCCGC CGTTCGATCT GAGTGTCCCG CTTCCGTTCT TATCCTCTGC
GGAACGGGAA ATAACGGGGG AGACGGGTTC GTCTGTGCCC GTCATCTCGC AAAGGAGTAT
TCGGTAAAGG TGATCTTTAC CGGTGAACCG AAGACGCCGG AGGCACGGGC GGCGTTTTCC
TCCCTTGACG GCTGCCCGGT GGAACTCGCT TCCTCCTGGT CGGCCGAAGA TTTCTCCGCC
GATGTTATCG TGGACGCCCT TCTTGGGACC GGCGCTTCCC TGCCGCTGAA AGAACCGTAT
GCTTCCCTTG TGGGTCTGAT GAACGAGGCA AAAGGCCGTG TTCTCGCCTG CGATATGCCG
ACGCCGGGCG GCCGGGCTGA CCGCGTGATC GCTTTTCATC TCGCAAAAAC CGAGGGCGCC
GAGGTATACA GTATCGGGAT TCCGTTCGGC GCCGAGGTTT TCTGCGGGAA GGGGGACCTT
CTTTCCGTCC CGAAAAAGCC GGCCGGGGCG CACAAGGGAT GGGCGGGCTA TGTGCTCGTG
ATCGGCGGCG GGCCGTATCA GGGTGCGCCG TTCCTTGCAG GGACCGCCGC CCTTCGTTCG
GGTGCGGATG TCGTCCGCGT GGCGGCTCCC GTTGACGGAT TCATGCCCGA TCTGATCCTT
GAGAGACTGC CGGGAAACAA AGTGGGCAAA GAACATCTGA CCCGTCTGCT TGCATTGGCA
GAGAACGCGG GTGTGGTGAT CGCCGGACCC GGGCTCGGCG CCGATCCGGA AAGTCTCGAG
GTGGCTTCGC AGGTGGTCTC AGCCGCGAAA CGTGCGGTGG TGGACGCCGA TCTTCTGCGT
AATCCTCTGC CGAAAGCACG GGAACAGACG ATCTATACGC CGCATGCAGG CGAGTTTGCC
CGGGTCTTTT GTCCCGTGCC GGAGAAACTC GGTGAGCGGG GGATCATCGT CCGTGAAGCC
GCGAAGCGTG CCGGCGGGAC GGTTCTGTTG AAAGGAGCGG TCGATGTGAT TTCGGACGGT
TCCCGGGTGA AGTTCAACCG GTCCGGCGCT CCGGGCATGA CCACGGGAGG AACCGGCGAT
GTTCTTTCCG GTGTCTGCGG CGGGCTTCTT GCCCGGATGG ATGCCTTCGA GGCCGCCTGC
GCCGCGGTGC ATGCCGCGGG ACTTGCCGGG GAGCTGGCCG ATGAAGACAC CGGGGACGGT
CTAATCGCAA CCGATTTACT GAGGCACCTT GCATACATAG TGTATAAGGA GAAATAA
 
Protein sequence
MMRDLVQFEL SGVVSPETMR VVDNNADDYG ISAAQRMESA GSVLAAAVRS ECPASVLILC 
GTGNNGGDGF VCARHLAKEY SVKVIFTGEP KTPEARAAFS SLDGCPVELA SSWSAEDFSA
DVIVDALLGT GASLPLKEPY ASLVGLMNEA KGRVLACDMP TPGGRADRVI AFHLAKTEGA
EVYSIGIPFG AEVFCGKGDL LSVPKKPAGA HKGWAGYVLV IGGGPYQGAP FLAGTAALRS
GADVVRVAAP VDGFMPDLIL ERLPGNKVGK EHLTRLLALA ENAGVVIAGP GLGADPESLE
VASQVVSAAK RAVVDADLLR NPLPKAREQT IYTPHAGEFA RVFCPVPEKL GERGIIVREA
AKRAGGTVLL KGAVDVISDG SRVKFNRSGA PGMTTGGTGD VLSGVCGGLL ARMDAFEAAC
AAVHAAGLAG ELADEDTGDG LIATDLLRHL AYIVYKEK