Gene Mlab_1590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1590 
Symbol 
ID4794399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1620963 
End bp1621928 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content56% 
IMG OID640100276 
Producthypothetical protein 
Protein accessionYP_001031020 
Protein GI124486404 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00904391 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000759826 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGTCA TTACGTTTTC CCGCAACGTG TTTCTTCCGC TGACCAACGT CTGTGCAAAT 
ACCTGCGGAT ACTGCTCGTT CAAATCCCCG GTGGCCGAAG GCTGCGTGAT GCCAAAGGAG
GAGGTTGTCT CCACGCTCGA ACGCGGGGCG GCGTTTTCCT GCACCGAGGC CCTTTTCACG
TTCGGCGAAC GTCCCGAACG CGAAGCAGGT TTCACCGCAT ACATTAACCG GATGGGGTAT
CCCGATATTC TTTCCTACTG TAAAGACATG AGCAGATACG CAATCTCGCT TGGGATTTTG
CCGCACACGA ATGCCGGCAT CCTCACCTAT GAGGAGCTGG AGGATCTCCG GCCCGTGAAC
GCGAGCATGG GTCTGATGCT TGAAACGACC GCCGAGATCC CTGCCCATGC CCACAGTCCG
GGAAAAGATC CGTCAGTCAG GATAGAAATG ATGGAAAATG CGGGAAAACT CAAGATCCCG
TTCACCACCG GACTGCTGCT TGGGATCGGC GAAACCCGGG ACGACCGTAT CGAGTCGCTC
GAAGTCATCC GCGATCTGCA TAAAAAATTC GGGCACATCC AGGAAGTGAT CGTCCAGAAC
TTCTGCCCGA AAGAAGGTAC CGACATGGCG TCGTTTCAGG GAGCTTCGAC CGAGGTGATC
GCGGATACCC TCCGGCTTTC TAAAGAGATC CTGCCCGCTG ATGTGTCGAT CCAGATCCCG
CCGAATCTTG CCGATGCTTC CCTCCTTCTT GATCTCGGCG TGACCGATCT TGGCGGAGTG
TCGCCGGTTA CGATCGATTA CATCAATCCG GAACATCCCT GGCCGGCTCT GGATGCATTG
AAGGACATTG CCCGCGGATA CGAGGTACGG GAACGTCTGT GCATTTATGA GAAATACTGC
ACCCCGGCGT GGGTGGCTCC GGAGTTGTTC GGACTCGTGA GCGAACTCGC GAAGAAGGTG
TACTGA
 
Protein sequence
MPVITFSRNV FLPLTNVCAN TCGYCSFKSP VAEGCVMPKE EVVSTLERGA AFSCTEALFT 
FGERPEREAG FTAYINRMGY PDILSYCKDM SRYAISLGIL PHTNAGILTY EELEDLRPVN
ASMGLMLETT AEIPAHAHSP GKDPSVRIEM MENAGKLKIP FTTGLLLGIG ETRDDRIESL
EVIRDLHKKF GHIQEVIVQN FCPKEGTDMA SFQGASTEVI ADTLRLSKEI LPADVSIQIP
PNLADASLLL DLGVTDLGGV SPVTIDYINP EHPWPALDAL KDIARGYEVR ERLCIYEKYC
TPAWVAPELF GLVSELAKKV Y