Gene Mlab_1711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1711 
Symbol 
ID4795839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1747266 
End bp1748543 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content56% 
IMG OID640100402 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001031139 
Protein GI124486523 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATCG TTGAAGATGC AAAAAAAGGC CTCATCACTG AAGAGATGAA GGTCGTGGCA 
AAATCCGAAG GGGTCACCGA AGATTTCATC CGCAGGGGAA TTGCCGGCGG TCATATCGTC
ATCCCGATGA CGCCCTACCG GAAAGTAAAA CTCTGCGGTA TCGGCTCGGG ACTTCGCACA
AAAGTGAATG CATCGATTGG GACCTCGTCT GATATCGTGA ATGTGGAAGA GGAGCTGGAA
AAGGCACGTC AGGCAGAACT CGCCGGGGCA GACTCGCTGA TGGAGCTTTC GACAGGCGGG
GACTTCCTCG ATATCCGCCG CCGGGTCTGT GAACAGTCGA ACCTTTCCGT TGGATCCGTT
CCGCTCTATC AGGCGTTCAT CGAAGCTGCC AGAAACAAAG GCGGCGTCGT TTTCATGGAC
GAGGACGATC TCTTCAAGAT CACCGAGCAG CAGGCAAAAC TCGGTACGAA TTTTATGGCG
ATCCACACCG GAATCAACTA CGAGACCGTG AAGAGACTGA AAAATCAGGG CAGACACGGC
GGTCTCGTCT CCCGTGGAGG GGCATTCATG ACGGCATGGA TGCTGCACAA CGAGATGGAG
AACCCTCTCT ACCGCAGGTT CGACTACCTC GTCGAGATCT TAAAGGAACA CGAAGTCACC
CTCTCCTTTG GAAACGGCAT GCGGGCAGGG GCCTGTCATG ACGCCACGGA TCGTGCCGCC
ATCCAGGAAC TTTTGATCAA CGCAGAACTC GCCGATCAGG CACACAACGC CGGTGTTCAG
TGCATTCTCG AAGGTCCGGG TCACATCCCT CTCGATGAGA TCAAAACCAA TGTCCAGCTG
GAAAAACGGG TCACCAACAA CAAACCGTTC TATATGCTCG GTCCTCTGGT AACGGATATC
GCTCCGGGAT ACGACGACCG CGTTGCTGCG ATCGGAGCTT CCGTCTCGTC CGCCGCAGGT
GCAGATTTCA TCTGCTATGT AACCCCGGCC GAGCATCTGG CTCTTCCGAC CCCCGAAGAG
GTCTATGAGG GAGTTATGAG TTCACGCATC GCGGCCCACG TCGGCGATAT GGTCAAACTC
CCAAAGACCC GTGAAGCCGA TCTCGAGATG GGGCACGCCC GCCGCGACCT CGACTGGGAA
CGCCAGTATG CCGTCTCGAT AAACGCAGAG AAGGCACGCT GCATCAGAAA CTCCCGCATG
CCGGCCGATT CCGATGCCTG CACGATGTGC GGCGATTTCT GTGCGATCAA GATCGTCCAG
AAGACGTTTA ATTTCTGA
 
Protein sequence
MTIVEDAKKG LITEEMKVVA KSEGVTEDFI RRGIAGGHIV IPMTPYRKVK LCGIGSGLRT 
KVNASIGTSS DIVNVEEELE KARQAELAGA DSLMELSTGG DFLDIRRRVC EQSNLSVGSV
PLYQAFIEAA RNKGGVVFMD EDDLFKITEQ QAKLGTNFMA IHTGINYETV KRLKNQGRHG
GLVSRGGAFM TAWMLHNEME NPLYRRFDYL VEILKEHEVT LSFGNGMRAG ACHDATDRAA
IQELLINAEL ADQAHNAGVQ CILEGPGHIP LDEIKTNVQL EKRVTNNKPF YMLGPLVTDI
APGYDDRVAA IGASVSSAAG ADFICYVTPA EHLALPTPEE VYEGVMSSRI AAHVGDMVKL
PKTREADLEM GHARRDLDWE RQYAVSINAE KARCIRNSRM PADSDACTMC GDFCAIKIVQ
KTFNF