Gene Mlab_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0200 
Symbol 
ID4795729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp186798 
End bp188063 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content53% 
IMG OID640098846 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001029643 
Protein GI124485027 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00649886 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCCA TCATCCAGAA CTGCCTCCAC GGAGATTTAT CCGATATTTG CGAGGCTGCA 
AAATCCGAGG GAATGACGCC GGAGGCCCTG TCCAGGAATA TCATAGCAGG CCGGACCATT
CTTCTGAAAA ACGAGGCACG TGAATATAAA CCGTGTGTGA TCGGCGAGGG AGCGACCGTC
AAGATCAATG TAAATATCGG AACATCCGGC GTCACCTGCG ATCCGGCAAA GGAGATGGTG
AAAGCAAAGG CGGCGATTCA AAACGGCGCC GATGCCATAA TGGATCTGTC CACGGGGGGC
GATCTTGCCG CCATCCGAAA AGAGATCCTC AAACTCGGTA TCCCCGTTGG GACCGTTCCG
ATCTACGAGG CGGTCCGCCG TGCAGGAAAT GTTGTCGACT TAACGGCGGA TATCCTGTTT
TCCGTGATCC TCGACCAGGC AAAACAGGGC GTCGATTTCA TGACGCTTCA CTGCGGGGTC
AATCTGGATG TGCTGGATGC CCTGACTCTC GACCCAAGAG TGATGGGGGT CGTTTCACGC
GGAGGGTCGT TCCACACGGC AATGATGCTT TCAAGCGGCG AGGAGAATCC TCTCTATAAA
GAATACGATT ATCTTCTCGA GATCCTCGAT GAATACGAGA TCTCGCTCTC ACTTGGCGAC
GGTATGCGTC CGGGAGCATA TGTCGATTCA AGCAAACTTG CCAAGTCCCA GGAGTATCTG
ACGCTTGGCA AACTTGCCAG ACGTGCCAAA GATAAAGGAG TCCAGCGGAT GATCGAAGGT
CCGGGACACA TGGACTACAA CGAGATCTCC TACAACGTGA AAATGATCAA GGAGATCACA
GATTTTGCGC CGCTGTATCT TTTAGGTCCG CTGGTTACGG ATATAGCTCC GGGGTATGAT
CATATCACAG GCGCAATCGG CGGGGCAGCA GCAGCCTGTG CTGGCGCCGA TTTCCTGTGT
ATGGTCTCTC CGTCCGAGCA TCTCGCTCTC CCCAACGTCG ACGATATCAT CGAGGGGACC
CGGGTCTGCA AAGTTGCTGC GCATGTGGGT GACCTTTCCC GCAGAAGAGA TGTCGAGCTT
CCCCGTCAGG CGAAAATGGC CGAAGCCCGC AAAAATCTCG ACTGGCAGGC TCAGTATGAT
CTCTCTCTTT TTGGCGGACA TGCTAAAGAG ATCCATGATC GGGACGGGGA ATGCGAAACC
TGTTCGATGT GCGGGGATCT TTGCGCGATA AAGATCGTTG AAAAAGCTCT GGAAAAAAAG
ATCTGA
 
Protein sequence
MHSIIQNCLH GDLSDICEAA KSEGMTPEAL SRNIIAGRTI LLKNEAREYK PCVIGEGATV 
KINVNIGTSG VTCDPAKEMV KAKAAIQNGA DAIMDLSTGG DLAAIRKEIL KLGIPVGTVP
IYEAVRRAGN VVDLTADILF SVILDQAKQG VDFMTLHCGV NLDVLDALTL DPRVMGVVSR
GGSFHTAMML SSGEENPLYK EYDYLLEILD EYEISLSLGD GMRPGAYVDS SKLAKSQEYL
TLGKLARRAK DKGVQRMIEG PGHMDYNEIS YNVKMIKEIT DFAPLYLLGP LVTDIAPGYD
HITGAIGGAA AACAGADFLC MVSPSEHLAL PNVDDIIEGT RVCKVAAHVG DLSRRRDVEL
PRQAKMAEAR KNLDWQAQYD LSLFGGHAKE IHDRDGECET CSMCGDLCAI KIVEKALEKK
I