Gene Mbar_A2129 details

Gene Information

Locus tag: Mbar_A2129
Symbol:
ID: 3626083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 2686866
End bp: 2688152
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 49%
IMG OID: 637701007
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_305640
Protein GI: 73669625
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
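
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2686866, 2688152)
print(length, protein_length_aa(length))  # prints "1287 428"
```

Both values agree with the Gene Length and Protein Length fields listed in the record.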


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTGA TGGAAGATGC AAAGAAGGGA ATCGTCACCC CTTCTATAGA AACTGTGGCA 
AAAACTGAAG GAATAGACCC TGAGACTGTC TGCTCCTGCG TGGCAAAAGG GCTAATTGCT
ATCCCTGTAA ATAACAGGCG AGAGACCCTT CCTATAGGTA TTGGCAAGTA TATGAGCACA
AAGATCAATG CCAATGTCGG AACATCGAGG GACTATGTAG ACATTGATGC CGAAATCAAA
AAAGCAAAGG CAGCAGAAGC TTTCGGCGCT CATGCTGTAA TGGACCTTTC TACAGGTGGA
AATCTGGACG AAATTCGCAC CCGAATCCTG AAATCCGTTA ATATTCCAGT CGGAACGGTC
CCAATCTACC AGGCTGCAGC TTCCAGGAAA GTTGTTGTGG AAATGACTTC GGATGATATG
TTCAACGCCG TCCGGAAACA TGCCGAACAG GGAGTTGACT TTGTGACTGT GCACGCCGGG
GTTAACTTAA ACTCACTTGA ACGGCTGCGT CAGAGCGACA GGATAATGAA TGTCGTGAGT
CGTGGAGGCT CTTTTACCCT TGCATGGATG CTGCATAATG GAGAAGACAA TCCCTTCTAT
GCTGAATTTG ATTATCTACT TGAAATCGCA AAAGAATATG ATATGACCCT GAGCCTTGGG
GACGGCATGC GTCCCGGCTG TATTGCCGAT GCCTCCGACC GCCCGAAGTT TATGGAATTT
ATCACACTCG GTGAGCTCGT AAAGCGGGCA AGGGCTGCCA ATGTCCAGAC CTTTGTGGAA
GGTCCAGGTC ATGTGCCTTT GAACGAAATC GAACTCAGCG TTAGAGGCAT GAAAGAGCTC
TGTAATGGCG CTCCTCTCTA TCTCCTGGGG CCGCTTGTAA CCGATATCGC ACCGGGCTTC
GATCACATTA CAGGTGCGAT AGGAGGAGCA GTTGCGGGGA TGCATGGCAC GGATTTTCTC
TGCATGGTAA CTCCTTCGGA ACACCTCGCC CTTCCAACCC TTGAAGATAT AAAAGAAGGT
CTGCTCGTAA CAAAGGTTGC AGCTCACACT ATTGACCTTA TAAAAGAAGG TCCGAGAGAA
CGCGCCTGGG AAAAGGATCT TGCCATGGCC TATGCCCGCA GGGACCTTGA CTGGGAAAAA
CAGTTCGAAC TGGCAATCGA TGGCAACAGG GCCCGCAAAA TTCGAGATGC CCGAAAAACT
GAAAGCGATA CCTGCTCTAT GTGTGGGGAG CTCTGCGCTT TGAAAATCGT AAAGGAAGCT
TTTGAGAAAA AGAATTCGGA AGAATAA
 
Protein sequence
MTLMEDAKKG IVTPSIETVA KTEGIDPETV CSCVAKGLIA IPVNNRRETL PIGIGKYMST 
KINANVGTSR DYVDIDAEIK KAKAAEAFGA HAVMDLSTGG NLDEIRTRIL KSVNIPVGTV
PIYQAAASRK VVVEMTSDDM FNAVRKHAEQ GVDFVTVHAG VNLNSLERLR QSDRIMNVVS
RGGSFTLAWM LHNGEDNPFY AEFDYLLEIA KEYDMTLSLG DGMRPGCIAD ASDRPKFMEF
ITLGELVKRA RAANVQTFVE GPGHVPLNEI ELSVRGMKEL CNGAPLYLLG PLVTDIAPGF
DHITGAIGGA VAGMHGTDFL CMVTPSEHLA LPTLEDIKEG LLVTKVAAHT IDLIKEGPRE
RAWEKDLAMA YARRDLDWEK QFELAIDGNR ARKIRDARKT ESDTCSMCGE LCALKIVKEA
FEKKNSEE
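
The sequences above are printed in blocks of ten characters separated by spaces and line breaks, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that and to compute the GC fraction. The helper names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACACTGA TGGAAGATGC AAAGAAGGGA ATCGTCACCC CTTCTATAGA AACTGTGGCA "
seq = clean_sequence(first_line)
print(len(seq))          # prints 60
print(gc_fraction(seq))  # GC fraction of this 60-base sample only
```

Applying the same two functions to the full gene sequence would give values to compare against the Gene Length and GC content fields.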