Gene Mbar_A0597 details


Gene Information

Locus tag: Mbar_A0597
Symbol:
ID: 3628003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 718417
End bp: 719703
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 49%
IMG OID: 637699490
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_304157
Protein GI: 73668142
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
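
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (which is consistent with the listed length, but is not stated by the record) and that the CDS ends in a single stop codon. The function and constant names are illustrative only.

```python
# Values copied from the record above; names are hypothetical.
START_BP = 718417
END_BP = 719703


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1287 428
```

Both results agree with the Gene Length and Protein Length fields.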


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCG TGGAAGATGC ACAAAAAGGG ATTATTACTG AAGAAATGAA GATTGTTGCA 
AAGGACGAAG GACTTGACCC TGAATTCATC CGTCGTGGTG TTGCAGCCGG AAGAATTGTT
ATTCCAACCT CCCCATACAG GCAGGTAAAG ATCTGCGGTA TAGGAGAAGG GCTCAGGACC
AAAGTCAATG CATCCATCGG TGTATCCTCG GATATTGTTG ATGCAGACAT GGAAGTTAAA
AAAGCACAGG CTGCCGAAGC TGCAGGTGCA GACACCCTTA TGGAGCTCGG AACTGGTGGA
GACTTCCTTG CAATCAGGAA AAAAGTCATT GACAGTATTT CCCTTTCAGT CGGTTCAGTG
CCTCTTTACC AGGCCTTCAT TGAGGCCGCA AGGAAATACG GCTCAATCGT GGATATGACC
GAAGACGAAC TCTTCAAGGC AACCGAAGAC CAGGCAAAGC TCGGAACTAA TTTCATGGCA
ATTCACACAG GAATCAACAA TATCACCATG GACCGCCTTA AAGCCCATGG CAGGTACGGT
GGCCTCTGTT CCCGTGGTGG CGCCTTTATG ACTTCCTGGA TGCTCCACAA TGAAAAGGAA
AATCCACTTT ATGCAAACTT TGATTACCTT GTTGAGATCC TCAAGGAACA CGAAGTAGTC
CTCTCTACCG GAAACGGTAT GCGTGCAGGT GCAGTCCACG ATGCAACCGA CCGTGCCCAG
ATCCAGGAAT TAATTATTAA CTCCGAACTG GCCGACAGAG CCCACAAGCA GGGTGTGCAG
GTCATTGTCG AAGGTCCGGG TCATGTCCCT CTCGACCAGA TAGGAACCAA CGTAAAACTC
ATGAAGGAAA TGAGCGGTCA CAAGCCATTC TACATGCTCG GCCCACTTGT AACTGACATC
GCACCAGGTT ACGACCACAT CGTAACTGCA ATCGGAGCAT CGGTTTCTGC TTCATATGGC
TGTGACTTCC TTTGCTATGT AACTCCTGCA GAGCACCTTG CCCTTCCAAA CCTTGAAGAT
GTTATCACAG GAGTCAAAAC CTCAAAGATT GCAGCTCACG TAGGCGATAT GGTAAAATAT
CCAGACAGGG CAAGAGAACA GGACCTTGCT ATGGGCAGAG CTAGAAGAGA CCTCGATTGG
CAAAAGATGT ACTCTCTTGC AATCGACCCA GAACACGCAA AAGAAGTTAG GAACAGCAGG
GCTCCCGAAG ATTCTGACGC CTGCACAATG TGCGGTAACT TCTGCGCCCT CAAGATCGTA
AACCAGAACT ACAACCTCGC AAAATAA
 
Protein sequence
MTIVEDAQKG IITEEMKIVA KDEGLDPEFI RRGVAAGRIV IPTSPYRQVK ICGIGEGLRT 
KVNASIGVSS DIVDADMEVK KAQAAEAAGA DTLMELGTGG DFLAIRKKVI DSISLSVGSV
PLYQAFIEAA RKYGSIVDMT EDELFKATED QAKLGTNFMA IHTGINNITM DRLKAHGRYG
GLCSRGGAFM TSWMLHNEKE NPLYANFDYL VEILKEHEVV LSTGNGMRAG AVHDATDRAQ
IQELIINSEL ADRAHKQGVQ VIVEGPGHVP LDQIGTNVKL MKEMSGHKPF YMLGPLVTDI
APGYDHIVTA IGASVSASYG CDFLCYVTPA EHLALPNLED VITGVKTSKI AAHVGDMVKY
PDRAREQDLA MGRARRDLDW QKMYSLAIDP EHAKEVRNSR APEDSDACTM CGNFCALKIV
NQNYNLAK
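
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they could be checked against each other, the snippet below strips the whitespace, computes GC content, and translates DNA in frame. It assumes the standard amino acid assignments, which translation table 11 shares for elongation codons (the table differs from the standard code in its permitted start codons). All function names are illustrative and not part of any database tooling.

```python
import re
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO))


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the result."""
    return re.sub(r"\s+", "", text).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Codons containing unexpected characters are rendered as 'X';
    a trailing partial codon is ignored.
    """
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence and the first protein block.
head = clean_sequence("ATGACGATCG TGGAAGATGC ACAAAAAGGG")
print(translate(head))  # prints: MTIVEDAQKG

# Last line of the gene sequence, which is in frame and ends with TAA.
tail = clean_sequence("AACCAGAACT ACAACCTCGC AAAATAA")
print(translate(tail))  # prints: NQNYNLAK
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_content` would allow the Protein sequence and GC content fields to be compared in the same way.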