Gene Mbar_A0238 details


Gene Information

Locus tag: Mbar_A0238
Symbol:
ID: 3624851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 281400
End bp: 282428
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 33%
IMG OID: 637699130
Product: glycosyltransferase
Protein accession: YP_303802
Protein GI: 73667787
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
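
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The Python sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(281400, 282428)
print(length)                  # 1029
print(protein_length(length))  # 342
```

Both values agree with the Gene Length and Protein Length fields listed in this record.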


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00200202
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCCA AGGTCGCCAT TATTATTCTC AACTGGAACG GTTGGGAAGA CACTATAGAA 
TGTTTAGAGT CTATTTATCA AATAGCATAT CCTCTTTATG ATATCATCTT AGTCGACAAT
GGCTCAAAAG ATAATTCTAT ACAAAAAATA AAAGACTATG CTGAAGGAAA AATTAAAGTT
GAATCTCAAT TTTTTGATTA TTCTCCCGAC AATAAACCTC TCTATTTAAA GGAATTTACA
AAAAATGAAC TGGACTCCTC AGTTTCTATT AAGAAATCTA TTGATAGCCT GGCTCCAAAT
AAAAATTTGA TTATTATTAA AAACGATAAT AATTACGGCT TTGCTGAAGG AAACAATATT
GGCATTAGAT TTGCTTTAAA TAACCTTAAT CCAGATTATA TTCTTTTACT TAATAACGAT
ACAGTTGTTG ATCCTTATTT TCTAAAAGAA CTTATAACGG TGGCAGAGAG TGACTCACTC
ATAGGAATAC TGGGGCCAAC TGTATACGAA TACAAAAGTC CTCAAGTAAT TCAATCTGCA
GGTGCAAAGA TTTATTGGAA CAAAGGTGAA GTAATTAACC TGACACCTAA TGAAAATAAG
TATTCAGACG AGCCTGAAAA CGTCGACAGT GTAATAGGCT GTGCATTACT TGCAAAAAGT
GAATTATTTC ATAAAATCGG GTATTTGAAT AAAGATTATT TCGCATACCT TGAAGAGACT
GAGTGGTGTG TACGTGTTTC TAAAGCATCG TACAAAATAG TTTACGTGCC AAAAGGAAAA
ATCTGGCACA AAGGTGGAGC TACAAGCAAT AAAATAACCG GTTTTACGCT CTTTCACCAT
ACCAGAAACA AGTTTTGGTT TATGAAAAAG CACTCATCAA AAAAACAATA CATTTCTTAT
TTAATTTATT TTTTTGGATT TCGTGCATGG ATGATCATTG GTGGCATCTT TTACCGGCAA
AAAAATAAGG AAATTTTGCC ATCTTTGATT TCTTTCTTGA AAGGAATTCG GGATGGAATT
CTGACCTAA
 
Protein sequence
MISKVAIIIL NWNGWEDTIE CLESIYQIAY PLYDIILVDN GSKDNSIQKI KDYAEGKIKV 
ESQFFDYSPD NKPLYLKEFT KNELDSSVSI KKSIDSLAPN KNLIIIKNDN NYGFAEGNNI
GIRFALNNLN PDYILLLNND TVVDPYFLKE LITVAESDSL IGILGPTVYE YKSPQVIQSA
GAKIYWNKGE VINLTPNENK YSDEPENVDS VIGCALLAKS ELFHKIGYLN KDYFAYLEET
EWCVRVSKAS YKIVYVPKGK IWHKGGATSN KITGFTLFHH TRNKFWFMKK HSSKKQYISY
LIYFFGFRAW MIIGGIFYRQ KNKEILPSLI SFLKGIRDGI LT
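
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the Python below translates the first line of the gene sequence and defines a GC-content helper. It assumes the amino-acid assignments of NCBI table 11, which match the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and are not taken from the source database.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_line = "ATGATTTCCA AGGTCGCCAT TATTATTCTC AACTGGAACG GTTGGGAAGA CACTATAGAA"
print(translate(first_line))  # MISKVAIIILNWNGWEDTIE
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.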