Gene Mbar_A1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1920 
Symbol 
ID3627199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2396481 
End bp2397692 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content42% 
IMG OID637700801 
Productglucosaminyltransferase 
Protein accessionYP_305440 
Protein GI73669425 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.807733 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCCAC TTCTTTTATT CTTCTATATT ATAGGATTCA CCTGCTTTTC TGTAGGTATT 
CTGACAATTC CATATTATCC ACTATCTTTA GCTTTCGAAA TGCGCAAAAG CAAACAGAAT
ATATTCAACA GCAAGAATCC CTTTGTCTCT ATAGTGGTAC CAGCCTATAA TGAAGGAAAA
GTGATAGGCC ATTGTATAAA GTCCATTCTA GAATCAAATT ATTCAGAATA TGAGGTTATC
CTAGTAGATG ATGGTTCATC TGATAATACA CTTGAAGAGA TGCAGCATTA TGAAACGAAC
TCTCATGTAA TTGTGGTAAC CAAGAAAAAT GGAGGCAAAG CCTCAGCTCT TAACGTGGGA
CTAAAACTAG CCAAAGGAGA AGTAATATTC TTCGTCGATG CTGACGGCAT CTTTGCTCCT
GATACCATAA GCAAGATGCT CAGTGGTTTT ATCAGCGAAG ATGTAGGTGC AGTATGTGGC
AACGATGCTC CCATTAACCT AGATAAGCTT CAGACACAAC TGGCGAACTT GCTCACTCAC
GTAGGCACAG GCTTTGTGCG TAGAGCGCTG TCTACTATTG ATTGCCTTCC TGTCGTGTCT
GGAAATATAG GCGCCTACCG ATCTAGCACT CTTGAAAAGA CTGGTCCCTT CTTAGAGGGA
TTTATAGGCG AAGATATGGA ATTGACCTGG CGTGTGCACA AGGCAGGTTA CAAAGTAGTG
TTTCAACCTT GGGCCATAGT GTATGCAGAA GCACCCTCAA CTATCACGGG CCTGTGGAAG
CAGCGCGTGC GATGGGCGCG TGGGTTACTT AAGACCGCTT ATATTCACCG AGATATGTTA
TTCAATCCAA AGTATGGCCT TTTTGCCTTT TATTTGCCTA TTAACTTAAC ATCAATGATT
ATCATACCTT TGCTACAGTT GATATCAATC ATACTGTTGC CAATCTTGCT ATTTCGCAAC
ATTAATCCCA TTCCTTTAAG CTGGATAAGC ATCATGGGTT GGCTTGGTAT GTTCTCTTCC
ATTTTTGCAT TATTATTTTC TATTGCCCTT GACCGAGCTT GGCTTGATTT AAAATACTTC
TACGTGATAC CATTATGGGT ACTATATTCC TTTATGATGG ATGTAGTAAT GCTATGGGCG
ATCATCGTTG AGCTGCGTAG AAAGGAGGCA AAATGGAACA AGCTAGATCG GACCGGCATT
GTAAGCCGCT GA
 
Protein sequence
MEPLLLFFYI IGFTCFSVGI LTIPYYPLSL AFEMRKSKQN IFNSKNPFVS IVVPAYNEGK 
VIGHCIKSIL ESNYSEYEVI LVDDGSSDNT LEEMQHYETN SHVIVVTKKN GGKASALNVG
LKLAKGEVIF FVDADGIFAP DTISKMLSGF ISEDVGAVCG NDAPINLDKL QTQLANLLTH
VGTGFVRRAL STIDCLPVVS GNIGAYRSST LEKTGPFLEG FIGEDMELTW RVHKAGYKVV
FQPWAIVYAE APSTITGLWK QRVRWARGLL KTAYIHRDML FNPKYGLFAF YLPINLTSMI
IIPLLQLISI ILLPILLFRN INPIPLSWIS IMGWLGMFSS IFALLFSIAL DRAWLDLKYF
YVIPLWVLYS FMMDVVMLWA IIVELRRKEA KWNKLDRTGI VSR