Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A1920 |
Symbol | |
ID | 3627199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 2396481 |
End bp | 2397692 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637700801 |
Product | glucosaminyltransferase |
Protein accession | YP_305440 |
Protein GI | 73669425 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.807733 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCCAC TTCTTTTATT CTTCTATATT ATAGGATTCA CCTGCTTTTC TGTAGGTATT CTGACAATTC CATATTATCC ACTATCTTTA GCTTTCGAAA TGCGCAAAAG CAAACAGAAT ATATTCAACA GCAAGAATCC CTTTGTCTCT ATAGTGGTAC CAGCCTATAA TGAAGGAAAA GTGATAGGCC ATTGTATAAA GTCCATTCTA GAATCAAATT ATTCAGAATA TGAGGTTATC CTAGTAGATG ATGGTTCATC TGATAATACA CTTGAAGAGA TGCAGCATTA TGAAACGAAC TCTCATGTAA TTGTGGTAAC CAAGAAAAAT GGAGGCAAAG CCTCAGCTCT TAACGTGGGA CTAAAACTAG CCAAAGGAGA AGTAATATTC TTCGTCGATG CTGACGGCAT CTTTGCTCCT GATACCATAA GCAAGATGCT CAGTGGTTTT ATCAGCGAAG ATGTAGGTGC AGTATGTGGC AACGATGCTC CCATTAACCT AGATAAGCTT CAGACACAAC TGGCGAACTT GCTCACTCAC GTAGGCACAG GCTTTGTGCG TAGAGCGCTG TCTACTATTG ATTGCCTTCC TGTCGTGTCT GGAAATATAG GCGCCTACCG ATCTAGCACT CTTGAAAAGA CTGGTCCCTT CTTAGAGGGA TTTATAGGCG AAGATATGGA ATTGACCTGG CGTGTGCACA AGGCAGGTTA CAAAGTAGTG TTTCAACCTT GGGCCATAGT GTATGCAGAA GCACCCTCAA CTATCACGGG CCTGTGGAAG CAGCGCGTGC GATGGGCGCG TGGGTTACTT AAGACCGCTT ATATTCACCG AGATATGTTA TTCAATCCAA AGTATGGCCT TTTTGCCTTT TATTTGCCTA TTAACTTAAC ATCAATGATT ATCATACCTT TGCTACAGTT GATATCAATC ATACTGTTGC CAATCTTGCT ATTTCGCAAC ATTAATCCCA TTCCTTTAAG CTGGATAAGC ATCATGGGTT GGCTTGGTAT GTTCTCTTCC ATTTTTGCAT TATTATTTTC TATTGCCCTT GACCGAGCTT GGCTTGATTT AAAATACTTC TACGTGATAC CATTATGGGT ACTATATTCC TTTATGATGG ATGTAGTAAT GCTATGGGCG ATCATCGTTG AGCTGCGTAG AAAGGAGGCA AAATGGAACA AGCTAGATCG GACCGGCATT GTAAGCCGCT GA
|
Protein sequence | MEPLLLFFYI IGFTCFSVGI LTIPYYPLSL AFEMRKSKQN IFNSKNPFVS IVVPAYNEGK VIGHCIKSIL ESNYSEYEVI LVDDGSSDNT LEEMQHYETN SHVIVVTKKN GGKASALNVG LKLAKGEVIF FVDADGIFAP DTISKMLSGF ISEDVGAVCG NDAPINLDKL QTQLANLLTH VGTGFVRRAL STIDCLPVVS GNIGAYRSST LEKTGPFLEG FIGEDMELTW RVHKAGYKVV FQPWAIVYAE APSTITGLWK QRVRWARGLL KTAYIHRDML FNPKYGLFAF YLPINLTSMI IIPLLQLISI ILLPILLFRN INPIPLSWIS IMGWLGMFSS IFALLFSIAL DRAWLDLKYF YVIPLWVLYS FMMDVVMLWA IIVELRRKEA KWNKLDRTGI VSR
|
| |