Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_2018 |
Symbol | |
ID | 3996970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 2122437 |
End bp | 2123612 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637959757 |
Product | glycosyl transferase family protein |
Protein accession | YP_566645 |
Protein GI | 91773953 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTGG TAGCGGCTAT TCCTGCATAC AATGCTGAGG TTCATATAAA GAACATCATT AAAAGAACGA AGTACTATGT AGATCATGTG ATCGTTGTTG ATGATGGTAG CAGTGATGCT ACAGCTCATA TTGCTACTAA TATGGGTGCA CGGGTTATCA GGCATGGTGA CAATCTGGGG AAAGCTGCCG CACTGGCAAC GGCTTTTGAG GCTGCAAAAA AATTAAACCC TGCTGTGCTT GTGACGATCT ATGCTAATGG TTTTCATAAT CCGGATGATA TCCCTTCAAT ATTAAAACCT GTACTTTCAA AAGATGCGGA TGTTGTAAAC GGTGCTTACA TTTCTTCTTC CAGTATTGGG CTTGATGCTA CTTTTGATGA TGCGGAGGAT ACAAATGAAG CCAGGCTCGT TAAGGATAGT GGTTTTCGTG CGTATTCTTC AAAGACGCTC GATACTTTCA AATTCACAAA GACCGATGGT GCTATTGAGG TCGAGCTTAT CGATGAGGCT ATTAATGCAG GCTTCAGGGT CAAAGAGGTG CCTATTAAGA TAATCGATCC GGTCAAGCGG GAACTTCTTG CAAGAACACG CATAGGTGTG GTAGTACCCG CCTACAATGA GGAGAAGCTG ATCAAAAATA CTGTTGAAGG TATTCCTCAG TATGTTGACC GTATCTATGT GATAAATGAT GCGAGCACGG ATAATACTGC AAAGGTCATC GAGACGTTGA ACGACCCGAG GGTGGTCGTG ATAACACACG AGACCAATAA AGGTGTGGGT GCTGCACTTA TCAATGGATA CAAAAAAGCA CTAAGGGAAA ATATGGATGT TGTGGCCGTG ATGGCCGGTG ACGACCAGAT GAACCCTGAT CAGTTGTACA AGCTCATTAT CCCGATCATT GAAGGCAGGG CCGACTATAC GAAAGGCAAT CGCCTAATGG ATATCGAATA CCACATCGGA ATGAGCAAGT GGCGTAAGGT CGGTAACGCA GTGCTGACCA TTCTTACAAA GATCGGCAGT GGATACTGGC ACATAATGGA CCCCCAGAAC GGCTATACAG CTATCTCAAA GGATGCCCTC ATAGGAATAG GACTCGATGA TGTTTACACA TACTACGGAG AATTGCGATT AACTGCTGAT TTTTTTCAAA GCAAATATTT CCAGCTAAAA TTTTGA
|
Protein sequence | MTVVAAIPAY NAEVHIKNII KRTKYYVDHV IVVDDGSSDA TAHIATNMGA RVIRHGDNLG KAAALATAFE AAKKLNPAVL VTIYANGFHN PDDIPSILKP VLSKDADVVN GAYISSSSIG LDATFDDAED TNEARLVKDS GFRAYSSKTL DTFKFTKTDG AIEVELIDEA INAGFRVKEV PIKIIDPVKR ELLARTRIGV VVPAYNEEKL IKNTVEGIPQ YVDRIYVIND ASTDNTAKVI ETLNDPRVVV ITHETNKGVG AALINGYKKA LRENMDVVAV MAGDDQMNPD QLYKLIIPII EGRADYTKGN RLMDIEYHIG MSKWRKVGNA VLTILTKIGS GYWHIMDPQN GYTAISKDAL IGIGLDDVYT YYGELRLTAD FFQSKYFQLK F
|
| |