Gene Information |
Locus tag | Athe_2066 |
Symbol | |
ID | 7408775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2183550 |
End bp | 2184677 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643716433 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002573916 |
Protein GI | 222530034 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
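The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, so treat both as assumptions.

```python
# Values copied from the Gene Information fields above.
start_bp = 2183550
end_bp = 2184677
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not counted
# in the reported protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print `True`: 2184677 - 2183550 + 1 = 1128 bp, and 1128 / 3 - 1 = 375 aa.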
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.033296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTT TGCACCTTAT AAGCGGTGGT GATACAGGGG GAGCAAAGAC ACATATAATA AACCTGTGTT CAAAACTAAA AGATCTTGTC AGTCTTAAAA TTATATGTTT CATGTACGGG CAATTCTATG AAGAGGTAAA AAATGCTGGA ATTGATATAG ATGTTATTCA ACAATCCTCT CGATTTGACT TGAGTGTGGC TGACAGAATT GCGCATATCG TTAAAGCTGA AGATTATGAT ATAATTCACT GTCATGGTGC AAGAGCAAAT TTTATTGGAA TGTTTTTAAA ACGAAAAATC AAAAACAAGC CATTTATCAC AACAGTACAT AGCGACTTTG ATTTAGATTT TCAGGACGTT TTTTATAAAA GAGTAGTGTT TTCATTTCTT AATAAACTTT CTTTAAAAAG ATTTGACTAT TTTATTTCTG TGGGATCTGC ATTGATTGAC AAAATAAAAG GACTTGGGGT GAAAGAAAAT AGAATTTTTC TTTTGTACAA TGGTTTTGAC TTTTCAAAAG AGATACATTA TGTGCAAAAG GATGAATTTT TATCAAAGTT TTTTGATAGA AAAGTATTTG ACTCCAAAAT AGTTATAGGA AACTTGAGCA GGTTATACAA GGTAAAAGGT TTAGATGTAT TTATAAAAGC CGCCAATATA ATAGCTAAAA AATATCCTGA GGTCATTTTT TTAATCGGCG GAAGTGGTCC TCAAAAGGAA TTTTTAAAGC AAATGATAAG TGAATACAAT TTAAATGACA GGGTATTTCT ACTTGGCAGT ATAAAAAATC CATATGACTT TTTTAATAGC ATAGATATAA ATGTCATAAG TTCATACTCT GAAACTTTCC CATATTCAAT CTTAGAAGCA ACAGCACTTG AAAAGTGTTG TATATCAAGC AAAGTGGGTT CAGTGCCAGA CTTGATTGAA GATGGTAAAA ATGGTTTTTT ATTTGACGCT GGAGATTATA AAGGGCTTGC TCAAAAGATA GAAATTCTTT TGCAAAATAA AGACCTTATC AAAGAATTTG GACAGCTTCT TTCTAAAAAA GCAAAAGAAA AGTTTTCTGC AGAAAATATG GCAAGGATGC AATTTGAGAT TTATAAAAGC ATACTTTCGA AAAAATAA
|
Protein sequence | MKVLHLISGG DTGGAKTHII NLCSKLKDLV SLKIICFMYG QFYEEVKNAG IDIDVIQQSS RFDLSVADRI AHIVKAEDYD IIHCHGARAN FIGMFLKRKI KNKPFITTVH SDFDLDFQDV FYKRVVFSFL NKLSLKRFDY FISVGSALID KIKGLGVKEN RIFLLYNGFD FSKEIHYVQK DEFLSKFFDR KVFDSKIVIG NLSRLYKVKG LDVFIKAANI IAKKYPEVIF LIGGSGPQKE FLKQMISEYN LNDRVFLLGS IKNPYDFFNS IDINVISSYS ETFPYSILEA TALEKCCISS KVGSVPDLIE DGKNGFLFDA GDYKGLAQKI EILLQNKDLI KEFGQLLSKK AKEKFSAENM ARMQFEIYKS ILSKK
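The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling. The codon mapping is the standard one, which translation table 11 shares for internal codons; table 11 differs in which start codons it permits, and that does not matter here because this gene starts with ATG. The example translates only the first 30 bases of the gene sequence above; passing the full sequence works the same way.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
# Assumption: only unambiguous bases A, C, G, T appear in the input.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in the record.
gene_prefix = "ATGAAAGTTT TGCACCTTAT AAGCGGTGGT"
print(translate(gene_prefix))  # MKVLHLISGG, the first ten residues of the protein
print(round(gc_fraction(gene_prefix), 2))
```

Running `gc_fraction` on the complete gene sequence gives the value to compare against the reported GC content of 31%.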