Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0423 |
Symbol | |
ID | 7407500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 481407 |
End bp | 482633 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 643714810 |
Product | glycosyltransferase 28-like protein |
Protein accession | YP_002572328 |
Protein GI | 222528446 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TATTCTTACA ATTTGGTTCT GGGTTAGGTC CCATGTCTCG ATCCCTTCCA ATCGCAGAAG GGTTACAAAG AGAAGGATAT ATCGTAAAGT ATTTTGGATT TGAAAATGCA AAACCATATA TGAATAAAAT AGGAATTGAA GAGTTATCAG AAAATTTTAA TATCAAGGAT ATTAAAAAAG GAGTGCAAAC TCCTAATTGG TATTGTGCAG AGCAATTTTG GGAAATAATA GGATATGGTA ATATGGAATG GGTAGAAAAA AAAGTTGAAG AATTAATAGA ATATTTAAAA GATTTTTCTC CTGATTTTAT AATATCAGAC CTTGGTATAT TAAGTTGTAT TGCTGCAAGA ATAATGGACA TACCTTTGAT AGCTATAACT CAAAGTTGTT ATCATCCTAA CATTGCTTTT GGAAGAATAA GATGGTGGGA AGAAGAACAA AATTTAAAGT TTACATTAAC TGAGAAATTA AATAATTATT TTAAGAAAAA AGGTGTTTCA CAATTAAATT CTTTTGAAGA AATTTTTACT GGTAGTTTAA CCATAATTCC CAGTTTTCCT GAATTTGATC CAATAAATAA TCCTTCAGAA TTTAACACAT ATTATGTTGG TCCCATATTA TGGGATCCAT TAGACATGGC TAAAGAAGAG TATATAAAAT TGTTTAACAG AGATAAAAAT AAGCCTACAA TTTTTTGCTA TACAGCAAGA TTTTACGACA ATGTGGGTGA AAGTGGAATT ATTATTTTTA AAACATTACT TTCAGCATTA AAAAAATTTG ATGCTAACAT TATTTTTTCT ACAGGGAGTG ATTCGGACAG GAAAATAGCA AAAGAGATTT TAAACTCTTA CGGAATTGAT GAAGAGAAAT TTAGCATTAT TGATTGGGTT CCAATGGGAA TTGCTTATGG AAACTCTGAT GTTGTTATCC ATCATGGAGG CCATGGAAGT TGTTTAGGTC AATTTTTGTA TGAGGTACCT TCATTAATTA TACCTACTCA TACTGAACGA GAGTATAATG CAAGAATTTG CACCAATATG GGAGTTTCTA AATTTATAAA AAGAGAAGAC ATTGAAAAAG CAGATATATT AGCTGAAATT GTTGAGATTT TAACTAACTC AAGTTTTAAA GAAAGATTAC ACTTTTGGCA TACTAAACTA AATGAATATA ATTTTACAGG TGTAAATAAA GTTTTAGAAT TGATTCAAAA ATTATAA
|
Protein sequence | MKKVFLQFGS GLGPMSRSLP IAEGLQREGY IVKYFGFENA KPYMNKIGIE ELSENFNIKD IKKGVQTPNW YCAEQFWEII GYGNMEWVEK KVEELIEYLK DFSPDFIISD LGILSCIAAR IMDIPLIAIT QSCYHPNIAF GRIRWWEEEQ NLKFTLTEKL NNYFKKKGVS QLNSFEEIFT GSLTIIPSFP EFDPINNPSE FNTYYVGPIL WDPLDMAKEE YIKLFNRDKN KPTIFCYTAR FYDNVGESGI IIFKTLLSAL KKFDANIIFS TGSDSDRKIA KEILNSYGID EEKFSIIDWV PMGIAYGNSD VVIHHGGHGS CLGQFLYEVP SLIIPTHTER EYNARICTNM GVSKFIKRED IEKADILAEI VEILTNSSFK ERLHFWHTKL NEYNFTGVNK VLELIQKL
|
| |