Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0459 |
Symbol | |
ID | 7407537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 521866 |
End bp | 524238 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714847 |
Product | glycosyltransferase 36 |
Protein accession | YP_002572364 |
Protein GI | 222528482 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATG GATATTTTGA TTCTCAAAAC AGAGAGTATG TTATAACAAA CCCCAAAACA CCAACTTCAT GGGTAAATTA TTTAGGAACA AGTGATTATT GCCTTATAAT CTCCAACAAT GCCTCTGGTT ATTCGTTTTA TAAATCTCCA AAACTTGGAA GAGTTACTCG TTTTAGATTC AATAGTATTC CAATGGACAG ACCTGGCAGG TACGTATATA TAAAAGATGA AAAAACCAAA GATTTTTGGT CAATAAGCTG GCAACCTGTT GGAAAGCCTC TTGAGAAGTT CCTGAGCATC TGTCGACATG GTCTTGGATA TTCAATATTT GAAAGTAAAT ATAGCAATAT AACCTCATCT TTAAAAATCT TTGTCCCAGT AGACAAACCA ATTGAAATCT GGGAAGTTAA AATCAAGAAT GAGTCAGATG AGAAAAAAGA ACTATCGATA TTTACTTATA CAGAGTTCTG TCTATGGAAT TCTATGCTTG ACATGATGGA TTTTCAGTAT ATTCTTTATA CCTGCAGAAT GGGTTACAAC AAAGAAGATG AAATTGTAGA TTATTCTATC AAACTCTGGA GTCCTTATGA ACCAAAAGCA TTTTTCACAT GCACAAATAA AAAGATTGAA AGTTTTGATA CAGATAGAGA TGTATTTATT GGTCCATATA ACAGCGAGGC TAATCCAGAA GCAATTCAAA ACGGCAGGTG TTTTGGCTCA ATTGCAATAG GTGGAAATCC ATGTGCTGCA ACACAGGTAA AAATTGAACT TCAGCCCGGT CAAGAAGAAT ACTTAGTGTT TGTACTGGGA ATAGGAGATG CATACAAGGA AGGAAAAGAA TATAAAAAAC TATTTGCATC AAAAGAAAAT ATTCAAAAAG AATTTGAAAA AGTACAAAAG TATTGGAATG AACGACTTAG CAAGTTTAAG TTCTCAACGC CAAGCGAAAA GATGAATTTG ATGTTAAATA TATGGAATCA ATATCAGTGC CATACAACAT TCAACTGGTC AAGGTCTGCA TCGTTCATTG AAGCTGGTGG AAGAGACGGG CTTGGCTTTA GAGATTCTTC ACAGGACATT CTGGGCGTTG CACATTCAAT CCCCCAAGAG GTAAGAAAAA GACTTATTGA ACTTTTGAGG GCTCAGCTGT CTGAAGGATA TGCAATGCAT CATTTCCAGC CTCTTACATG GACTCAGGGA GAACATAATA TACCACCACG TGAGAGAATT TATTCAGACG ACCACTTATG GCTTTTGATT GCTGTGCCAC ACTATATAAA AGAAACAGGA GACTTTTCCA TCTTAGATGA AGTTGTTGAA TATGCGGACA AGTCAAGTGC TTCTGTTTAT GAGCATTTAA AACAAGCTTT GGAGTTTTCA TGGAATCACA GAGGAAAACA TGGACTTTTG CTTGGTCTTG CTGCTGACTG GAATGACTGT ATCAACCTCA AAAACGGTGG CGAGAGTACA TGGTCAACCC AGCTTTATTA CAAAGCTTTA TCTGAGTTTA TAGAACTTGC TGAGTATATT GGTAAGACTG ATGATGCTGA AAAGTATAAA GCTTATAGAA ATGAAATCAA AAAGGCAATG GAAGAGTATA CATGGGATGG CGAATGGTTT GTAAGAGGGT ATTTGGCAAG TGGTAAAAAA CTTGGTTCAA AAGAAAGTGA GCAAACCAAG ATATTCTTAA ATTCACAGTC TTGGGCAGTG TTTTCTGGGG CTTTTATTGA TGAAAAAGGC AAAATGGCAA TGGATAGTGT TAAAAAGTAT CTTGCAACAG AGCATGGTTG TGTTAAGAAC TGGCCAGCTT ATGTTGATTA TATCATAGAG GTTGGGGCTG TAACTTCTTT CCCACCAGGA TTAAAAGAAA ATGCTGCTAT TTTCTGTCAT GCTAATACAT GGGTAATTAT TGCAGAGGCT GTACTTGGAA GAGGCGATTA TGCATTTGAA TACTATATGT CGTTCCTACC TGCAAACAAA AATGATATTG CTGAAATCTA TACCACAGAA CCTTATGTTT ATTCCCAGTT TATCACCGGA AAAGAACATC CATATTATTT TGGCCGTGCG CGAAATCCAT GGTTGACAGG TACTGCAACA TGGGCATTTG TTGCAGCAAC ACAGTATATC CTAGGGGTTC GCCCACACTA CAAAGGTCTT ATTATTGACC CATGTATACC AAATCAGTGG GACAGTTTTG AAGTTGAGAG AGTTTTCAGA GGAAGAAAAC TTTCTATTAA GGTTTCAAAT CCAGACCATA TTTCAAAAGG TGTTAAAAAG ATATTGGTAA ATGGAAAAGA AATTGTGGGT AATCTGATTC CAGTAGAATT GCTTGATGAG GAAAATGTAG TTGAAGTTGT GATGGGAAAA TAA
|
Protein sequence | MNYGYFDSQN REYVITNPKT PTSWVNYLGT SDYCLIISNN ASGYSFYKSP KLGRVTRFRF NSIPMDRPGR YVYIKDEKTK DFWSISWQPV GKPLEKFLSI CRHGLGYSIF ESKYSNITSS LKIFVPVDKP IEIWEVKIKN ESDEKKELSI FTYTEFCLWN SMLDMMDFQY ILYTCRMGYN KEDEIVDYSI KLWSPYEPKA FFTCTNKKIE SFDTDRDVFI GPYNSEANPE AIQNGRCFGS IAIGGNPCAA TQVKIELQPG QEEYLVFVLG IGDAYKEGKE YKKLFASKEN IQKEFEKVQK YWNERLSKFK FSTPSEKMNL MLNIWNQYQC HTTFNWSRSA SFIEAGGRDG LGFRDSSQDI LGVAHSIPQE VRKRLIELLR AQLSEGYAMH HFQPLTWTQG EHNIPPRERI YSDDHLWLLI AVPHYIKETG DFSILDEVVE YADKSSASVY EHLKQALEFS WNHRGKHGLL LGLAADWNDC INLKNGGEST WSTQLYYKAL SEFIELAEYI GKTDDAEKYK AYRNEIKKAM EEYTWDGEWF VRGYLASGKK LGSKESEQTK IFLNSQSWAV FSGAFIDEKG KMAMDSVKKY LATEHGCVKN WPAYVDYIIE VGAVTSFPPG LKENAAIFCH ANTWVIIAEA VLGRGDYAFE YYMSFLPANK NDIAEIYTTE PYVYSQFITG KEHPYYFGRA RNPWLTGTAT WAFVAATQYI LGVRPHYKGL IIDPCIPNQW DSFEVERVFR GRKLSIKVSN PDHISKGVKK ILVNGKEIVG NLIPVELLDE ENVVEVVMGK
|
| |