Gene Information |
Locus tag | Athe_2354 |
Symbol | |
ID | 7407773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2497018 |
End bp | 2499333 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716718 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_002574197 |
Protein GI | 222530315 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| |
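The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above (Athe_2354).
length = gene_length_bp(2497018, 2499333)
print(length)                     # 2316
print(protein_length_aa(length))  # 771
```

Strand does not affect this arithmetic: a gene on the minus strand still spans the same number of bases between its start and end coordinates.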
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAATTG AAAAAAGGGT AAACCAGCTT TTGCAGCAGA TGACAGTTGA AGAAAAGGTG TATCAGCTCA CAAGTGTGCT TGTAAAAGAT ATTTTGGAAA ACAACCAATT TTCTGAGGAA AAAGCAAAGA AAGTCATTCC TCATGGTATT GGCCAGATTA CAAGGGTTGC AGGTGCGAGC AATTTCACAC CTCAACAGGC TTTAGAGGCA GCAAACCAAA TCCAAAAGTT TTTGATTGAA AACACAAGGC TCAAAATTCC TGCGATAATC CATGAAGAAT CTTGTTCTGG TTTTATGGCA AGCAAAGCAA CAGTATTTCC ACAGAGCATT GGTGTTGCCT GCACTTTTGA CAATGAACTT GTAAAAGAGA TGGCAAAGGT TATAAGGCTG CAGATGAAAG CTGTAGGTGC GCATCAGGCT TTGGCACCAC TTATTGATGT TGCAAGGGAT GCACGATGGG GAAGGGTTGA AGAGACATTT GGTGAAGACC CATATCTTGT TGCAAATATG GCAGTAAGTT ATGTTGAAGG AATTCAGGGC AAGAACTTTG AAGAAAAGAT TATTGCAACA GGCAAACATT TTGTTGGTTA TGCAATGTCA GAAGGTGGGA TGAACTGGGC ACCTGTTCAT ATTCCTGAAA GAGAGCTAAG AGAAGTGTAT CTTTATCCAT TTGAGGTCGC TGTTAAAGTG GCAGGATTAA AATCAATTAT GCCAGCTTAC CATGAAATTG ACGGAATTCC TTGTCATGCA AACAGAAAGC TTTTGACCGA AATTGCAAGG AATGAATGGA GATTCGATGG AATATTTGTG TCTGACTACA GTGGTGTTAA AAATATCTTA GACTATCATA AGTCGGTTAA AACTTATGAA GAGGCAGCGT ATATTTCTCT TTGGGCAGGA CTTGATATTG AACTTCCAAG AATAGAGTGT TTTACTGAGA AGTTTATTGA GGCATTAAAA GAAGGCAAGT TTGATATGGC AGTTGTTGAT GCTGCTGTGA AGAGAGTTTT AGAGATGAAG TTCAGGCTCG GACTTTTTGA CAATCCATTT GTAAAAACAG AAAATATTTT AGAACTTTTT GACAATGAGG AGCAAAGAAG CCTTGCAAGA AAAGTTGCCC AAGAGTCTAT GGTTCTTTTG AAAAACGACG GTATATTGCC ACTTAAAGAA AAAGAACTCA AGAAAGTTGC TGTGATAGGA CCTAATGCCA ACTCAGTTAG AAATCTTCTT GGTGATTATT CTTACCCAGC ACACATATCA ACAACAGAAA TGTTCTTTAT GAAAGAAGAG GTTGACCTCG GCGATGAAGA TGCATTTGTC AAAAAGGTTG TAAATATTAA ATCTGTATAT GAAGTTATAA AAGAAAGAAT AGGTAAGCAT ACAGAGGTAG TCTATGCAAA AGGTTGTGAT GTAAACTCTC AAGATAAGTC CAGCTTTGAA GAAGCTAAAA AAGCTGCCCA GGGCGCAGAT GTTGTTATAG TTGTAGTTGG TGACAAGGCA GGGTTAAAAC TTGACTGCAC ATCTGGTGAG TCAAGAGATA GAGCAAGCTT AAAACTTCCA GGTGTTCAGG AAGAGCTGAT AGAAGAAATT TCAAAAGTAA ATCAAAACAT TGTTGTTATT CTTGTAAACG GTCGACCTGT TGCGCTCGAA AATTTCTGGC AAAAGTCCAA AGCTATTCTT GAAGCTTGGT TCCCGGGCGA AGAAGGTGCA GAGGCGATTG CAGATGTTAT CTTTGGAAAG TACAATCCGG GTGGAAAACT TGCAATTTCA TTCCCAAGAG ATGTTGGGCA AGTACCGGTA 
TACTATAGTC ACAAACCATC CGGTGGAAAA TCATGCTGGC ATGGGGACTA TGTTGAAATG TCTTCAAAGC CATTTTTACC ATTTGGTTAC GGTCTTTCGT ATACAACTTT TGAATACAAA AATCTTACCA TTGAAAAAGA AAAAATTACA ATGGATGAGA GCATAAAAAT CTCGGTTGAG ATAGAAAATA CAGGAAACTA TGAAGGAGAT GAGGTAGTTC AGCTGTATAC AAGAAAAGAA GAGTTTTTAG TAACAAGACC TGTAAAAGAG CTAAAGGCAT ACAAGAGAGT TCACTTAAAA CCTGGTGAAA AGAAGAAAGT TGTATTTGAA ATCTTCCCAG ACCAGTTTGC ATACTATGAT TATGATATGA ACAGGGTAAT CTCACCCGGC ACTGTTGAGG TCATGGTAGG GGCATCTTCA GAAGACATAA AGTTTACAGG GACATTTGAG ATTGTTGGGG AAAAGAAAGA TGCAAAAGAA ATCAAAAATT ATCTTAGCCA TGCATGGTGT GAATAA
|
Protein sequence | MSIEKRVNQL LQQMTVEEKV YQLTSVLVKD ILENNQFSEE KAKKVIPHGI GQITRVAGAS NFTPQQALEA ANQIQKFLIE NTRLKIPAII HEESCSGFMA SKATVFPQSI GVACTFDNEL VKEMAKVIRL QMKAVGAHQA LAPLIDVARD ARWGRVEETF GEDPYLVANM AVSYVEGIQG KNFEEKIIAT GKHFVGYAMS EGGMNWAPVH IPERELREVY LYPFEVAVKV AGLKSIMPAY HEIDGIPCHA NRKLLTEIAR NEWRFDGIFV SDYSGVKNIL DYHKSVKTYE EAAYISLWAG LDIELPRIEC FTEKFIEALK EGKFDMAVVD AAVKRVLEMK FRLGLFDNPF VKTENILELF DNEEQRSLAR KVAQESMVLL KNDGILPLKE KELKKVAVIG PNANSVRNLL GDYSYPAHIS TTEMFFMKEE VDLGDEDAFV KKVVNIKSVY EVIKERIGKH TEVVYAKGCD VNSQDKSSFE EAKKAAQGAD VVIVVVGDKA GLKLDCTSGE SRDRASLKLP GVQEELIEEI SKVNQNIVVI LVNGRPVALE NFWQKSKAIL EAWFPGEEGA EAIADVIFGK YNPGGKLAIS FPRDVGQVPV YYSHKPSGGK SCWHGDYVEM SSKPFLPFGY GLSYTTFEYK NLTIEKEKIT MDESIKISVE IENTGNYEGD EVVQLYTRKE EFLVTRPVKE LKAYKRVHLK PGEKKKVVFE IFPDQFAYYD YDMNRVISPG TVEVMVGASS EDIKFTGTFE IVGEKKDAKE IKNYLSHAWC E
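The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the record. It assumes that table 11 shares its amino-acid assignments with the standard code, and it simplifies by treating whatever codon comes first as a valid initiator; the helper names `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same assignments as the standard code and differs
# only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and upper-case the result."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS given 5'->3' on the coding strand, stopping at the first stop codon.

    Simplification: the first codon is always emitted as 'M', on the assumption
    that it is a valid initiator (for example ATG or GTG).
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
prefix = "GTGTCAATTG AAAAAAGGGT AAACCAGCTT"
print(translate_cds(prefix))  # MSIEKRVNQL
```

Applying `gc_percent` to the full gene sequence would be one way to check the GC content field; the result is not reproduced here.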