Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2057 |
Symbol | |
ID | 7408270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2172903 |
End bp | 2175221 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716424 |
Product | alpha-xylosidase YicI |
Protein accession | YP_002573907 |
Protein GI | 222530025 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA CAGACGGTTT TTGGCGTGTA AAAGATGGAA TAAGATTATA TCATCCAGCC CATATATATG ATTACGAAAT TTCGAAAGAC TCAACCACAA TTATTGCGCC AGCTCAATTT ATTACAAACA GAGGACAAAC CTTACAAGGT CCTGTTTTCA CTATACGTTT TTCTTCACCT TTTGAAGATG TTATAAGAGT GCAAATTTGG CACTACAAGG GTCAAAAAGA TAAAAAGCCA TATTTTGAAT TTTATAAAGA AGAAGGATAT TGCCCTTTGA TAGAAGTTTT TTCGGAGAGT ATAGTAATAA CAAGTGGAAA GCTAAAAGCT GTTATTAATA GAAAAGGTGA ATGGAAAGTA GCATATTACT ACGAAGATAA ATATCTAACA AGAAATGGTT ATAAATATCT TGGTTACGCA ATCATGCCTG ATAATACTAC TTACATGAGG GAACAGCTTT CTTTGAGTGT TGGAGAGTGT GTTTACGGGT TGGGCGAGAG GTTTACTCCT TTTGTTAAAA ACGGACAAAT GATTGATATG TGGAACGAAG ATGGTGGTAC GAACTCTGAT CTTGCATACA AAAACATTCC TTTTTACATT ACAAACCGTG GATATGGTGT TTTTGTAAAT GACCCAGGAC GAGTGTCATT TGAAGTAGCC ACAGAGAATG TCGAGAGAGT TCAGTTTTCT GTGGAAGGTG AATATTTGGA ATATTTCATA ATTGGCGGTA GCAACATGAA AAATGTTTTA GAAAATTACA CAAAACTCAC AGGTCGGCCA CAGCTTCCTC CAGCATGGTC TTTTGGACTT TGGCTTACAA CCTCTTTTAC AACAAGCTAT GATGAAAAGA CTGTTACAAA CTTTATAGAT GGAATGATTG AAAGGGATAT TCCACTTCAT GTGTTTCATT TTGACTGTTT CTGGATGAAA GATATGCACT GGGTTGATTT TGAGTGGGAC AGAAGGGTTT TTCTTGAACC ATCACAGATG CTAAAGCGTC TAAAAGAAAA GGGAGTAAAA ATATGTGTTT GGATAAATCC CTATATATCT CAGCTTTCTA AACTGTTTGA CGAAGGCAAA GAAAAAGGGT ATTTTTTGAA AAAGCCAAAT GGTGATGTAT GGCAGACAGA TGATTGGCAG CCTGGTATGG CAATTGTTGA TTTTACAAAC CCTGAGGCGT GCAGGTGGTA TTCAGAAAAG CTCAAAGAGC TAATTAAAAT GGGAGTTGAC TGTTTTAAGA CAGATTTTGG TGAAAGAATT CCAACAGATG TTGTTTATTT TGATGGTTCA GACCCTCAAA AGATGCACAA TTACTACACC TATCTTTACA ACAAGACAGT ATATGAGACG CTTCAAGAAA CGTTTGGCAA GGGAAATGCA GTTGTTTTTG CAAGGTCAGC GACAGTAGGA AGCCAGAAAT TTCCTGTGCA CTGGGGCGGA GACTGTTTAG CTTCATATGA GTCCATGGCA GAGACACTCA GGGGTGGCCT TTCACTTTCA CTTTGCGGGT TTGGTTTTTG GAGTCATGAC ATAGGGGGGT TTGAGAGTAC AGCAACACCA GATCTTTACA AGAGATGGGT AGCATTTGGA CTTTTATCTT CTCACAGCAG ACTTCATGGA AATTCTGCCT ATAAAGTTCC ATGGCTTTAT GACGAAGAGG CGGTTGACGT ACTTAGGTTC TTTACAAAAT TAAAATGTAA ACTTATGCCA TACATCTTTT CAGCGGCTGT AGAGGCAACA GAAAGAGGGA TTCCAGTCTT GAGGCCAATG GTCTTAGAGT TTCCGGACGA TCCTGCTTGT CTTTATCTTG ACAGGCAATA TATGCTTGGA GACAGTCTTT TGGTTGCACC AATCTTTTCA GAAGATGGAT ATGTTGAGTA TTATGTGCCA GAAGGGATTT GGACAAATAT CCTGACAGGT GAAAAAGTTG AGGGTGGCAA GTGGAGAAAA GAAAAGCACG GCTATTTTAG CCTTCCACTT TTAGCAAGGC CAAATACTGT AATCCCAATG GGAAGTGTAG ACACAAAGCC CGATTATGAT TATGCTGATA ATGTGGCGAT GAATATTTAT CATATTGATA GTGGGGAGAC ATTGAAATCT CAGATAAGAA ATGTAGAAGG TAAGGCAGAA ATTGAAATTG AAGTGAGAAG ACACGGAGAT GTTATTTATG TCACAAATAT AAGAGATTCA AAAAAAACAT GGAGTCTATA TTTTGATTCT CTAAGGATAG AAGTTATATC TGGAGCAAGT GTAAAGGTCG ACAGTAATGG TAGTAAGATA AATGTATATT CTGACACAGC TGTGTTAAAA GTGATTTAA
|
Protein sequence | MKFTDGFWRV KDGIRLYHPA HIYDYEISKD STTIIAPAQF ITNRGQTLQG PVFTIRFSSP FEDVIRVQIW HYKGQKDKKP YFEFYKEEGY CPLIEVFSES IVITSGKLKA VINRKGEWKV AYYYEDKYLT RNGYKYLGYA IMPDNTTYMR EQLSLSVGEC VYGLGERFTP FVKNGQMIDM WNEDGGTNSD LAYKNIPFYI TNRGYGVFVN DPGRVSFEVA TENVERVQFS VEGEYLEYFI IGGSNMKNVL ENYTKLTGRP QLPPAWSFGL WLTTSFTTSY DEKTVTNFID GMIERDIPLH VFHFDCFWMK DMHWVDFEWD RRVFLEPSQM LKRLKEKGVK ICVWINPYIS QLSKLFDEGK EKGYFLKKPN GDVWQTDDWQ PGMAIVDFTN PEACRWYSEK LKELIKMGVD CFKTDFGERI PTDVVYFDGS DPQKMHNYYT YLYNKTVYET LQETFGKGNA VVFARSATVG SQKFPVHWGG DCLASYESMA ETLRGGLSLS LCGFGFWSHD IGGFESTATP DLYKRWVAFG LLSSHSRLHG NSAYKVPWLY DEEAVDVLRF FTKLKCKLMP YIFSAAVEAT ERGIPVLRPM VLEFPDDPAC LYLDRQYMLG DSLLVAPIFS EDGYVEYYVP EGIWTNILTG EKVEGGKWRK EKHGYFSLPL LARPNTVIPM GSVDTKPDYD YADNVAMNIY HIDSGETLKS QIRNVEGKAE IEIEVRRHGD VIYVTNIRDS KKTWSLYFDS LRIEVISGAS VKVDSNGSKI NVYSDTAVLK VI
|
| |