Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0603 |
Symbol | |
ID | 7406944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 681588 |
End bp | 682904 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643714986 |
Product | xylose isomerase |
Protein accession | YP_002572502 |
Protein GI | 222528620 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2115] Xylose isomerase |
TIGRFAM ID | [TIGR02630] xylose isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTACT TCAAAGACAT TCCAGAAGTA AAATATGAAG GACCACAGTC GGACAACCCA TTTGCTTTCA AGTACTACAA TCCTGACGAA ATCATTGACG GCAAGCCTTT GAAAGACCAC CTTCGTTTTG CTATTGCTTA CTGGCACACA TTCTGTGCAA CAGGAAGCGA TCCGTTTGGA CAACCTACAA TTGTTCGTCC TTGGGATAAG TTTTCAAACC GAATGGACAA CGCAAAAGCA AGGGTTGAGG CAGCATTTGA ATTTTTTGAA CTGTTAGATG TACCATTTTT CTGCTTCCAT GACAGAGATA TTGCACCTGA AGGGGAAAAT TTAAAAGAGT CAAATAAGAA TTTGGATGAG ATTGTTTCTT TAATAAAAGA GTATTTGAAA ACCAGCAAGA CAAAAGTATT ATGGGGAACA GCAAACCTAT TTTCACATCC GCGATATGTT CATGGTGCTG CAACATCCTG CAATGCCGAT GTTTTTGCAT ATGCAGCAGC GCAAGTGAAA AAGGCGTTAG AGGTTACAAA AGAGCTTGGC GGCGAAAACT ATGTGTTCTG GGGCGGAAGG GAAGGTTATG AGACACTTCT AAATACAGAT ATGGGATTGG AACTTGATAA CCTTGCAAGA TTTTTGCATA TGGCGGTTGA GTATGCAAAG GAAATAGGTT TTGACGGACA GTTTTTAATA GAACCAAAAC CAAAAGAGCC AACTAAGCAT CAGTACGATT TTGATTCGGC TCATGTTTAT GGATTTTTGA AAAAGTATGA TCTTGACAAA TACTTCAAGC TCAACATAGA GGTAAACCAT GCAACCTTAG CAGGACATGA TTTCCACCAT GAGTTGAGAT TTGCGCGAAT AAACAACATG CTTGGTTCAA TTGACGCTAA CATGGGCGAT TTGCTTTTGG GCTGGGATAC AGATCAGTTC CCAACAGATG TAAGACTTAC TACACTTGCT ATGTATGAGG TTATTAAAGC TGGTGGTTTT GACAAAGGTG GACTTAACTT TGACGCAAAG GTAAGAAGAG GTTCTTTTGA GCTTGAAGAC TTGGTCATTG GTCACATTGC TGGCATGGAT GCTTTTGCTA AAGGCTTCAA GATTGCGTAT AAGCTTGTTA AAGATGGCGT ATTTGATAAA TTTATAGATG AGAGATACAA GAGCTACAAA GAAGGAATCG GTGCTAAGAT TGTAAGCGGT GAAGCAAACT TCAAGATGTT AGAGGAATAT GCTCTGTCTC TTGACAAGAT AGAAAATAAA TCTGGCAAGC AAGAGCTTCT TGAGATGATT TTGAACAAAT ATATGTTCAG CGAATAA
|
Protein sequence | MKYFKDIPEV KYEGPQSDNP FAFKYYNPDE IIDGKPLKDH LRFAIAYWHT FCATGSDPFG QPTIVRPWDK FSNRMDNAKA RVEAAFEFFE LLDVPFFCFH DRDIAPEGEN LKESNKNLDE IVSLIKEYLK TSKTKVLWGT ANLFSHPRYV HGAATSCNAD VFAYAAAQVK KALEVTKELG GENYVFWGGR EGYETLLNTD MGLELDNLAR FLHMAVEYAK EIGFDGQFLI EPKPKEPTKH QYDFDSAHVY GFLKKYDLDK YFKLNIEVNH ATLAGHDFHH ELRFARINNM LGSIDANMGD LLLGWDTDQF PTDVRLTTLA MYEVIKAGGF DKGGLNFDAK VRRGSFELED LVIGHIAGMD AFAKGFKIAY KLVKDGVFDK FIDERYKSYK EGIGAKIVSG EANFKMLEEY ALSLDKIENK SGKQELLEMI LNKYMFSE
|
| |