Gene Athe_0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0603 
Symbol 
ID7406944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp681588 
End bp682904 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content39% 
IMG OID643714986 
Productxylose isomerase 
Protein accessionYP_002572502 
Protein GI222528620 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2115] Xylose isomerase 
TIGRFAM ID[TIGR02630] xylose isomerase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTACT TCAAAGACAT TCCAGAAGTA AAATATGAAG GACCACAGTC GGACAACCCA 
TTTGCTTTCA AGTACTACAA TCCTGACGAA ATCATTGACG GCAAGCCTTT GAAAGACCAC
CTTCGTTTTG CTATTGCTTA CTGGCACACA TTCTGTGCAA CAGGAAGCGA TCCGTTTGGA
CAACCTACAA TTGTTCGTCC TTGGGATAAG TTTTCAAACC GAATGGACAA CGCAAAAGCA
AGGGTTGAGG CAGCATTTGA ATTTTTTGAA CTGTTAGATG TACCATTTTT CTGCTTCCAT
GACAGAGATA TTGCACCTGA AGGGGAAAAT TTAAAAGAGT CAAATAAGAA TTTGGATGAG
ATTGTTTCTT TAATAAAAGA GTATTTGAAA ACCAGCAAGA CAAAAGTATT ATGGGGAACA
GCAAACCTAT TTTCACATCC GCGATATGTT CATGGTGCTG CAACATCCTG CAATGCCGAT
GTTTTTGCAT ATGCAGCAGC GCAAGTGAAA AAGGCGTTAG AGGTTACAAA AGAGCTTGGC
GGCGAAAACT ATGTGTTCTG GGGCGGAAGG GAAGGTTATG AGACACTTCT AAATACAGAT
ATGGGATTGG AACTTGATAA CCTTGCAAGA TTTTTGCATA TGGCGGTTGA GTATGCAAAG
GAAATAGGTT TTGACGGACA GTTTTTAATA GAACCAAAAC CAAAAGAGCC AACTAAGCAT
CAGTACGATT TTGATTCGGC TCATGTTTAT GGATTTTTGA AAAAGTATGA TCTTGACAAA
TACTTCAAGC TCAACATAGA GGTAAACCAT GCAACCTTAG CAGGACATGA TTTCCACCAT
GAGTTGAGAT TTGCGCGAAT AAACAACATG CTTGGTTCAA TTGACGCTAA CATGGGCGAT
TTGCTTTTGG GCTGGGATAC AGATCAGTTC CCAACAGATG TAAGACTTAC TACACTTGCT
ATGTATGAGG TTATTAAAGC TGGTGGTTTT GACAAAGGTG GACTTAACTT TGACGCAAAG
GTAAGAAGAG GTTCTTTTGA GCTTGAAGAC TTGGTCATTG GTCACATTGC TGGCATGGAT
GCTTTTGCTA AAGGCTTCAA GATTGCGTAT AAGCTTGTTA AAGATGGCGT ATTTGATAAA
TTTATAGATG AGAGATACAA GAGCTACAAA GAAGGAATCG GTGCTAAGAT TGTAAGCGGT
GAAGCAAACT TCAAGATGTT AGAGGAATAT GCTCTGTCTC TTGACAAGAT AGAAAATAAA
TCTGGCAAGC AAGAGCTTCT TGAGATGATT TTGAACAAAT ATATGTTCAG CGAATAA
 
Protein sequence
MKYFKDIPEV KYEGPQSDNP FAFKYYNPDE IIDGKPLKDH LRFAIAYWHT FCATGSDPFG 
QPTIVRPWDK FSNRMDNAKA RVEAAFEFFE LLDVPFFCFH DRDIAPEGEN LKESNKNLDE
IVSLIKEYLK TSKTKVLWGT ANLFSHPRYV HGAATSCNAD VFAYAAAQVK KALEVTKELG
GENYVFWGGR EGYETLLNTD MGLELDNLAR FLHMAVEYAK EIGFDGQFLI EPKPKEPTKH
QYDFDSAHVY GFLKKYDLDK YFKLNIEVNH ATLAGHDFHH ELRFARINNM LGSIDANMGD
LLLGWDTDQF PTDVRLTTLA MYEVIKAGGF DKGGLNFDAK VRRGSFELED LVIGHIAGMD
AFAKGFKIAY KLVKDGVFDK FIDERYKSYK EGIGAKIVSG EANFKMLEEY ALSLDKIENK
SGKQELLEMI LNKYMFSE