Gene Acel_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2064 
Symbol 
ID4484581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2340704 
End bp2341927 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content63% 
IMG OID639730864 
Productxylose isomerase 
Protein accessionYP_873822 
Protein GI117929271 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2115] Xylose isomerase 
TIGRFAM ID[TIGR02631] xylose isomerase, Arthrobacter type 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTCA CCACTGCATC GTCGAAAACT ATCGAAGTTG CCACGCCGTC GAAGGAAGAC 
CGTTTCAGCT TTGGTCTGTG GACCGTCGGC TGGCAGGCGC GTGATCCCTT CGGTGAAGCG
ACACGTCCGC CGCTGGATCC GGTGGAGGCT GTTCACAAGC TCGCCGAATT AGGCGCCTAC
GGGGTCACCT TCCATGACGA CGACCTCGTT CCGTTCGGCA GCAGCGATGC GGAGCGCGCT
CGGCTTATCG ACCGTTTCAA GAAGGCGCTT GCCGACACCG GCCTGGTCGT CCCGATGATG
ACGACGAATC TCTTCACCCA TCCGATCTTC AAGGATGGTG CGTTCACCGC CAATGACCGG
TCCATTCGGC GGTATGCGAT CCGTAAGGTG ATGCGCAATC TGGATCTGGC GGCCGAACTC
GGCGCCCGTA CGTACGTTTT CTGGGGTGGC CGCGAGGGAA GCGAAATCGA CGCCGCAAAG
GACATTCGTG CTGCGCTTGA CCGGTACCGC GAGGCGATTG ACACCCTCGC GCAATACGTC
AAAGACCAAG GCTACGGCAT CCGGTTCGCC CTCGAACCGA AACCCAATGA GCCGCGGGGT
GATATTTTCC TCCCGACGAT CGGGCATGCG CTCGCGTTCA TCAACTCCCT GGAGCATTCG
GACATCGTCG GGCTGAATCC TGAAGTCGGT CACGAGCAGA TGTCCAATTT GAATTTCGTG
CACGGGATCG CACAGGCCCT CTGGCACGGG AAACTCTTCC ACATCGACCT CAACGGCCAG
CACGGGCCGA AGTACGACCA GGACTTGGTC TTCGGTCACG GTGACCTGCT CAGCGCGTTC
TTCCTCGTCG ACCTGCTGGA GAACGGCTTT CCCGGCGGCG GCCCGGTGTA CGACGGTCCG
CGGCACTTCG ACTACAAGCC GATGCGCACG GAGGACATCG ACGGCGTCTG GGCGTCCGCC
GCGGCGAACA TGCGCACGTA CCTCCTCCTC AAGCAGCGGG CAAAGGCGTT TCGAGCGGAT
CCGGAGGTGC AGGCCGCACT CACCGCAAGT CGTGTTCCCG AGTTGGCCGT CCCCACCCTC
GGCGAGGGCG AGTCGTACGC CGACCTGCTG GCCGACCGGT CCGCGTGGGA GGAATTCGAC
GTCGATCGAG CCGCCAACCA AGGCTACGGG TACGCCCGTC TCGATCAGCT CGCCATCGAG
CACCTCCTCG GCGCACGCGG CTGA
 
Protein sequence
MSLTTASSKT IEVATPSKED RFSFGLWTVG WQARDPFGEA TRPPLDPVEA VHKLAELGAY 
GVTFHDDDLV PFGSSDAERA RLIDRFKKAL ADTGLVVPMM TTNLFTHPIF KDGAFTANDR
SIRRYAIRKV MRNLDLAAEL GARTYVFWGG REGSEIDAAK DIRAALDRYR EAIDTLAQYV
KDQGYGIRFA LEPKPNEPRG DIFLPTIGHA LAFINSLEHS DIVGLNPEVG HEQMSNLNFV
HGIAQALWHG KLFHIDLNGQ HGPKYDQDLV FGHGDLLSAF FLVDLLENGF PGGGPVYDGP
RHFDYKPMRT EDIDGVWASA AANMRTYLLL KQRAKAFRAD PEVQAALTAS RVPELAVPTL
GEGESYADLL ADRSAWEEFD VDRAANQGYG YARLDQLAIE HLLGARG