Gene Information |
Locus tag | Acel_0873 |
Symbol | |
ID | 4485651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 964542 |
End bp | 966047 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639729647 |
Product | L-arabinose isomerase |
Protein accession | YP_872632 |
Protein GI | 117928081 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
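The coordinate and length fields in the table above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 964542
end_bp = 966047
reported_gene_length = 1506    # "Gene Length" field, in bp
reported_protein_length = 501  # "Protein Length" field, in aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

# Compare the derived values with the values reported in the record.
print(length_bp == reported_gene_length, length_aa == reported_protein_length)
```

Under these assumptions the derived values agree with the reported 1506 bp and 501 aa.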
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC TGCCCTATCC GGAATATGAG TGTTGGTTTC TCACCGGCAG TCAGCATTTG TACGGCGAGG ACGTTCTTTC CGCGGTGGCC CGGCAATCCC AGGCCATCGT GGAGGCGCTC AATGCCGCAG GACTACCGGT GCGGTTGGTG TGGAAACCGG TCCTGACCGA CGCCACAGGC ATCCGGCGCA TGTGCAGCGA GGCAAGCGCA ACCGACGCGT GCATCGGCGT CATCGCGTGG ATGCACACGT TCTCCCCGGC AAAAGCCTGG ATCAATGGAT TGCTGGCGCT GCGCAAGCCG CTGCTTCACT TGCATACGCA GGCGAATCTC ACCCTTCCGT GGTCGACCAT CGACATGGAC TTCATGAATC TCAACCAGGC CGCCCACGGC GATCGCGAGT TCGGCTATGT GGCCGCTCGG TTGGCCATTC CGCGGAAAAT CGTCACCGGT CATTTCTCGG ACCCCGACGT GGTGCGGGAC ATTGCCGCAT GGCAGCGCGC CGCCGCCGGA TTGGCGGATC TGCGGTCCAC GCGGCTCGTC CGGTTCGGCG ACACGATGCG AAACGTTGCG GTCACCGACG GCGACCGGGT CGAGGCGCAA ATCCGGCTGG GCAGTGCCAT CGAGACGTAC GGCGTTCACG ACCTCGGGGT ACGGGTCGAT GCCGTCGCCG AGAGCGACGT GGACGCCCTC GTTGACCGGT ACCTTGCCGA CTATGACATG GCACCGGAAC TCACCATCGG GGGCGCGCGC CACGAGTCAC TGCGGTACGC AGCGAAACTC GAACTTGCCT TGCGGTCCTT CCTCCACGAC GGACGATTCA CTGCCTTCAC CACGAATTTC GAGGACCTCG GACCGCTCCG CCAGCTTCCC GGCATCGCAG TTCAACGGCT GATGGCCGAC GGCTTCGGGT TCGGCGCCGA AGGTGATTGG AAAACCGCCC TCCTGGTCCG CGCGGTCAAG ACGATGAGCC GCGGCCTGCC GGGCGGCACC TCATTCATGG AGGATTACAC CTACCACCTG GAACCCAGCG GCCGGCTCGT CCTCGGTGCG CACATGCTCG AAGTCTGTCC GACACTGACG TCTGCAACGC CGCGCTGTGA GATTCACCCG CTGCTCATGG GCGGACGCGA GGACCCGGTG CGGCTCGTCT TCACCGCGGA TCCAGCTCCG GCCGTCATCG TGGGACTGTG CGACATGGGT GACCGACTCC GCCTTGTCGC GAACACCGCC GACCTGGTTG CCCCGCCTGA ACCATTGCCG CGGCTGCCGG TGGCTCGCGC CGTGTGGCAG CCGCACCCGG AGCTGAAAAC CGCCGCTACG GCGTGGATCG CCGCCGGCGG ACCGCATCAC ACCGCGCTGT CAACCGCCGT CAGCGCAAGG GAAATCCGGG ACTTCGCCCG GATGGCCGGC CTCGAGCTTG TCCTCATTGA CGAGCACACC GCACTTGACG CCGCCCTCGA CCGGCTCTGG GCAATCGAAC AAACCCGCGC ATCGCGGCCG TGGTGA
|
Protein sequence | MTDLPYPEYE CWFLTGSQHL YGEDVLSAVA RQSQAIVEAL NAAGLPVRLV WKPVLTDATG IRRMCSEASA TDACIGVIAW MHTFSPAKAW INGLLALRKP LLHLHTQANL TLPWSTIDMD FMNLNQAAHG DREFGYVAAR LAIPRKIVTG HFSDPDVVRD IAAWQRAAAG LADLRSTRLV RFGDTMRNVA VTDGDRVEAQ IRLGSAIETY GVHDLGVRVD AVAESDVDAL VDRYLADYDM APELTIGGAR HESLRYAAKL ELALRSFLHD GRFTAFTTNF EDLGPLRQLP GIAVQRLMAD GFGFGAEGDW KTALLVRAVK TMSRGLPGGT SFMEDYTYHL EPSGRLVLGA HMLEVCPTLT SATPRCEIHP LLMGGREDPV RLVFTADPAP AVIVGLCDMG DRLRLVANTA DLVAPPEPLP RLPVARAVWQ PHPELKTAAT AWIAAGGPHH TALSTAVSAR EIRDFARMAG LELVLIDEHT ALDAALDRLW AIEQTRASRP W
|
| |
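The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a record could be processed, the code below strips the spacing, computes a GC fraction, and translates a coding sequence up to its first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; the sketch does not handle the alternative start codons that table 11 permits, which is acceptable here because this gene begins with ATG. Only a short excerpt of the gene sequence is used.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
excerpt = "ATGACCGACC TGCCCTATCC GGAATATGAG"
print(translate(excerpt))
print(round(gc_fraction(excerpt), 3))
```

Passing the full "Gene sequence" string to `translate` and `gc_fraction` would allow comparison with the "Protein sequence" and "GC content" fields of the record.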