Gene Information |
Locus tag | Cthe_2548 |
Symbol | |
ID | 4809304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3016032 |
End bp | 3017543 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107963 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_001038942 |
Protein GI | 125975032 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
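The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3016032, 3017543)
print(length)                           # 1512
print(expected_protein_length(length))  # 503
```

Both values agree with the Gene Length (1512 bp) and Protein Length (503 aa) fields in the record.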
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG CCAGAATGAC CGTTGACAAA GATTATAAAA TTGCCGAGAT CGACAAGCGT ATCTACGGCT CTTTTGTAGA ACATTTGGGA AGGGCCGTAT ATGACGGATT GTATCAGCCT GGAAATTCCA AATCGGACGA AGACGGTTTT CGTAAAGATG TTATTGAACT GGTGAAAGAA TTGAATGTGC CAATTATCCG TTATCCGGGA GGCAATTTTG TGTCCAATTA TTTCTGGGAA GATGGAGTCG GGCCGGTAGA GGATAGGCCC AGACGCTTGG ATTTGGCTTG GAAAAGTATA GAACCCAACC AGGTTGGGAT TAATGAATTT GCAAAATGGT GCAAAAAAGT AAATGCTGAG ATAATGATGG CAGTGAACCT TGGCACCAGA GGGATTTCGG ATGCATGTAA TTTGCTGGAA TACTGTAATC ATCCGGGTGG TTCAAAATAT AGTGACATGA GAATAAAACA TGGAGTAAAG GAACCTCACA ACATAAAGGT TTGGTGTCTT GGCAATGAAA TGGACGGTCC GTGGCAGGTC GGGCATAAAA CAATGGATGA GTACGGCCGG ATTGCTGAAG AGACTGCAAG GGCCATGAAA ATGATTGACC CTTCAATTGA GTTGGTTGCC TGCGGAAGTT CCTCCAAAGA CATGCCCACT TTTCCCCAAT GGGAAGCAAC AGTTCTGGAT TATGCTTATG ATTATGTGGA TTATATATCA TTGCATCAGT ATTATGGGAA TAAAGAAAAT GACACAGCTG ATTTTTTGGC AAAATCCGAT GATTTGGATG ATTTTATACG TTCTGTCATT GCCACTTGTG ATTATATAAA AGCAAAGAAA AGAAGCAAGA AGGATATATA CCTAAGTTTT GATGAATGGA ATGTATGGTA TCACTCAAAT AATGAAGATG CAAACATTAT GCAGAACGAA CCATGGAGAA TAGCGCCTCC TTTACTGGAG GATATATATA CGTTTGAAGA TGCGTTACTT GTCGGTTTGA TGCTAATTAC CCTTATGAAA CACGCCGATA GAATAAAAAT TGCCTGCCTG GCACAGTTGA TTAATGTAAT TGCGCCTATT GTGACTGAAA GAAATGGCGG GGCGGCTTGG AGGCAGACCA TATTTTATCC GTTTATGCAT GCTTCAAAAT ATGGCAGAGG AATAGTACTT CAACCGGTGA TTAACAGTCC GCTTCATGAT ACTTCAAAAC ATGAAGATGT TACCGATATT GAAAGTGTTG CAATTTACAA TGAAGAAAAA GAAGAAGTCA CAATCTTTGC AGTTAACAGA AATATTCATG AGGATATTGT TCTTGTATCG GATGTCAGGG GTATGAAAGA TTATCGTCTG CTGGAGCATA TTGTCTTGGA GCATCAAGAT CTAAAAATCC GTAATAGTGT AAATGGTGAG GAGGTATATC CGAAAAATTC GGATAAATCC TCATTTGATG ATGGTATTTT AACGAGTATG CTTCGCAGAG CCTCTTGGAA TGTAATTCGG ATAGGTAAAT AA
|
Protein sequence | MKKARMTVDK DYKIAEIDKR IYGSFVEHLG RAVYDGLYQP GNSKSDEDGF RKDVIELVKE LNVPIIRYPG GNFVSNYFWE DGVGPVEDRP RRLDLAWKSI EPNQVGINEF AKWCKKVNAE IMMAVNLGTR GISDACNLLE YCNHPGGSKY SDMRIKHGVK EPHNIKVWCL GNEMDGPWQV GHKTMDEYGR IAEETARAMK MIDPSIELVA CGSSSKDMPT FPQWEATVLD YAYDYVDYIS LHQYYGNKEN DTADFLAKSD DLDDFIRSVI ATCDYIKAKK RSKKDIYLSF DEWNVWYHSN NEDANIMQNE PWRIAPPLLE DIYTFEDALL VGLMLITLMK HADRIKIACL AQLINVIAPI VTERNGGAAW RQTIFYPFMH ASKYGRGIVL QPVINSPLHD TSKHEDVTDI ESVAIYNEEK EEVTIFAVNR NIHEDIVLVS DVRGMKDYRL LEHIVLEHQD LKIRNSVNGE EVYPKNSDKS SFDDGILTSM LRRASWNVIR IGK
|
| |
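The protein sequence can be reproduced from the gene sequence using the codon assignments of translation table 11, which are the same as those of the standard code (the tables differ only in which codons may act as start codons). The following is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the 10-base spacing in the record is simply stripped, and only the first and last few codons of the gene are used as input to keep the example short.

```python
# Codon table built from the NCBI-style amino-acid string, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_codons = "ATGAAAAAAG CCAGAATGAC CGTTGACAAA"  # start of the gene sequence
last_codons = "ATAGGTAAAT AA"                      # end of the gene sequence
print(translate(first_codons))  # MKKARMTVDK
print(translate(last_codons))   # IGK
```

The two outputs match the first ten and last three residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field, assuming that field refers to the gene rather than the whole replicon.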