Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1273 |
Symbol | |
ID | 4809778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1548830 |
End bp | 1550275 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106696 |
Product | alpha-L-arabinofuranosidase B |
Protein accession | YP_001037698 |
Protein GI | 125973788 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACACA AAGGCATTGT ATTAAAGCTG ACAAAAAGCA AAGCCATAAT AAGTACCAAT GATTTTCAAT GCTACTATAT CAAAAGAAGC CCTACAATTT ATGTAGGAAA GGAAGTTGAA TTTACAAATA AAGATATTGT GACAAAGAAG TCTGTTTTAA TAAAACCGGC TTTAAGCGTT GCCTGTTTTA TATTGTTGAT AGCTTGTGTT TTAAGCCTTT CAAAAATAAT CAATAATATT AGCCCTAAAG TTTTTGCCTA TATCAGCGTT GATATAAATC CCAGCTTTGA AATTGAAATC GATGACATGG GAAATGTTTT GAATTTGCTT CCGTTAAATG ATGATGCAAA GGTTATTGCC GATAAATTGG AAATTGATAA AATCAACGTT TCCAATGCCA TTGATATTAT AATAAATGAA GCAATAAAAA GCAATGTTAT AAATGAAAAT GAAAAGGACT TTATATTAGT TTCAAGCACC CTGAATATTA AAAAAGAGGA GAACAGCCAA CAGTATCAGA GTGAAAAAGA AAAACTTGAT ATTATCATAA ATTCCCTGAA AGACAGCATA GAAAAAAGCG GAAAAGCGGA TGTTTACATT GTCCAGGCTG ACGTGAATGA AAGGGAAGCC GCACGAAGTA AAGGAATATC TACAGGAAGA TACGTTTTAT ATAACAAGTA TAAAGATCTG GAAAACGATC TGTCTTTGGA AGATGCCAAA GATGCTGATG TCAATGTGTT AATAAAAAGT ATGTTGGATG TGGCATCAGA AGAAAGAAAT CCGGAAGAAT CACCAAAAAT GACCCCAACT CCAACACCGA CACATACAGC AACACATACA CCGACAGATG CACCAACGCC GAAACCGGCA AATACACCAA CATCAACACC GGCAGCAAAA CCTTCACCAA AAACGGCATC GAACTCAGCC TCGACATCAA CACCTGCCCC GAAACCTACA TCAACACCGA CACCAACATT GATGCCAACA CCTACTCCAA CACCGACACC TGCTGATAAA ATCGCATATG GTCAGTTTAT GAAATTTGAA TCCAGCAACT ACCGCGGATA TTATATACGG GTTAAATCGT TTTCCGGCCG TATCGACCCA TATGTGAATC CTGTGGAAGA TTCCATGTTC AAGATAGTTC CCGGTCTTGC AGACCCAAGC TGTATTTCTT TCGAGTCGAA GACTTATCCG GGATATTACC TCAAACATGA AAACTTCAGA GTTATTCTTA AAAAATATGA AGATACCGAT TTATTCAGAG AAGATGCAAC TTTCAGAGTT GTACCGGGTT GGGCGGATGA AAACATGATT TCTTTCCAGT CATATAATTA TCCTTACAGA TATATCAGGC ACAGGGATTT TGAGCTTTAC ATAGAAAACA TAAAAACCGA TCTTGACAGA AAGGATGCAA CATTTATAGG GATTAAAGTT GATTAG
|
Protein sequence | MKHKGIVLKL TKSKAIISTN DFQCYYIKRS PTIYVGKEVE FTNKDIVTKK SVLIKPALSV ACFILLIACV LSLSKIINNI SPKVFAYISV DINPSFEIEI DDMGNVLNLL PLNDDAKVIA DKLEIDKINV SNAIDIIINE AIKSNVINEN EKDFILVSST LNIKKEENSQ QYQSEKEKLD IIINSLKDSI EKSGKADVYI VQADVNEREA ARSKGISTGR YVLYNKYKDL ENDLSLEDAK DADVNVLIKS MLDVASEERN PEESPKMTPT PTPTHTATHT PTDAPTPKPA NTPTSTPAAK PSPKTASNSA STSTPAPKPT STPTPTLMPT PTPTPTPADK IAYGQFMKFE SSNYRGYYIR VKSFSGRIDP YVNPVEDSMF KIVPGLADPS CISFESKTYP GYYLKHENFR VILKKYEDTD LFREDATFRV VPGWADENMI SFQSYNYPYR YIRHRDFELY IENIKTDLDR KDATFIGIKV D
|
| |