Gene Cthe_1273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1273 
Symbol 
ID4809778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1548830 
End bp1550275 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content37% 
IMG OID640106696 
Productalpha-L-arabinofuranosidase B 
Protein accessionYP_001037698 
Protein GI125973788 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACACA AAGGCATTGT ATTAAAGCTG ACAAAAAGCA AAGCCATAAT AAGTACCAAT 
GATTTTCAAT GCTACTATAT CAAAAGAAGC CCTACAATTT ATGTAGGAAA GGAAGTTGAA
TTTACAAATA AAGATATTGT GACAAAGAAG TCTGTTTTAA TAAAACCGGC TTTAAGCGTT
GCCTGTTTTA TATTGTTGAT AGCTTGTGTT TTAAGCCTTT CAAAAATAAT CAATAATATT
AGCCCTAAAG TTTTTGCCTA TATCAGCGTT GATATAAATC CCAGCTTTGA AATTGAAATC
GATGACATGG GAAATGTTTT GAATTTGCTT CCGTTAAATG ATGATGCAAA GGTTATTGCC
GATAAATTGG AAATTGATAA AATCAACGTT TCCAATGCCA TTGATATTAT AATAAATGAA
GCAATAAAAA GCAATGTTAT AAATGAAAAT GAAAAGGACT TTATATTAGT TTCAAGCACC
CTGAATATTA AAAAAGAGGA GAACAGCCAA CAGTATCAGA GTGAAAAAGA AAAACTTGAT
ATTATCATAA ATTCCCTGAA AGACAGCATA GAAAAAAGCG GAAAAGCGGA TGTTTACATT
GTCCAGGCTG ACGTGAATGA AAGGGAAGCC GCACGAAGTA AAGGAATATC TACAGGAAGA
TACGTTTTAT ATAACAAGTA TAAAGATCTG GAAAACGATC TGTCTTTGGA AGATGCCAAA
GATGCTGATG TCAATGTGTT AATAAAAAGT ATGTTGGATG TGGCATCAGA AGAAAGAAAT
CCGGAAGAAT CACCAAAAAT GACCCCAACT CCAACACCGA CACATACAGC AACACATACA
CCGACAGATG CACCAACGCC GAAACCGGCA AATACACCAA CATCAACACC GGCAGCAAAA
CCTTCACCAA AAACGGCATC GAACTCAGCC TCGACATCAA CACCTGCCCC GAAACCTACA
TCAACACCGA CACCAACATT GATGCCAACA CCTACTCCAA CACCGACACC TGCTGATAAA
ATCGCATATG GTCAGTTTAT GAAATTTGAA TCCAGCAACT ACCGCGGATA TTATATACGG
GTTAAATCGT TTTCCGGCCG TATCGACCCA TATGTGAATC CTGTGGAAGA TTCCATGTTC
AAGATAGTTC CCGGTCTTGC AGACCCAAGC TGTATTTCTT TCGAGTCGAA GACTTATCCG
GGATATTACC TCAAACATGA AAACTTCAGA GTTATTCTTA AAAAATATGA AGATACCGAT
TTATTCAGAG AAGATGCAAC TTTCAGAGTT GTACCGGGTT GGGCGGATGA AAACATGATT
TCTTTCCAGT CATATAATTA TCCTTACAGA TATATCAGGC ACAGGGATTT TGAGCTTTAC
ATAGAAAACA TAAAAACCGA TCTTGACAGA AAGGATGCAA CATTTATAGG GATTAAAGTT
GATTAG
 
Protein sequence
MKHKGIVLKL TKSKAIISTN DFQCYYIKRS PTIYVGKEVE FTNKDIVTKK SVLIKPALSV 
ACFILLIACV LSLSKIINNI SPKVFAYISV DINPSFEIEI DDMGNVLNLL PLNDDAKVIA
DKLEIDKINV SNAIDIIINE AIKSNVINEN EKDFILVSST LNIKKEENSQ QYQSEKEKLD
IIINSLKDSI EKSGKADVYI VQADVNEREA ARSKGISTGR YVLYNKYKDL ENDLSLEDAK
DADVNVLIKS MLDVASEERN PEESPKMTPT PTPTHTATHT PTDAPTPKPA NTPTSTPAAK
PSPKTASNSA STSTPAPKPT STPTPTLMPT PTPTPTPADK IAYGQFMKFE SSNYRGYYIR
VKSFSGRIDP YVNPVEDSMF KIVPGLADPS CISFESKTYP GYYLKHENFR VILKKYEDTD
LFREDATFRV VPGWADENMI SFQSYNYPYR YIRHRDFELY IENIKTDLDR KDATFIGIKV
D