Gene Acel_0678 details


Gene Information

Locus tag: Acel_0678
Symbol:
ID: 4486234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 732511
End bp: 734181
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 62%
IMG OID: 639729446
Product: trehalose synthase
Protein accession: YP_872437
Protein GI: 117927886
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
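
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions are consistent with the values listed. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(732511, 734181)
print(length, protein_length_aa(length))  # 1671 556
```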


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.026654
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.804097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCGT TGCCACACGA ACCGGCACCG ATCCCACCCG TGGATCCCCT GTGGTACAAG 
CGCGCCGTCT TCTACGAAGT GCTGGTCCGC GGCTTCTACG ATTCGAACAA CGACGGCACG
GGGGATCTCC GCGGCCTCAT CGCGAAGCTC GATTACCTGC AGTGGCTTGG GGTGGACTGC
ATCTGGTTGC TGCCGATTTA CGCCTCGCCA ATGCGGGACG GCGGTTACGA CATCTCGGAT
TACTTCTCGA TCCTCCCCGA GTACGGCGAT CTCGGCGACT TCGTCCAGCT CGTCGATGAG
ACCCACCGGC GCGGCATGCG CATCATCGCC GACCTGGTGA TGAATCACAC CAGTGACCAG
CATCCGTGGT TCCAGGCGTC CCGCATCGAC CCGGACGGGC CGTACGGTGA CTTTTACGTG
TGGTCCGACA CCGACACGAA ATACAAGGAT GCACGGATCA TCTTCGTAGA CACGGAGAAG
TCCAATTGGA CCTTCGACCC GGTCCGCGGT CAGTACTACT GGCACCGTTT CTTCAGTCAT
CAGCCGGACC TGAATTACGA GAACCCCGCG GTCCAGGAAG CGATGCTGGC CGTGCTGCGC
TTCTGGCTGG ACCTCGGCAT CGACGGCTTC CGGCTGGACG CCGTCCCTTA CCTTTTCGAG
GAAGAGGGGA CGAACTGCGA GAACCTGCCG AAGACGCACG AGTACCTCAA GCGGATCCGC
AAAGAGGTCG ATCAGCTCTA CCCGGACAAG GTCCTCCTGG CGGAGGCCAA CCAGTGGCCG
GCGGACGTCG TCCAGTACTT CGGTGACGGC GACGAATGCC ACATGGCGTT TCACTTCCCG
CTGATGCCGC GGATCTTCAT GGCGGTGCGG CGGGAGCAGC GCTTTCCGAT TTCGGAGATC
CTGGCGCAGA CGCCGCGGAT TCCGGAGAAC TGCCAGTGGG GCATTTTCCT GCGCAACCAC
GATGAGCTCA CCTTGGAGAT GGTGACCGAC GAAGAGCGGG ACTATATGTA CCGCGAGTAC
GCGCAGGATC CGCGGATGAA GGCCAACATC GGTATCCGGC GCCGCCTCGC ACCACTGCTC
GACAACTCCC GCGACCAAAT GGAACTGTTC ACCGCGCTCC TGCTGTCGCT CCCCGGCTCC
CCGGTCATGT ACTACGGCGA CGAGATCGGC ATGGGCGACA ACATTTGGCT CGGTGACCGG
GACAGTGTGC GGACGCCGAT GCAGTGGACC CCGGACCGCA ACGCCGGTTT CTCCCAGTGC
GATCCGGGCA GACTCTACCT GCCGGTCATC ATGGACGCCG TCTACGGGTA CCAAGCGCTC
AACGTCGAGG CGCAGATGCG CAGCCCGCAC TCGCTGCTCC ACTGGGTCCG CAGAATGATC
GACATCAGGA AGCGGCACCC GACCTTCGGC TGCGGCAGTT ACGAGGAGCT TGGCGCATCG
AATCCGAGTA TTCTCGCCTT TGTCCGCGAA TTCGGCGACG ACCGGGTCTT GTGCGTCAAC
AATCTCTCGC GCTTTCCGCA ACCCGTCGAG CTGGATCTGC GTCGGTATGA GGGCGTCGTG
CCGATCGAGA TGACCGGCGG TGTACCGTTT CCCCGGATCG GCGAGTTGCC GTATCTGCTG
ACGCTGCCGG GACACGGCTT CTACTGGTTC ATGCTCCCGA CCCATCCGTG A
 
Protein sequence
MSALPHEPAP IPPVDPLWYK RAVFYEVLVR GFYDSNNDGT GDLRGLIAKL DYLQWLGVDC 
IWLLPIYASP MRDGGYDISD YFSILPEYGD LGDFVQLVDE THRRGMRIIA DLVMNHTSDQ
HPWFQASRID PDGPYGDFYV WSDTDTKYKD ARIIFVDTEK SNWTFDPVRG QYYWHRFFSH
QPDLNYENPA VQEAMLAVLR FWLDLGIDGF RLDAVPYLFE EEGTNCENLP KTHEYLKRIR
KEVDQLYPDK VLLAEANQWP ADVVQYFGDG DECHMAFHFP LMPRIFMAVR REQRFPISEI
LAQTPRIPEN CQWGIFLRNH DELTLEMVTD EERDYMYREY AQDPRMKANI GIRRRLAPLL
DNSRDQMELF TALLLSLPGS PVMYYGDEIG MGDNIWLGDR DSVRTPMQWT PDRNAGFSQC
DPGRLYLPVI MDAVYGYQAL NVEAQMRSPH SLLHWVRRMI DIRKRHPTFG CGSYEELGAS
NPSILAFVRE FGDDRVLCVN NLSRFPQPVE LDLRRYEGVV PIEMTGGVPF PRIGELPYLL
TLPGHGFYWF MLPTHP