Gene Acel_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0354 
Symbol 
ID4485893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp365722 
End bp366795 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content61% 
IMG OID639729121 
Productputative simple sugar transport system substrate-binding protein 
Protein accessionYP_872114 
Protein GI117927563 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0988695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGCA CGACACGCAT CCTCACCTGG CGAAGACTCG CATTGCTCGC GATCGTCCCG 
CTGGTCGCGG CGGGCTGCAG CAGCAAGAGT CAGACGCCGC AGAAGCAGAG CACGAGTGCG
CCGGCAACGG AAACGGCCGC CGGTGCCGTG CCGACGGGCG CCCAGTTCTG CAAGGGCATG
AAGATCGTCT TCTTCCCGGG CGGAACTCCC GGCGGACCGT TTGAGACCGT CGTCTACAAC
GGCGCGAAGG CCGCCGCAGC CGCACTGGGT CCGTCGGTCA CGTATGAATG GTCCGATTGG
GATCCGAACA AGATGATTAC CCAATTCAAG CAGGCAATGG CCACGCATCC CGACGGTATC
GCCATCATGG GACACCCCGG TGATGCGGCC TTCGACCCGC TCATCGACCA AGCCGAGGCG
CAGGGCATCA CCGTGACGGT GATGAACACC GAATTGCCGC AGGCCGAAGC GAAATACCAG
TCACAGGGCA TGGGCTATGT CGGTGCGGTG CTCTACCAGG CAGGCGCCTC GCTCGCTTCG
GAGGCCATCA AGCGCGGGAA TCTAAAGGCG GGCGACCGGG TCTTCGTCTG GGGTCTCCTG
TCGCAACCCG GCCGCGGGGA GCGGACCAAG GGAATTGTCG ACACGTTGAA GAAAGCCGGC
CTCACCGTTG ACTACCTGGA AATCAACGAT GCGACCAACA AGGACCCGGC AGCCGGTGTC
TCCATCTTCA CCGGTTACGT GTCCAAGCAC CCCGACGTCA AAGCCATTTT CATTGACCAC
GGCAACCTGA CCGCCACAAT CCCGACCTAC ATGAAGGCGG CCAACCTCAA GCCCGGATCG
GTCTTCGCCG CGGGCTTCGA CATGTCGCCG GCGACCGTCA AGGGCATCCA GGACGGATAC
ATCAGCCTCG TCATTGATCA ACAGGAATGG CTGCAGGGAT ACTTCGGAAT TCTGCAGTTG
TGTCTCTCCC ACGTGTACGG CTTCAGCGGA TTGCGCATTG ACACCGGCGC AGGCTTTGAT
GACAAGTCGA ACATCGATAA ACTCGCTCCA CTGGTCGACA AGCAGATCCG CTGA
 
Protein sequence
MGSTTRILTW RRLALLAIVP LVAAGCSSKS QTPQKQSTSA PATETAAGAV PTGAQFCKGM 
KIVFFPGGTP GGPFETVVYN GAKAAAAALG PSVTYEWSDW DPNKMITQFK QAMATHPDGI
AIMGHPGDAA FDPLIDQAEA QGITVTVMNT ELPQAEAKYQ SQGMGYVGAV LYQAGASLAS
EAIKRGNLKA GDRVFVWGLL SQPGRGERTK GIVDTLKKAG LTVDYLEIND ATNKDPAAGV
SIFTGYVSKH PDVKAIFIDH GNLTATIPTY MKAANLKPGS VFAAGFDMSP ATVKGIQDGY
ISLVIDQQEW LQGYFGILQL CLSHVYGFSG LRIDTGAGFD DKSNIDKLAP LVDKQIR