Gene Acel_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1051 
Symbol 
ID4484833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1157936 
End bp1159051 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content63% 
IMG OID639729826 
ProductABC-type sugar transport system periplasmic component-like 
Protein accessionYP_872810 
Protein GI117928259 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.28211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0668071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCAC AGCCCACCCG CCGACGAGGG ACCCCCACCC TCCTCACTGC TGCCATGGCC 
CTGGCTGCCG TCGCTCTCGC CGCGTGCAGC TCGGGAACGA GCGGCACAGC GCGCACGAGT
ACGACACCGA ACGCGTCCGC TTCGTCAAGC TCTACCGCCG GCACCCCGGC ATCGGCGTCA
GCCGCCGGCA GCGCCAACAC GGGCAAAACA CTGCAAATCG CATATCTATC CTTTGCGGTC
GCCAACAGCT ACGACGCCCC GATGCTCGCC GCGGCGCAGG CGGTGGCCTC AGGGGAGAAT
GCGAAAGTCA CCGTATTCGA CGCCAACAAC AATCCGCAGA CCCAGTTCGC CCAGTTCCAG
AATGCCATCA CGGCCGGAAA GTACGACGGC ATTCTCATCC AACCGATTCT TGCGACAAAT
CTCGTCGATC TGGTCAAGCA AGCCGTCGCC AAAGGCATCA AAGTGGTTGA CATCGACCAG
ATCCTCGGAC CCGACTTCCA TACGTACGAT CCGCAGGTGC CTGGGATGTC AGCCGCCGTC
GTCGACCGCA TCCCTGACAT CGGCCGCCTT CTCGGCGAAC AGGTCGTCGC CGCGTGTCAG
TCCGTCAACG CCAATCCGTG CAACGTGGGC TATCTCTACG ACATCAAAGC GTCCACGCTT
GACGGTGTCA TCCACGACGA CTTCATGAAA GTCGTTCAGG GCACGCCCTC CATCAAAGTT
GTCGCGGAAG GGCAGGACTT CTTCACACCT GCAGGCGGCC TCAAGGCCGT CCAGGACATG
CTGCAAGCCC ATCCCGACCT GACGCTCATC GTGGGTTCCG ACCAGGGCAT CGAGGGTGCA
GTGCAGGCGC TGGCGGCGGC AAAGAAGACG GGAAAGGTGC TTCTCGTGGG CTTTGGTGCC
AGTGCCGCAG GCATCCAGGG CGTCGCCTCC GGCCAGTGGT TCTCGACAGT GGCCCAGGCT
CCGGCCAGCA CCGGACGGCT TGGAATGCAG GCGCTCATCA AAGCGATCCG CGACGGTCAG
GACAGCGGCG GGATCAACCC GACGGCCGGA CTGCCCAACA ACGGAATCGT CACCAAGGCG
ACGGCAAGTG AGTTCACCGC CGAGTGGCCG GGGTGA
 
Protein sequence
MSSQPTRRRG TPTLLTAAMA LAAVALAACS SGTSGTARTS TTPNASASSS STAGTPASAS 
AAGSANTGKT LQIAYLSFAV ANSYDAPMLA AAQAVASGEN AKVTVFDANN NPQTQFAQFQ
NAITAGKYDG ILIQPILATN LVDLVKQAVA KGIKVVDIDQ ILGPDFHTYD PQVPGMSAAV
VDRIPDIGRL LGEQVVAACQ SVNANPCNVG YLYDIKASTL DGVIHDDFMK VVQGTPSIKV
VAEGQDFFTP AGGLKAVQDM LQAHPDLTLI VGSDQGIEGA VQALAAAKKT GKVLLVGFGA
SAAGIQGVAS GQWFSTVAQA PASTGRLGMQ ALIKAIRDGQ DSGGINPTAG LPNNGIVTKA
TASEFTAEWP G