Gene Acel_1116 details


Gene Information

Locus tag: Acel_1116
Symbol:
ID: 4485779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1240080
End bp: 1241825
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 61%
IMG OID: 639729891
Product: extracellular solute-binding protein
Protein accession: YP_872874
Protein GI: 117928323
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
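
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon (the record does not state either convention explicitly). The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1240080
END_BP = 1241825
GENE_LENGTH_BP = 1746
PROTEIN_LENGTH_AA = 581


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: 1241825 - 1240080 + 1 = 1746 bp, and 1746 / 3 - 1 = 581 aa.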


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.749385
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0404913
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGGCA CACGCAGTGG ATTCCGGGTG GCTGCCGGAT TCGCCGTCCT GGCGTTGGCG 
GCGACAGCGT GCGGCAGCAA AGGGGGCGGC GGCAGCGCCT CTCAGGGCCG GATCAAAGGA
GATCCCAACA CCATCAACTC GGTCGCCAAT CCGAAGAAGG GCGGTACGTT CACATACGCC
CTGGAGAAGA CGATCGACAA CTGGAACATC CTGACGTCGG CGGGGAACAC CTTCGAGACG
GCCGAGGTGA TGGATGCCAT CTACCCGTCG ACTTTCATTG CGCAGCCTGA CCTCTCCGTC
AAAATGAATG ACGACCTGCT GCTTAGCGCC GAGGTCACCA ACCAGAACCC GGAAACGATC
GTCTACAAGA TCAACCCGAA GGCTGTCTGG TCCGACGGGG TGCCGATCAA CGCCGATGAC
TTCAAGTACG TCTGGCAAGC CAGCGCCAAC TGTAAGGACT GCGACGTCGC GTCGTCCGTC
GGATACACCT CAATCCAGTC GATCGAGGGC TCCGGCCCGA ACAACCAGAC GGTGACCGTG
ACGTTCAAGC AGCCGTTCTC TGACTGGAAG TCGCTGTTCG GCCCGCTGCT CCCGGCGCAT
GCGGCGACGC CGAAACCGAC CGACGAGCAG ACGCTCGCGG ACAGCTGGAA CCACGGCTTC
GCCGACAATC CACCGAAGAT TTCCGGTGGT CCGTATGTCA TCAGCTCCTA CCAGAAGGAC
CAGAGCGTCA CGTTGGTGCC GAATCCGAAG TGGTACGGCA AGAAAGGCCC GTACCTGGAC
AAGCTCGTGT TCCGAATGAT TACGGATGCG ACCCAGGAAC CGCCGGCGTT GCAGAACCAC
GAGATTGACG CGATGTATCC CCAGCCTGAG GTTGACCTCA TCAAGACGCT GAACGGGATG
AAGAACGTCG TCTACCAGGA GGATCTCGGC CTCATATTCG AACACTTCGA CTTCAACCTG
GAGAACAAGG CGCTCACCCT GCCGGTGCGT CAGGCGCTGT TCACGGCGGT GAACCGCCAG
CAAATCCTGG ACGCCACGGT GAAGCAGTTC GACCCTGACG TCACGGTGCT GAACAACCGG
ATGTTCGTGC CAGGCCAGAA GGGCTACCAG GACAACGTCA CCAAGTACGG TCTTGGCACT
GGGGACATCG CCAAGGCGAA GGACATCCTG ACCAAGGCCG GTTACAAGAT TGAGAACGGT
AAGCTCATCC AGCCGGACGG CACGCCCTTC CCGGCTCTCA CGATGCAATA CACGGTCGGC
AACCAGATCC GGCAGACGGA GTGCCAGCTC TTCGCCCAGG CGGCGAAGCA ACTGGGTGTG
ACCGTCAACG TCCAGAGCAC CGACAGTCTG GGCAAGACAC TCACCAACGC GGACGCGCAG
CACCACTTCG ACGTCGTCGT CTTCGCGTGG GTGGCGACAC CGTTCCCGGC GTCAGGTAAC
GCCCCGTTGT ATGAGAGCAA CAAGGCGCGA GGCGGCGACT GGGGCGGCAA CTACGGTCAC
TGGGTGAACT CGCAGGCCGA CCAACTGCTC CAAGATGCGA CGTCCAACCT GAATCCGGAC
CAGGTCATCA AGGATCTCAA CGAGGCCGAC CAGCTCATCT CCCAGGACGC ATACACGTTG
CCGCTCTACC AGAAGCCGAC GCTGCTGGCC TACTACAACA CGTGGGGCAA CATCCGAGAC
AACGCGACCA GCGTCGGACC GCCGTACAAC GTGCAGGAAT GGGGTTTGAA GCCCTCGGCG
GCCTGA
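
The GC content reported in the Gene Information section could be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the fraction of G and C bases among all bases and that the spaces and line breaks in the listing are formatting only. `gc_fraction` is a hypothetical helper name, not part of any tool that produced this record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G/C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # Only the first row of the gene sequence is used here as a demo;
    # paste the full listing to compare against the reported value.
    first_row = "ATGCTCGGCA CACGCAGTGG ATTCCGGGTG GCTGCCGGAT TCGCCGTCCT GGCGTTGGCG"
    print(f"{gc_fraction(first_row):.2%}")
```

Stripping whitespace first lets the sequence be pasted in directly, 10-base groups and all.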
 
Protein sequence
MLGTRSGFRV AAGFAVLALA ATACGSKGGG GSASQGRIKG DPNTINSVAN PKKGGTFTYA 
LEKTIDNWNI LTSAGNTFET AEVMDAIYPS TFIAQPDLSV KMNDDLLLSA EVTNQNPETI
VYKINPKAVW SDGVPINADD FKYVWQASAN CKDCDVASSV GYTSIQSIEG SGPNNQTVTV
TFKQPFSDWK SLFGPLLPAH AATPKPTDEQ TLADSWNHGF ADNPPKISGG PYVISSYQKD
QSVTLVPNPK WYGKKGPYLD KLVFRMITDA TQEPPALQNH EIDAMYPQPE VDLIKTLNGM
KNVVYQEDLG LIFEHFDFNL ENKALTLPVR QALFTAVNRQ QILDATVKQF DPDVTVLNNR
MFVPGQKGYQ DNVTKYGLGT GDIAKAKDIL TKAGYKIENG KLIQPDGTPF PALTMQYTVG
NQIRQTECQL FAQAAKQLGV TVNVQSTDSL GKTLTNADAQ HHFDVVVFAW VATPFPASGN
APLYESNKAR GGDWGGNYGH WVNSQADQLL QDATSNLNPD QVIKDLNEAD QLISQDAYTL
PLYQKPTLLA YYNTWGNIRD NATSVGPPYN VQEWGLKPSA A
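
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code for all 64 codons (the tables differ in their permitted start codons). `translate` and the constant names are illustrative.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCTCGGCA CACGCAGTGG ATTCCGGGTG"))
```

For the first 30 bases this yields MLGTRSGFRV, matching the first block of the listed protein sequence. The final nine bases (GCG GCC TGA) translate to AA followed by a stop, matching the end of the protein.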