Gene Information |
Locus tag | Acel_1116 |
Symbol | |
ID | 4485779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1240080 |
End bp | 1241825 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639729891 |
Product | extracellular solute-binding protein |
Protein accession | YP_872874 |
Protein GI | 117928323 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
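The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Acel_1116.
gene_len, protein_len = cds_lengths(1240080, 1241825)
print(gene_len, protein_len)  # 1746 581
```

The result matches the Gene Length (1746 bp) and Protein Length (581 aa) fields listed in the record.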
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.749385 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0404913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGCA CACGCAGTGG ATTCCGGGTG GCTGCCGGAT TCGCCGTCCT GGCGTTGGCG GCGACAGCGT GCGGCAGCAA AGGGGGCGGC GGCAGCGCCT CTCAGGGCCG GATCAAAGGA GATCCCAACA CCATCAACTC GGTCGCCAAT CCGAAGAAGG GCGGTACGTT CACATACGCC CTGGAGAAGA CGATCGACAA CTGGAACATC CTGACGTCGG CGGGGAACAC CTTCGAGACG GCCGAGGTGA TGGATGCCAT CTACCCGTCG ACTTTCATTG CGCAGCCTGA CCTCTCCGTC AAAATGAATG ACGACCTGCT GCTTAGCGCC GAGGTCACCA ACCAGAACCC GGAAACGATC GTCTACAAGA TCAACCCGAA GGCTGTCTGG TCCGACGGGG TGCCGATCAA CGCCGATGAC TTCAAGTACG TCTGGCAAGC CAGCGCCAAC TGTAAGGACT GCGACGTCGC GTCGTCCGTC GGATACACCT CAATCCAGTC GATCGAGGGC TCCGGCCCGA ACAACCAGAC GGTGACCGTG ACGTTCAAGC AGCCGTTCTC TGACTGGAAG TCGCTGTTCG GCCCGCTGCT CCCGGCGCAT GCGGCGACGC CGAAACCGAC CGACGAGCAG ACGCTCGCGG ACAGCTGGAA CCACGGCTTC GCCGACAATC CACCGAAGAT TTCCGGTGGT CCGTATGTCA TCAGCTCCTA CCAGAAGGAC CAGAGCGTCA CGTTGGTGCC GAATCCGAAG TGGTACGGCA AGAAAGGCCC GTACCTGGAC AAGCTCGTGT TCCGAATGAT TACGGATGCG ACCCAGGAAC CGCCGGCGTT GCAGAACCAC GAGATTGACG CGATGTATCC CCAGCCTGAG GTTGACCTCA TCAAGACGCT GAACGGGATG AAGAACGTCG TCTACCAGGA GGATCTCGGC CTCATATTCG AACACTTCGA CTTCAACCTG GAGAACAAGG CGCTCACCCT GCCGGTGCGT CAGGCGCTGT TCACGGCGGT GAACCGCCAG CAAATCCTGG ACGCCACGGT GAAGCAGTTC GACCCTGACG TCACGGTGCT GAACAACCGG ATGTTCGTGC CAGGCCAGAA GGGCTACCAG GACAACGTCA CCAAGTACGG TCTTGGCACT GGGGACATCG CCAAGGCGAA GGACATCCTG ACCAAGGCCG GTTACAAGAT TGAGAACGGT AAGCTCATCC AGCCGGACGG CACGCCCTTC CCGGCTCTCA CGATGCAATA CACGGTCGGC AACCAGATCC GGCAGACGGA GTGCCAGCTC TTCGCCCAGG CGGCGAAGCA ACTGGGTGTG ACCGTCAACG TCCAGAGCAC CGACAGTCTG GGCAAGACAC TCACCAACGC GGACGCGCAG CACCACTTCG ACGTCGTCGT CTTCGCGTGG GTGGCGACAC CGTTCCCGGC GTCAGGTAAC GCCCCGTTGT ATGAGAGCAA CAAGGCGCGA GGCGGCGACT GGGGCGGCAA CTACGGTCAC TGGGTGAACT CGCAGGCCGA CCAACTGCTC CAAGATGCGA CGTCCAACCT GAATCCGGAC CAGGTCATCA AGGATCTCAA CGAGGCCGAC CAGCTCATCT CCCAGGACGC ATACACGTTG CCGCTCTACC AGAAGCCGAC GCTGCTGGCC TACTACAACA CGTGGGGCAA CATCCGAGAC AACGCGACCA GCGTCGGACC GCCGTACAAC GTGCAGGAAT GGGGTTTGAA GCCCTCGGCG GCCTGA
|
Protein sequence | MLGTRSGFRV AAGFAVLALA ATACGSKGGG GSASQGRIKG DPNTINSVAN PKKGGTFTYA LEKTIDNWNI LTSAGNTFET AEVMDAIYPS TFIAQPDLSV KMNDDLLLSA EVTNQNPETI VYKINPKAVW SDGVPINADD FKYVWQASAN CKDCDVASSV GYTSIQSIEG SGPNNQTVTV TFKQPFSDWK SLFGPLLPAH AATPKPTDEQ TLADSWNHGF ADNPPKISGG PYVISSYQKD QSVTLVPNPK WYGKKGPYLD KLVFRMITDA TQEPPALQNH EIDAMYPQPE VDLIKTLNGM KNVVYQEDLG LIFEHFDFNL ENKALTLPVR QALFTAVNRQ QILDATVKQF DPDVTVLNNR MFVPGQKGYQ DNVTKYGLGT GDIAKAKDIL TKAGYKIENG KLIQPDGTPF PALTMQYTVG NQIRQTECQL FAQAAKQLGV TVNVQSTDSL GKTLTNADAQ HHFDVVVFAW VATPFPASGN APLYESNKAR GGDWGGNYGH WVNSQADQLL QDATSNLNPD QVIKDLNEAD QLISQDAYTL PLYQKPTLLA YYNTWGNIRD NATSVGPPYN VQEWGLKPSA A
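The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code (the tables differ in permitted start codons, which this sketch does not handle). The function name `translate` and the compact table string are illustrative choices, not something taken from the record.

```python
from itertools import product

# Codons in T, C, A, G order paired with the amino acid assignments
# shared by translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is stripped first, so the 10-base blocks used in the
    record can be pasted in directly. Incomplete trailing codons are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence in the record.
print(translate("ATGCTCGGCA CACGCAGT"))  # MLGTRS
```

Applied to the full gene sequence, the same function would be expected to reproduce the protein sequence listed in the record, read without the spaces between 10-residue blocks.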