Gene Information |
Locus tag | Acel_0434 |
Symbol | |
ID | 4485669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 464708 |
End bp | 466066 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639729201 |
Product | extracellular solute-binding protein |
Protein accession | YP_872194 |
Protein GI | 117927643 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.396307 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAGGT CCACCTTGCC CAGGAAATCC GTCGTTGCCG CGCTTGTCGT GACCGCGCTC GCTGTCGCGG CATGCTCCAG CGGCACAAAG AGCCAGACCA ATACCAGCAA CTCCGCGACC TCGAGCGGCG CCGGCGGCGC GTCATCGAAC GCGAACACCG CACCGGTCAC GCTGACGCTC TGGCACAACT ACGGCACGGA ACAGAACGCC ACAGCCACGC AGAACCTCGT CAACGCCTAC GAAAAGCTTC ATCCCAACGT GACGATCAAA GTGGTGAGCC AACCGGCGGA CAACTACTTC TCCCTGCTGC AGGCTGCAGC GATCTCCAAG ACCGGGCCTG ACATCGCCGT CATGTGGACC GGCTTATTCA CCCTGCAGTA CAAGGATTTC CTCACGCCGC TCAAGGGACT CGTCCCCGAT GCGGCGCTCA CCAAGCAGCA GGGTCTTGAA TGGATGACGG ACAATTTCAA CCCGAACGGC AATCCCTACG TCATGCCGCT CGAGACCCAG TTCTATATCG GTTTCTATAA CAAGGCCGCA TTCGCGAAAG CCGGCATCAC CCAGGTTCCG CAGACCTGGA ATGAGCTTTA CGCGGCATGC GACAAGCTCA AGGCGGCCGG TTACACGCCG CTGGTGTACG GGAACGGCGG GCAGAGTCTC GGTGCGGAGT TCTACCCGTG GTACGATGCG AGCTACATCG AGATCGGCCT TCTCCCCGTC GATCAGTGGC GCAACCTCTA CGACGGGAAA ACGCCGTGGA ATTCTCCGGA AGAAATCGCC GCCTTCAGCA AGTGGGCTGC GCTCCACGAC AAGGGTTGCA CCAATCCGGA CGTGCTGACC AAGACGAACA ACATCGATGA TTTCGTCAAC GGGAAGGCGG CGATGATTAT CGACGGCACC TGGGACACCC AGAAGTTCAC CGACGCGATG AAGGACAACG TCGCCGCGTT CGTCCCGCCG TTCTCAGACA CACCGATCAA GGGTGTCGTC AACTACCCCG GGGACGGTTT CAGCATCATG AGCTACTCCA AGCACAAGGC GGAGGCGGCG GACTTCCTCG CCTTCATCGC GTCGCCGGAA GGGCAGGCGG CCATCAACGC CGCCGGTCTG ATCCCCGACA CCGAGGGTGC GACCACCTCG AATCCGGTGA ACCAGCAGAT GCTGGACTTT GTCAGCAAGG ACGGGATGAC CCCGTACCCG ATGCTCGACA ATGTCGTCCA GGGTGAGATC GTCGATGCGG GCAACAAGAT TCTGCCGTCC ATTCTTGCCG GCAAGATTTC GCCGTCCGAC GGTCTCGGCC AACTGCAATC GACGTGGGCG CACCTGAGCC CGGACAAGAG GTCCAACGTT TACCAGTGA
|
Protein sequence | MRRSTLPRKS VVAALVVTAL AVAACSSGTK SQTNTSNSAT SSGAGGASSN ANTAPVTLTL WHNYGTEQNA TATQNLVNAY EKLHPNVTIK VVSQPADNYF SLLQAAAISK TGPDIAVMWT GLFTLQYKDF LTPLKGLVPD AALTKQQGLE WMTDNFNPNG NPYVMPLETQ FYIGFYNKAA FAKAGITQVP QTWNELYAAC DKLKAAGYTP LVYGNGGQSL GAEFYPWYDA SYIEIGLLPV DQWRNLYDGK TPWNSPEEIA AFSKWAALHD KGCTNPDVLT KTNNIDDFVN GKAAMIIDGT WDTQKFTDAM KDNVAAFVPP FSDTPIKGVV NYPGDGFSIM SYSKHKAEAA DFLAFIASPE GQAAINAAGL IPDTEGATTS NPVNQQMLDF VSKDGMTPYP MLDNVVQGEI VDAGNKILPS ILAGKISPSD GLGQLQSTWA HLSPDKRSNV YQ
|
| |
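The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence. The following is a minimal sketch in Python, not the database's own method. The helper names `gene_length`, `clean_sequence`, `gc_percent`, and `expected_protein_length` are hypothetical and introduced here only for illustration. It assumes 1-based inclusive coordinates and a complete CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming a complete CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length = gene_length(464708, 466066)
print(length, expected_protein_length(length))
```

With the Start bp and End bp listed above, this gives 1359 bp and 452 aa, which agrees with the Gene Length and Protein Length fields. Passing the full Gene sequence string to `gc_percent` allows the same kind of check on the GC content field.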