Gene Information |
Locus tag | Acel_0552 |
Symbol | |
ID | 4484979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 583253 |
End bp | 584590 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639729319 |
Product | extracellular solute-binding protein |
Protein accession | YP_872311 |
Protein GI | 117927760 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.764632 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAATCTC GGTACCTCGC CGCGGCGGGC ATTGCCGCGG TTCTCGCTGT TGCCGGATGC AGCAGCGGCA AGTCGTCAAG CGGCGGGTCG AGTGCCAGCC CGACGCCGGC CGGCTCGAGC TCGGCTTCGG CGAGCTCGAG CGCCAACACT GCCAGCGGCA CGCTGACGGT CTGGCTGCAG ACCGATGCCC AGACCGGGTG GCCCAAGGAG GTCGCGGCCG CCACCCAACG GTTCAACGCG AAGTACCCCA ACGTCAAGGT CGACGTCCAG TACCAAACGT GGGCCGATCA CCTCACCAAA CTCGATGCCG CGCTTGCGGC CACCCCACCG GATGTCGTCG AACTTGGGAA CACCGAGACG ACGAAGTACA TGGCCACCGG GGCGCTCGCC GACATCACGG CGGACAAGGG CAAGTTCGAC AACTCCGACA CGTGGTTGGC GGCGCTCGAG CAGTCGGTCA CCTACAACGG CAAGCTGTAC GGCGTGCCGT ACTACGCCGG GGCGCGGGCC GTGATTTACC GGACGGACCA GTGGCAGGGC GCCGGCTTGT CCAACCCGCC GACCACCCTT GACGAGTTCC TCAAGGACGG CGAGGCGCTC AAAGCCAAGT ACGGCGCGAA CGACCCGAAC TACTCGGCGT TGTACTGGCC GGGCAAGAAC TGGTACGGTG CGATGTCCTT CGTCTACGGT TACGGCGGCG CCATCGCAAC GCAGGACTCC AGCGGCAAGT GGACGGCGAC CCTTGAGTCA CCGCAATCCC AGGCCGGCCT CGCTGAACTC AAGAAGATTT ACGACGCGCT CAGCAAGGCA CCGGCCGATT CCGATGAGTC CAAGCAGGAC AGCATCATGG CGCAAGGACA CGTCGGCATG ATTTACGGCA ACGGCTGGGA GGTCGGTGTC ATCACCGGCC CGAAGGACAA GGGCGGCAAC CCGGACCTCA AGCTGGCGGC ATTCCCGATG CCGGGTCCGG ACGCGAGCAA GCCGCTGCCG AGCTTCCTGG GCGGCTCCGA CCTCGTCATT CCAGCGGCCA GCCAGCACAA GGACTGGGCC GAGGAGTGGA TCAACGACTT CGTCGGGAAC ACCGGGATGA CGGCCCTCGT CAACGACGCC AAGGTGATTC CGAACACGAC GTCGTTGCTG AGCCTTCTCG GCGACAACGC ATTCGCCAAG GCGGCGTCCA ACAGCTGGTT CGTGCCGACC GCCAAGAACT GGGCGGACGT TGAGAGCCAG AACATCCTGC AGACGATGCT CGCCAACATC CTCGGCGGCA AGCAGTCGAT TGCGGATGCG ACGAAAGCGG CGGACGCGCA AATCAACCAG ATCCTCAACG GAAGCTGA
|
Protein sequence | MKSRYLAAAG IAAVLAVAGC SSGKSSSGGS SASPTPAGSS SASASSSANT ASGTLTVWLQ TDAQTGWPKE VAAATQRFNA KYPNVKVDVQ YQTWADHLTK LDAALAATPP DVVELGNTET TKYMATGALA DITADKGKFD NSDTWLAALE QSVTYNGKLY GVPYYAGARA VIYRTDQWQG AGLSNPPTTL DEFLKDGEAL KAKYGANDPN YSALYWPGKN WYGAMSFVYG YGGAIATQDS SGKWTATLES PQSQAGLAEL KKIYDALSKA PADSDESKQD SIMAQGHVGM IYGNGWEVGV ITGPKDKGGN PDLKLAAFPM PGPDASKPLP SFLGGSDLVI PAASQHKDWA EEWINDFVGN TGMTALVNDA KVIPNTTSLL SLLGDNAFAK AASNSWFVPT AKNWADVESQ NILQTMLANI LGGKQSIADA TKAADAQINQ ILNGS
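
The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is only an illustration, not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


def ungroup(seq: str) -> str:
    """Remove the spaces that split a sequence into 10-letter groups."""
    return "".join(seq.split())


# Values taken from the record above.
print(gene_length(583253, 584590))   # 1338
print(protein_length(1338))          # 445
print(ungroup("GTGAAATCTC GGTACCTCGC"))
```

Applying `ungroup` to the full Gene sequence and Protein sequence fields and taking `len()` of each result should match the Gene Length and Protein Length fields if the assumptions above hold.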