Gene Information |
Locus tag | Acel_2049 |
Symbol | |
ID | 4484728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 2320454 |
End bp | 2321761 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639730845 |
Product | extracellular solute-binding protein |
Protein accession | YP_873807 |
Protein GI | 117929256 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGCCT GCAGCTCCGG CAAGAGCACG GGAGGCTCCA GCGGAACGTC TCCCAGCGCC GGCGGGTCAA GCAGCACGAA CGCCAGCGCG TCGAGCAGCG GCGGGGCGTC ATCGTCCTGT CCGTTCCCGA CCGACTCCGT CACGCTGACG TGGTGGCACA ACGCCACAGC CGATCCGGGG AAGGCGGCGT GGCAGAAGAC GGCCGACGAT TTCCACGCAC AGCACCCGAA CGTGAGCTTC AACATTGTGC CCATTCAAAA TGAGCAATTC ACGACAAAGG TGCCGGCCGC ACTCGAGTCC AACAATCCGC CGTCTCTGTA CCAGCAGTGG GGCGGGGGAT CCGAAGCGAC GCAGGTCAAG TCCGGCAAAC TGATGGACAT GACGGCCTGT GTCTCGAGCT GGGTCGACCG GCTCGGCCCG TCGGCCAAGG GTTGGCAGGT CGACGGCAAG TGGTACGGAA TTCCGTACGA CTACCACATC GTCGGCTTCT GGTACCGCAC GGACCTGTTC CAAAAGGCGG GCATCACCTC GCCGCCGAAG ACGATGGACG AGCTGTACCA AGACATCGAC AAGCTGAAGG CGGCAGGGAT CACCCCGATC GCGCTGGGCG GCAAGGACCG TTGGCCGGAC GCCTTCTACT GGGAGTACTT CGTGCTTCGG GAATGCCCGA AGGACACGGT GACGTCGTCC ATCGCCAACG TCAAATTCTC TGACCCCTGC TTCGTTAAGG CCGGTCAGGA CATGAAGAAG TTCCTTGACG CCAAGCCGTT CCAGACCGGA TTCCTTGGCA CGCCCGCACA ACAAGGCGCC GGCAGCTCGG CCGGCTTGGT GGCTAACGGC AAGGCGGCAA TGGAGCTGCA GGGTGACTGG GAAATCCTGG TCATGCCGTC GCTCACCCAG GACAAGAACT TCGCGTCGAA ACTCGGCTGG TTCCCCTTCC CGTCGGTGTC CGGCGGTGCG GGTGACCAGA ACGCCGGACT CGGCGGTGGC GACGGTTTCA GCTGCACCTA CAAGGCCACC AACGCCTGCC CGGCGTTCCT GGAGTACATC ACCAGCGCCG ATGTCCAGCG CTACCTGGTG AAGCAGAGCG CCGTCAGCCT GCCGTCCAAC AGCGAGGCAA GCGACGCCAT CACCGACCCC ACGCTGAAGA CGGTCCTTCA GTACATCGGA ACGGTGTCGT ACAACCAGCT GTACTTCGAC CAGGCGCTGC CGACCGATGC CGGACAGGCG CTTGACTCGG CGGTCGCCGA CTTCTTCGCG GGTTCCGGCA GTCCGGAGAG CCTGGCGGCG TCGGTGTCGT CGAAGTAA
|
Protein sequence | MAACSSGKST GGSSGTSPSA GGSSSTNASA SSSGGASSSC PFPTDSVTLT WWHNATADPG KAAWQKTADD FHAQHPNVSF NIVPIQNEQF TTKVPAALES NNPPSLYQQW GGGSEATQVK SGKLMDMTAC VSSWVDRLGP SAKGWQVDGK WYGIPYDYHI VGFWYRTDLF QKAGITSPPK TMDELYQDID KLKAAGITPI ALGGKDRWPD AFYWEYFVLR ECPKDTVTSS IANVKFSDPC FVKAGQDMKK FLDAKPFQTG FLGTPAQQGA GSSAGLVANG KAAMELQGDW EILVMPSLTQ DKNFASKLGW FPFPSVSGGA GDQNAGLGGG DGFSCTYKAT NACPAFLEYI TSADVQRYLV KQSAVSLPSN SEASDAITDP TLKTVLQYIG TVSYNQLYFD QALPTDAGQA LDSAVADFFA GSGSPESLAA SVSSK
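The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon, which is consistent with the gene sequence ending in TAA. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2320454
END_BP = 2321761
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435

# First and last blocks of the gene sequence as listed in the record.
FIRST_BLOCK = "GTGGCTGCCT"
LAST_BLOCK = "CGAAGTAA"

# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def ends_with_stop(sequence_tail: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return sequence_tail[-3:].upper() in STOP_CODONS


if __name__ == "__main__":
    print("span matches gene length:", span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length matches:", expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print("sequence ends with stop codon:", ends_with_stop(LAST_BLOCK))
    print("first codon:", FIRST_BLOCK[:3])
```

Under these assumptions the three checks agree with the record: 2321761 - 2320454 + 1 = 1308 bp, and 1308 / 3 - 1 = 435 aa. The first codon is GTG, an alternative start codon that is read as methionine at the initiation position, which matches the leading M of the protein sequence.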