Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1941 |
Symbol | |
ID | 8333284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 2196203 |
End bp | 2197666 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644955090 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003112702 |
Protein GI | 256391138 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.948884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.342758 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAGATC ACGGTTTGCC AGCCGACGTC TCCCGCCGGC AGATGTTGGT CCGTTCGGCC GCGATCGCGG CCATGGTCGG CCCGGGCTCG GCCCTGGCAG CCGGGTGCGC GGCCGGCAGC GGGAGCAGCC CCAAGAACAA CAACGCGGGC TCCACCACCG CCGCGAACTC CTCGGACCCG AAGAACCCGT TCGGAGTGAA GGCCTCCAGC CCGCTGGATG TCTACGTCTT CAAGGGCGGC TACGGCGACG ACTACGCGAA GGCCTTCGAG GCGATGTACT CCTCGAAGTT CTCCGGCTCG CAGGTCTCGC ACCACAGCGG CCAAAACCTC ACCGGCGACC TGCAGCCCCG GTTCAACGCC GGCAGCCCGC CGGACGTCAT CGACGACTCC GGCGCCCAGC AGCTGAAGCT GGACGTGTTG AACAGCAGCG GCCAGCTGAC CAACCTGGAC CGGCTCTTGG ACGCGCCCTC GATCGACGAC CCGAGCAAGA AGGTCCGCGA CACCCTGCTG GCCGGCACGA TCGAGACGGG CCAGCTCGGC CAGAGCATGT TCTCGCTGAA CTACGCGTTC ACCGTCTTCG GGCTGTGGTA CTCCACCGCG CTGTTCCAGA AGAACAACTG GCAGGTCCCC ACCTCGTGGG AAGACTTCAT GACGCTGTGC GCGACCATCA AGGCCAGCGG CATCGCCCCG TTCGCGCACC AGGGCAAGTA CCCGTACTAC ATGCTGGTGC CGCTGATGGA CATGGTCGCC AAGAACGGCG GCCCGGACGT GCAGACCGCC ATCGACAACC TGGAGCCCAA CGCCTGGAAG TCCGACGCGG TCAAGAACAG CGTCGATGCC CTTTACGAGC TGGTGGACAA GGGCTACATG CTCCCGGGCA CCGAGGGTCT GACCCACATC CAGTCCCAGA CCCTGTGGAA CCAGGGCAAG GCGGCGGTCA TCCCCTGCGG TTCGTGGCTG GAGAACGAGC AGCTGTCCGC CACCCCGGCG GGCTTCAACA TGGCCGTGTT CGCCATGCCC TCGCTCAGCG GCGACAAGAT GCCGCAGACC GCGATCCGGG CCGGCGCCGG CGAGCCGTTC ATCGTCCCGA GCAAGGCCAA GAACCCGGCC GGCGGCCTGG AGTTCCTGCG CATCATGTGC TCCAAGGCCG GCGGCGCCTC CTTCGCGCAG AAGGCGAACT CCCTGTCGGT GGTGAAGGAC GCGATCACCC CGGACATCGA GGCCAAGCTG CTGCCGGGCA CCAAGTCCAG CAACGACCTC TACCAGGCCG CCAACGGCAA GGTCATCTCC TGGTACTACC TGAACTGGTA CTCCCAGATG GAGAAGGACC TCGAGGACGC CATGGGCCAG CTGATGGCGA ACAAGATCAA GCCGGCCGAG TTCATCACCC GCGCGCAGGC GGCGGCCGAC AAGTGCGCCG GCGACTCCTC GGTGCAGAAG TTCAAGCGCC CGACCACCGC CTGA
|
Protein sequence | MSDHGLPADV SRRQMLVRSA AIAAMVGPGS ALAAGCAAGS GSSPKNNNAG STTAANSSDP KNPFGVKASS PLDVYVFKGG YGDDYAKAFE AMYSSKFSGS QVSHHSGQNL TGDLQPRFNA GSPPDVIDDS GAQQLKLDVL NSSGQLTNLD RLLDAPSIDD PSKKVRDTLL AGTIETGQLG QSMFSLNYAF TVFGLWYSTA LFQKNNWQVP TSWEDFMTLC ATIKASGIAP FAHQGKYPYY MLVPLMDMVA KNGGPDVQTA IDNLEPNAWK SDAVKNSVDA LYELVDKGYM LPGTEGLTHI QSQTLWNQGK AAVIPCGSWL ENEQLSATPA GFNMAVFAMP SLSGDKMPQT AIRAGAGEPF IVPSKAKNPA GGLEFLRIMC SKAGGASFAQ KANSLSVVKD AITPDIEAKL LPGTKSSNDL YQAANGKVIS WYYLNWYSQM EKDLEDAMGQ LMANKIKPAE FITRAQAAAD KCAGDSSVQK FKRPTTA
|
| |