Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0130 |
Symbol | |
ID | 8331455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 133375 |
End bp | 135060 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644953297 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003110926 |
Protein GI | 256389362 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTCA CCAGCACCGC GCAGCAGATC TCCCGCAGGA CCGTCCTGCG TACGACCGGA GCCGCCGCCG TCGCGGCCGT CGCCGTTCCG GCCCTGGCGG CTTGCGGGGG GTCGAAGACC TCCTCCGGCG CCGCGCAGTC CAACGTCGAC AAGAAGCTCA TGGCCTGGCC GACCTACACT CCGGCGGCGG GTCTGCACCC TGATATGCCT GGCACCGCGG CAGGCGTGCA GGACACGTTC CTGCGCTATC CGTCCAACCT GATCCAGTCG GTGCCCGCCA AGCCCGGCGA CGGGTCGAAG GTGCGCGCGC TGATCGTCAC CTACGGTACG CAGCCCAAGG GCCCGGACCA GAACCAGCTG TGGAAGGCGG TCAACGACGC CGTCGGCGTG GACCTCGAGC TGACCATGGT CACGGACGCC GACTGGCAGA CCAAGCTCGG CGCCATGATG GCCGCCTCGG ATCTGCCGGA CATCATCATG CTCGGGCTCT ACCAGCTGCC GAACGAGGCG CAGTTCCTGC AGGCCAAGTG CGAGCCGCTG GGGCAGTACC TGGCCGGGGA CGCGGGCGCG AAGGCGTATC CGAACCTGGC CGCGATACCG CCGTACAGCT GGGATTCGGT GGGACGCGTC GGCGGCGACT TCTATGGGAT CCCGATCCAC CGGCCGCGTC TGGGGAACTC GTTCTTCGCC GACTCCGACC TGTTCCAGCA GGCCGGGATC TGGAACCCGA AGCCGGGCGG GCTGTCGAAG GCGGAGCTGA CCGCCGGGCT GATGAAGCTG AACACGCAAG GGCACTTCGC GCTCGGTACC AACAAGGTCG CCTCATTCGG TTACCTGACG CACTCCGGGG TGCACGGCAC GCCTAACCTG TGGTCGCTCG CCAACGGTCA GTTCACCACC GCGTACGGCA CCGACAGCAT GAAGCAGTCG CTGGCGACCA TGGCCGATTG GTACGGCAAG GGACTCTACG ACCCGGCGGC GCTGACCGTG TCGAGCACGC AGTGCAAGAC CGACTTCCAG AACGGTACTT ACGTCACCAC CACCGACGGC TTCGGCGGGT TCGGCGGCTA CGCGACCGCC GTCAACGAGA AGTGGAAGGT CGACTTCGTC CGGCCCTTCG ACGCCGGTAC CGGCGCCAAG CCGACGCCGT GGCTCACCCC CGGCTACTTC GGCTACACGG TACTGAAGAA GACGACACCG GAGCGCGCCA AGATGCTGCT CGGCGTGCTG AACTTCCTCG CCGCGCCGTT CGGCTCCAAG GAGTGGGAGC TGATCAACTA CGGGCTCGAA GGCGTGCACT TCAACCGGGG TGCGGACGGC GGTCCGTCGG CGCCGACCGC GTTGGGCAAG ATCGAGAACT CGGTGAACGT GCCGGTCAAG TACGTCATGG CCGCTCCGCT GGTGAACTAC CTCGCCGGCG AGCCGGAGGC AGCCAAGCGC TGCTATCAGG CGCAGGTGGA CATCGTGCCC ATCGGCGTGA CAGACCCGAG CCTGGGCGTC CAGTCGGCGA CCCGGAACAA GCAGTGGCCG ACGCTCTTGC AGCAGATCCA GGACGGGATG AACCAGATCA TCACCGGGAA GGCGCAGCTC TCGTCCTGGG ACGACGTCAT CAAGAAGTGG AAGAGCAGCG GCGGGGACCA GATCGCCGCC GAACTCGGCG CCGAGTACGC CAAGACTCAC GGTTGA
|
Protein sequence | MPFTSTAQQI SRRTVLRTTG AAAVAAVAVP ALAACGGSKT SSGAAQSNVD KKLMAWPTYT PAAGLHPDMP GTAAGVQDTF LRYPSNLIQS VPAKPGDGSK VRALIVTYGT QPKGPDQNQL WKAVNDAVGV DLELTMVTDA DWQTKLGAMM AASDLPDIIM LGLYQLPNEA QFLQAKCEPL GQYLAGDAGA KAYPNLAAIP PYSWDSVGRV GGDFYGIPIH RPRLGNSFFA DSDLFQQAGI WNPKPGGLSK AELTAGLMKL NTQGHFALGT NKVASFGYLT HSGVHGTPNL WSLANGQFTT AYGTDSMKQS LATMADWYGK GLYDPAALTV SSTQCKTDFQ NGTYVTTTDG FGGFGGYATA VNEKWKVDFV RPFDAGTGAK PTPWLTPGYF GYTVLKKTTP ERAKMLLGVL NFLAAPFGSK EWELINYGLE GVHFNRGADG GPSAPTALGK IENSVNVPVK YVMAAPLVNY LAGEPEAAKR CYQAQVDIVP IGVTDPSLGV QSATRNKQWP TLLQQIQDGM NQIITGKAQL SSWDDVIKKW KSSGGDQIAA ELGAEYAKTH G
|
| |