Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2031 |
Symbol | |
ID | 8333375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 2300510 |
End bp | 2301838 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644955181 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003112792 |
Protein GI | 256391228 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.298737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAATGA GGACAACACG GCGTTCGGCA CTGCGGCTGG GCATCGGGGC CCTCGCGGTC GCGGTGACCG GCGGATGCGC GACCGGCGGC GGGAAGAAGA CTCCGGCGCC GGCTGTGAAG ACCGGCGCCG CGGCTCAGGT GGGCGGGACC ATCACGGTGT GGTCCTGGGA CGTGGCGGCC AAGGCGCTCA AGCGGCTGGC GCCCGCGTTC GAGCAGCAGC ACCCGGGCGT GAAGGTGAAC GTCGTCGACA TCGGCTACGA CAACGCCTAC GACAAGATCA CCGTCGGCCT GAAGTCCGGC TCCGGACTCC CCGACGTCCT GCAGGTCGAG GGCCCGAAGA TGCAGAGCTA CATCGGCACC TTCCCCAGCG GCTTCTACGA CCTCAGCACC CTGGCGGCAC CGCTGAAAGC GCAGTTCAAC GCCGCCGCCT GGGCCACCGG CACGGACGCG AACGGCAAGG TCTACGCGCT GCCCTGGGAC ATCGGGCCCT GCGGCGTGTT CTACCGGACC GACATCTTCC AGCAGGCCGG CGTCGACCCC GCGTCGATCC AGACCTGGGA CGACTACATC GCCGCCGGCG TGCGGATCAA GGCCAAGACC GGCAAGAAGC TGCTGGTGGT GGACCCCACC GGCGACAGCA CGTTCCCGAT GATGCTCCAC CAAGAAGGAC AGGGCTACTT CGTCGGCGAC AAGATCGCCG TCGACACCCC GGCGGCGGTG AAGGCCATGA CCGTCATGAA GGAACTCAAC GACAAAGGCC TGGTCGACTA CGAAAAGGGC TGGGACGCCC TGGTCGCCGC CACCAAGGAC GGCACCGTCG CCACCACCCC GACCGCGGTC TGGTGGTCCG GCACCCTCAC CGACGAGATG CCGGAGCTGA AGGGCAAGTT CGCCGCGATC CCGCTGCCCG CCTTCACTTC CGGCGGCATC CGTACCTCCA ACAACGGCGG CTCCCTGCTC ACCATCTCAG CGCAGAGCAA GAACTCGGCG ACCGCCTGGG CGTTCATCCA GTTCGTCCTG GCCGACGCCG ACAACCAGGT CTCGATGCTG AAGAACGAAG GCATCTTCCC CGCCTTCGAG CCGGCCCTGT CAGACCCCTA CATCACCGGC CCGCAGGACT ACTACGGCGG CCAAACCACC TTCAAGATCT TCGCCGACCT GGCCAAGAAC ATCCCGGCAG TGCAGTACAC CGCCGACTTC TCCAAAGCCT CCGACCTGAT CAACACCGCC ACCGGCGCGG TGATGCAAGG CGGCAAGGAC CCGAAGTCCG CCCTGGACTC GGCAGCGCAA CAGATCGCCT CCGCGACCAA TCGGCAGATC GCGCACTAG
|
Protein sequence | MSMRTTRRSA LRLGIGALAV AVTGGCATGG GKKTPAPAVK TGAAAQVGGT ITVWSWDVAA KALKRLAPAF EQQHPGVKVN VVDIGYDNAY DKITVGLKSG SGLPDVLQVE GPKMQSYIGT FPSGFYDLST LAAPLKAQFN AAAWATGTDA NGKVYALPWD IGPCGVFYRT DIFQQAGVDP ASIQTWDDYI AAGVRIKAKT GKKLLVVDPT GDSTFPMMLH QEGQGYFVGD KIAVDTPAAV KAMTVMKELN DKGLVDYEKG WDALVAATKD GTVATTPTAV WWSGTLTDEM PELKGKFAAI PLPAFTSGGI RTSNNGGSLL TISAQSKNSA TAWAFIQFVL ADADNQVSML KNEGIFPAFE PALSDPYITG PQDYYGGQTT FKIFADLAKN IPAVQYTADF SKASDLINTA TGAVMQGGKD PKSALDSAAQ QIASATNRQI AH
|
| |