Gene Information |
Locus tag | Caci_3703 |
Symbol | |
ID | 8335056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4154043 |
End bp | 4155443 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644956843 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003114446 |
Protein GI | 256392882 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
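The length fields in the Gene Information table can be cross-checked against the coordinates. The following is a minimal sketch, assuming that the start and end positions are inclusive and that the coding sequence ends with a single stop codon that is not counted as a residue; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4154043
end_bp = 4155443

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1401 466
```

Under these assumptions the computed values agree with the listed gene length (1401 bp) and protein length (466 aa).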
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0104923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGTT ACGCCCAGCG GCCCACTGTT CGCAGGACCG CCGCCCTCGT CCTCGTCGCG GGCCTGAGCC TGTCCGCGGC GGCGTGCTCC AGCAGCAAGA GCGGCGCCAA GGGCGCCGCG GCGCCGGCCG GCGGCGACGG CCCGGTCACG ATCTCCGTGC CCTGCGAGCC GCCGACCAGC CAGGCCGGGC AGCGCAAGGA ATGGCTCGCC GACGTCGCCA CGTTCGAGAA GGCCAACCCG ACCATCACGA TCAACGGCAT AGACACCTAC CCGTGCGAGG ACACCGCGAC GTTCACGGCC CAGATGCGGG CCGGCACCGA GCCGGACGTG TTCCACACCT ACTTCACCGA CCTGAACCAG GTGCTGGACT CCGGACAGGC CGCCGACATT ACGTCCTATG TCAACGACAC GACCCTCCCC GGCTTCAAGG ACATCGCGCC GTCCGCGTTG GCCGCCGTCA CCGCGGGCAA GACGCTCTAC GGGCTGCCGA CCGGCATCTA CACCCAGGGC CTGATCATCA ACCGCAAGCT GTTCAAGCAG GCCGGTCTGG ACCCGGACCA CCCGCCGACA ACGTGGGCCG ACGTGGAGAA GGACGCCAAG GCGATCACCG CGCTGGGCAA CGGGATCTAC GGCTATGGCG AGTACAGCGC CGGCAACAAC GGCGGCTGGC ACTTCTCCTC CGAGGTCGAC GCCAACGGCG GGCAGATGAT CGGCAGCGAC GGGACGAAGG CGGCGTTCAA CGCCCAGCCC GCGACCGAGG TGCTGCAGGC GCTGCACCAG ATGCGCTTCG TCGACAACAG CATGAGCCCG ACCCAGCAGC TGAAATGGGG CGACCTGCAG AAGCAGATGG GCGCCGGGAA GCTCGGCATG TACGTCGCCT CCCCGGACGA CATCTACAAC GTCATCGTCC CCCAGGACGG CGGCAACGTC GACGACTACG GCATCGGTCC GCTGCCCAGC ACCAGCGGGA CGCCCGCCGG GTCGCTGTCC GGCGGCGACG ACGTCATGTT CAACAAGCAC GACACCCCGG CGCAGATCCG GGCCGGCATC AAGTGGATCG CGTTCTCGAT GCTGACGCCG GGCCAGGGCC AGTTCAACTA CGCGCGCACC AAGGCCGACG GGCTGCCGGT CGGCTTCCCG GAGCCGCTGG ACTGGATCGG CGACACCTCG GCGAAGAACG AGACGCTGAA GGCGGCCAGC GCGACGGTGA ACACCGCGTA CTTCAAGACG TTCGTCGACG CGCACGAGAA GGGCATGGGC GAGCCCGCCG ACGCCCAGGC GGTCTACAAG ACGCTGGACG CCACGATGCT CAGTGTCCTG AACGACCCGA ACGCCAACAT CCCGAACCTG CTCAAGACCG CGGAGACCCA GGTCAACACG CTGCTCGCGA ACGCCGGCTG A
|
Protein sequence | MKSYAQRPTV RRTAALVLVA GLSLSAAACS SSKSGAKGAA APAGGDGPVT ISVPCEPPTS QAGQRKEWLA DVATFEKANP TITINGIDTY PCEDTATFTA QMRAGTEPDV FHTYFTDLNQ VLDSGQAADI TSYVNDTTLP GFKDIAPSAL AAVTAGKTLY GLPTGIYTQG LIINRKLFKQ AGLDPDHPPT TWADVEKDAK AITALGNGIY GYGEYSAGNN GGWHFSSEVD ANGGQMIGSD GTKAAFNAQP ATEVLQALHQ MRFVDNSMSP TQQLKWGDLQ KQMGAGKLGM YVASPDDIYN VIVPQDGGNV DDYGIGPLPS TSGTPAGSLS GGDDVMFNKH DTPAQIRAGI KWIAFSMLTP GQGQFNYART KADGLPVGFP EPLDWIGDTS AKNETLKAAS ATVNTAYFKT FVDAHEKGMG EPADAQAVYK TLDATMLSVL NDPNANIPNL LKTAETQVNT LLANAG
|
| |
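The gene sequence is printed in blocks of ten bases and already reads in coding orientation (it begins with ATG and ends with TGA), even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, compute GC content, and translate the sequence. It assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the table differs in its permitted start codons); the helper names `ungroup`, `gc_percent`, and `translate` are hypothetical and not part of any database tooling.

```python
# Codon table built from the NCBI ordering: bases T, C, A, G at each position.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(grouped: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence, as printed in the record.
prefix = ungroup("ATGAAGAGTT ACGCCCAGCG")
print(translate(prefix))  # MKSYAQ
```

Passing the full gene sequence through `ungroup` and then `translate` or `gc_percent` gives values that can be compared with the listed protein sequence and GC content.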