Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2254 |
Symbol | |
ID | 8333603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2557924 |
End bp | 2559222 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644955407 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003113013 |
Protein GI | 256391449 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.534892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00235651 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCAGGA CCACAGCGAT ATCCGCCATA GCCGTCGCCG CTACGCTGCT GGCCGGCTGC GGCGTCGGAA GCAAGTCCTC GTCGTCGGAC GCGAACAAGA CCGTCTCCAC CACGGCGGCG CTGTCGGGCA CCATCACCTT CCAGACCTGG TCGCTGAAGA ACGACAAGTT CACCCCGTAC TTCACGAAGC TGATCGCCGA CTTCAAGACG GCGCACCCGG GCACCGAGGT CAACTGGATC GACCAGCCCG GCGACGGCTA CCCGACCAAG GTCACCAGCC AGGTCTCCAC CGGCAGCCTG CCGGACGTGA TCAACCTGCC GCCGGACATC GCGCACGCGG TGGCCAAGAC CGGCAACCTG CTGGACCTGA AGCAGAACGT GCCGACGCTG CAGACGGACT ACGTCAAGAG CGGTCTGAGC GCCTACAACT ACCAGGACAT CTCCGGCGGC TCCCAGTTCT TCGGCTTCCC CTGGTACCTC GGTACCGACG TGAGCTACTG GAACAAGACG ATGATGGCCA AGGACGGCCT GGACGCGGCC AACCCGCCGA AGACGTTCGA CGACCTGGTC GCCCAGGCGA AGATCATGCA CGACAAGTCC GGCGGCAAGG ACTACCTGAT GTCCCGCGCC CCGGGGCTGT CGGACATCGC GAACACCGGC ACCAAGCTGC TGTCCGCCGA CGGCACGAAG TTCGCCTTCA ACACCTCGCA GGCCGAGGCG ATGCTGGACA AGTACACCGC CGCCTACAAG GCCGGGTACC TCCCCTCGAA CGTGCTGGAC AACAGCTACG AGGGCAACTC CACGCTGTTC AGCAAGCAGC AGGTCGCCTG GACCACCGGC GGCGGCAACC TGATCACCAG CCTGCAGCAG GACAACCCGA CGCTGGCGCC GAACGTGGTC CCCTCCCCGG CGCTGGACAC CGCGCCGCTG TACGTGCAGG GCCTGTCGGT CTCCAGCAAG AGCAAGAACC TCCCGCTGGC GGTGGCCTTC GCCGAGTTCG TGACGGACAA CGAGAACCAG GCCGGCTTCG TCAAGCTGGC GCCGGGCTTC CTGCCGGGCT CCGCGGCCTC CGCGAACGAC CCGCAGTACA GCAAGAGCGA CGGCACCACG CAGGGCGACG CCTCGGTGAT CGCCTACAAG GACATGCAGA CCGCGGTGAA CTTCACCCCG CCGATCTGGA CCGACGCGAT GAACACGCTC CTGAACCAGG AGATCGCCAA GGCGATGACC GGCAAGGAGA CCTCCAAGCA GGCCTTGGAC AACACGGTGA ACCAGGCGAA CGCACTGCTC TCGCAGTGA
|
Protein sequence | MRRTTAISAI AVAATLLAGC GVGSKSSSSD ANKTVSTTAA LSGTITFQTW SLKNDKFTPY FTKLIADFKT AHPGTEVNWI DQPGDGYPTK VTSQVSTGSL PDVINLPPDI AHAVAKTGNL LDLKQNVPTL QTDYVKSGLS AYNYQDISGG SQFFGFPWYL GTDVSYWNKT MMAKDGLDAA NPPKTFDDLV AQAKIMHDKS GGKDYLMSRA PGLSDIANTG TKLLSADGTK FAFNTSQAEA MLDKYTAAYK AGYLPSNVLD NSYEGNSTLF SKQQVAWTTG GGNLITSLQQ DNPTLAPNVV PSPALDTAPL YVQGLSVSSK SKNLPLAVAF AEFVTDNENQ AGFVKLAPGF LPGSAASAND PQYSKSDGTT QGDASVIAYK DMQTAVNFTP PIWTDAMNTL LNQEIAKAMT GKETSKQALD NTVNQANALL SQ
|
| |