Gene Information |
Locus tag | Caci_3808 |
Symbol | |
ID | 8335161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4307496 |
End bp | 4308779 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644956947 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003114550 |
Protein GI | 256392986 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
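The coordinates and lengths reported above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and assumes that the gene length includes the stop codon while the protein length does not. The function name `check_cds_lengths` is illustrative and is not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate span and the protein length both
    agree with the reported gene length.

    Assumptions (not stated in the record itself):
      - start_bp and end_bp are 1-based and inclusive
      - the gene length counts the stop codon, the protein length does not
    """
    # Number of bases covered by the inclusive interval [start_bp, end_bp]
    span = end_bp - start_bp + 1
    # Three bases per amino acid, plus one stop codon
    coding = (protein_length_aa + 1) * 3
    return span == gene_length_bp and coding == gene_length_bp


# Values taken from the record for Caci_3808
print(check_cds_lengths(4307496, 4308779, 1284, 427))  # prints True
```

For this record, 4308779 - 4307496 + 1 = 1284 bp, and (427 + 1) x 3 = 1284 bp, so the two figures agree under these assumptions.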
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.750227 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGAA GGAAGTTCCT CACGTTCTGC GCCACCGCGG CAGCCGCCGC ACCCCTGGCA GCGTGCGGCT CGAGCAAGAG TTCCCCCGCC GGCGCGGCGG CCGCCGACAG CTCCCCGGTC GAGCTGACGG TCAGCGTCTG GAGCCTGGCC TCGACCCCTG AGTTCCACGC CTTGTTCGAC GCCTTCCACC AGGCGAACCC GAACATCACC ATCAAGCCCG TCGACATCCT CGCCGCCGAC TACCCGACCA AGGTCACCAC CATGCTCGCC GGCGGCGACA GCACCGACGT CATCACGATG AAGCAGGTCA GCGACTACTC CCTCTACGCC GCGCGCGGGC AGCTCAAGGA CCTGACCTCG ATCGCCACCT CGGGTGCGGC CGCGACGCTC AACGGCCTCG ACAGCTACAA GACGAAGGAC GGCAAGTACT ACGCCCTGCC CTATCGGCAG GATTTCTGGG TCCTGTTCTA CAACAAGAAG CTCTTCACCG CGGCGGGCAA GCAGGCTCCG GACAACCTCA CCTGGGACCA GTACAGCGAC CTGGCCAAGT CCGTCACCAC CGGCTCCGGC GGCGACAAGG TCTACGGCAC GTACCACCAC ATCTGGCGCT CGGTGGTCCA GGCGATATCG GCCGCGCAGA CCGGTGGGAA CCTGCTCGGC GACGACTACA GCTTCTTCAC CGACCAGTAC AAGATGGCGC TCGGGATCCA GGACGCGGGC GCGACGCTCG ACTTCGCCAC CGCGACCACG CAGAAGACCG GGTACGCGAC GGTGTTCGAG ACCTCGAAGG CCGCGATGCT GCCGATGGGC ACGTGGTTCA TCGCCAAGCT GCTCGCGGAC AAGAAGTCCG GCGACACCGA CGTCGACTGG GCCATCGCGC CGATGCCGCA GCGGCCGGGC GGGAGCACGG TCACCACGTT CGGTTCGCCG ACGGCGTTCG CCGTCAACAA GAAGGCCAAG CACGCGGCCG CGGCCGAGAA GTTCGTCCAG TTCGCCGCGG GTCCCGAGGG GGCCAAGGCG ATCACCGCGA TCGGCGTCGT GCCCTCGCTG CTGTCCGACC AGACTCGGCA GGACTACTTC GCGCTGACCG GGATGCCGAC GGACGACGTC TCGAAGAAGG CGTTCAAGCC GGACAAGGTC GTGCTGGAAA TGCCGGCCAG CGACAAGTCC TCGAAGATCG ACGCGATCCT CACCGAGGAG CACCAGCTGG TCATGACCAA GCAGAAGTCC ATCGACGCCG GGATCAAGGA GATGGGCTCC CGCGTCAAGA ACGAAGTCAG CTAG
Protein sequence | MDRRKFLTFC ATAAAAAPLA ACGSSKSSPA GAAAADSSPV ELTVSVWSLA STPEFHALFD AFHQANPNIT IKPVDILAAD YPTKVTTMLA GGDSTDVITM KQVSDYSLYA ARGQLKDLTS IATSGAAATL NGLDSYKTKD GKYYALPYRQ DFWVLFYNKK LFTAAGKQAP DNLTWDQYSD LAKSVTTGSG GDKVYGTYHH IWRSVVQAIS AAQTGGNLLG DDYSFFTDQY KMALGIQDAG ATLDFATATT QKTGYATVFE TSKAAMLPMG TWFIAKLLAD KKSGDTDVDW AIAPMPQRPG GSTVTTFGSP TAFAVNKKAK HAAAAEKFVQ FAAGPEGAKA ITAIGVVPSL LSDQTRQDYF ALTGMPTDDV SKKAFKPDKV VLEMPASDKS SKIDAILTEE HQLVMTKQKS IDAGIKEMGS RVKNEVS
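The sequences above are displayed in space-separated blocks of ten characters, so the spaces must be stripped before the sequence is used elsewhere. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First three ten-base blocks of the gene sequence shown above
fragment = clean_sequence("ATGGACAGAA GGAAGTTCCT CACGTTCTGC")
print(len(fragment))          # prints 30
print(gc_fraction(fragment))  # prints 0.5
```

Applying the same two functions to the complete gene sequence gives a value that can be compared with the GC content reported in the Gene Information table.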