Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4688 |
Symbol | |
ID | 8336042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5342253 |
End bp | 5343509 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957788 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003115390 |
Protein GI | 256393826 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.733472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATAC GCACAGTGGT GGCGGCGTCG GCGGTTCTCG CGCTGGCCAC TTCGACGGCC GCTTGTTCGA GCAGCGCGAG TTCCTCGGCG AGCGCCGGCA AGGTTTCCCT CAGCTACGGG GTCTGGGACG CGACGCAGGT CCCGGCCATG CAGAAGATCA TCGCAGCCTT CGAGGCACAG AACCCGACCA TCACGGTCAC CATCCAGCAG ACGCCGTGGG CGGACTACTG GACCAAGCTC CAGGCGGCCG CCTCCGGCGG TTCGGCGCCC GACGTCTTCT GGATGAACGG CCCGAACTTC CAGCTCTACG CCGCCAACCA CGTCCTGCGG CCGCTGACCG ACCTGCACCC GGACACCTCG GTCTACCCCC CGGCGCTGGC GCAGCTCTAC CAGTACAAGG GCGTGCAGTA CGGGCTGCCG AAGGACTTCG ACACCGTGGG GCTCTGGTAC AACAAGGCCA TCTTCGACGC CGCGGGCGTC GCCTACCCCA CCACCGCCTG GACCTGGGCT GATTTCCAAG CGGCGGCGAA AAAACTCACC GACCCCGCCA AGGGCGTCTA CGGTGTCGGC GCCAACCTGG AAGGCCAGGA GAACTACTAC GACACGATCT ACCAGGCCGG CGGCTACGTC ATCTCCCCCG ACGGCAAGAA GTCCGGATAC GCCGATCCGG CCGGTATCGC CGGGCTGAAG TTCTGGACCG ATCTGGTCGC GGCCAAGGAG TCGCCGAGCC TGAAGCAGAT GACGGACACC GCGCCGCTGA ACCTGTTCGA GTCCGGCAAG CTCGCCATGT ACTGGGGCGG GTCGTGGGAC GCGAAGGCGT TCGCCGCGAA CGACTCCACC AAGACCGCCG TCGACGTGAC CGCGCTGCCA GCCGGGGTGA AGAAGGCGAC GGTCATCCAC GGCCTGGCCA ACGTCGTCTT CACGCACACC TCGCACCCGG CGCAGGCGGA GAAGTTCGCC GCGTTCCTCG GCTCGCAGGC GGCGGCGCAG ATCGAGGCGG ACACCGGGAC CGTGATCCCG GCGTACAACG GCACACAGCA GAGCTGGGTC AAGGCATACC CGCAGTACCA CCTCCAGTCC TTCTTGGATC AGCTTCCTGA CGCGGTCCCG TACCCGATCT CCAAGGACAC CGCGGCCTGG AACACCCTGG AGACGAACGT CCTGACCAAG GCCTGGGACG GCAGCGAACC GATCGACAAG GCCGCCGGCG ACCTCGCCAC GCAGATGAAC GCGGCGCTGG CCAAGGAGGG TCCGTGA
|
Protein sequence | MGIRTVVAAS AVLALATSTA ACSSSASSSA SAGKVSLSYG VWDATQVPAM QKIIAAFEAQ NPTITVTIQQ TPWADYWTKL QAAASGGSAP DVFWMNGPNF QLYAANHVLR PLTDLHPDTS VYPPALAQLY QYKGVQYGLP KDFDTVGLWY NKAIFDAAGV AYPTTAWTWA DFQAAAKKLT DPAKGVYGVG ANLEGQENYY DTIYQAGGYV ISPDGKKSGY ADPAGIAGLK FWTDLVAAKE SPSLKQMTDT APLNLFESGK LAMYWGGSWD AKAFAANDST KTAVDVTALP AGVKKATVIH GLANVVFTHT SHPAQAEKFA AFLGSQAAAQ IEADTGTVIP AYNGTQQSWV KAYPQYHLQS FLDQLPDAVP YPISKDTAAW NTLETNVLTK AWDGSEPIDK AAGDLATQMN AALAKEGP
|
| |