Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2622 |
Symbol | |
ID | 8333971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 3004944 |
End bp | 3006257 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644955774 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003113380 |
Protein GI | 256391816 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0574924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.833644 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA CTCCTTTGAA CGCCGCAGCC TCACGCCGCT CGTTCCTCCT CGGCGGCCTG TCCATAGTCG GCGCGGCCGC CCTGTCCGGG TGCAGCGTCA CCGGCACCTC GCAGAAGAAG GGCTCCGCGG GCTCCGGCTC CGGCACCATC AATGTCCTGT TCATGCAGCA GGCCGGCTAC AGCACCGACG ACGTCACGAA GATGACCGCC GCGTTCACCA AGCAGTACCC GGACATCAAG GTCAACCCCA CCTACGTCGC CTACGAGGCG CTGCACGACA AGATCGTCAC CGCCGCCGCG GCCGGCACCT ACGACGTGGT GCTCATCGAC GTCATCTGGC CGGCGGAGTT CGGCAAGAAG AACATCGTCG CCGACGTCAC CTCCCGCTAC CCGGCGGACT GGAAGGACAC GATGCTCGGC GGCGCGCTGC TGACCGCCGA CTACGACGGC AAGCAGTACG GCGTGCCGTG GGGGATGGAC ACCAAGTTCT TCTACTACAA CAAGGCCCTG CTGGCGAAGG CCGGCGTCGA CGCCTCCACG CTGGGCACTT GGAGCGGCGT GCTCCAGGCG GCCAAGGCGC TGAAGCAAGC CAAGGTCGTG GAGTACCCGC TGGCGTGGAG CTGGTCGCAG GCCGAGGCCA TCATGTGCGA CTACACGCAG CTGGTCGGCG CGTTCGGCGG GTCGTTCACC GACAGCGCCG GCAACCTCAC CCTGAACAAG GGCGGCGCGG TGGACGCGCT GGCCTGGATG CGCCAGAGCA TCGTCGACGG TCTGACCAAC CCCTCCTCCA CCACGTTCCT GGAAGCCGAC GTGGAGAAGA CGATGAACAA CGGCCAGGCG GCGTTCGGTC TGAACTGGAC CTACTACCTG GGCTCCTCCA ACGACCCGAA GAACTCCCAG GTCCCCGGGC AGATCGTGGT CGCCCAGACC CCGGCCGGCC CGAGCGGGAA GCGCCCGGGC GTCAACGGCG CGATGGCGCT GTCGGTGTCC ACGGGCAGCA AGAACCAGGA CGCCGCCTGG AAGTACATCT CCTGGATCGC CGGGGAGGAC CAGGTCGACC AGTTCGCCAA GGACGAGATG CCGATCTGGA AGAAGTCCTT CACCACCCCC TCGGTGGTCT CCTCGGCGCC GGACATGTTC GCCGTCGCCG CCAAGCAGCT CGACGACCTG GTCGTGCGCC CGCAGTTCGT GAACTACAAC GCGGTCTCCC AGGTCATCCA GGTCGAGCTG CAGAACGCGC TGCTGGGCAA GAAGCCCGCG CAGCAGGCGC TGGACGACGC GGTGAAGGCC GCGCAGCCGC TGATGGGGGG CTGA
|
Protein sequence | MNATPLNAAA SRRSFLLGGL SIVGAAALSG CSVTGTSQKK GSAGSGSGTI NVLFMQQAGY STDDVTKMTA AFTKQYPDIK VNPTYVAYEA LHDKIVTAAA AGTYDVVLID VIWPAEFGKK NIVADVTSRY PADWKDTMLG GALLTADYDG KQYGVPWGMD TKFFYYNKAL LAKAGVDAST LGTWSGVLQA AKALKQAKVV EYPLAWSWSQ AEAIMCDYTQ LVGAFGGSFT DSAGNLTLNK GGAVDALAWM RQSIVDGLTN PSSTTFLEAD VEKTMNNGQA AFGLNWTYYL GSSNDPKNSQ VPGQIVVAQT PAGPSGKRPG VNGAMALSVS TGSKNQDAAW KYISWIAGED QVDQFAKDEM PIWKKSFTTP SVVSSAPDMF AVAAKQLDDL VVRPQFVNYN AVSQVIQVEL QNALLGKKPA QQALDDAVKA AQPLMGG
|
| |