Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3719 |
Symbol | |
ID | 8335072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4185855 |
End bp | 4187180 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644956859 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003114462 |
Protein GI | 256392898 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.707833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0540576 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAGC GAACAACGGT CCTGGCCGCC GTGATCACGG CCTGCGCCCT GGCTCTGGCC GCCTGCTCCG GCGGCGCCCA CGGCACCGGC GGCAGCGCCG CCGCCACCGC CACCGACCCG GCCGGCGTGA GCGGCGACAT CACGGTCCTG ACTCACAAGA CCGACCTCGC CGCCGACGGC ACCCTGGCCC GCTACGCCGC GGAGTTCAAC AAGATCTATC CGAACGTGCA CGTCAAGTTC GAACCCGTCG TCGACTACGA AGGCGACGTG AAGATCCGCC TCAACAGCAG CGACTACGGC GATGTGCTGA TGATCCCGGC CTCGGTCCCG GTCGCGGACT ACCCCAAGTT CTTCGCCCCG CTCGGCACCC CCGCCGACCT GGACCAGAAG TACCGCTTCA TCGACCACGG CACGTACAGC GGCCAGGTGT ACGGCATAGC CATCAACGGC AACGCCACCG GCATGGTCTA CAACAAGACG GTGTGGCAGC AGGCCGGCGT CACCAGCTGG CCCACCACGC CCGACCAGTT CCTCGCCGAC CTGCAGGCGA TCAAGACCAA GACCCAGGCC ACCCCGCTCT ACACGGTCTA CCACGAGGGC TGGCCGATGA CCGCCTGGCA GTCCTACCTC GGCGAGCTCA GCTGCGATCC CAAGGCCTCC GACGACCTCG CCACCGACGC CGCGCCCTGG GGTCCGGGCA AGGAGCTCAA CCAGATCGAC ACGATGCTCT ACAACGTCGT CCACAATCAG CTCACTGAGA AGGACCCGAC GACGACGGCC TGGGACGCCG CCAAGAGCGG CATGGGCTCG GGCAAGATCG GCACGCTGGC GCTGGCCTCC TGGGCGGTCT CCCAGATGCA GCTGGCCGCC AAGACCGCCG GCGCCGACCC CGCCAGCATC GGCTTCATGC CCTACCCGAC GCAGGTCGGC GGACACTTCT GCTCCGTGGT CTCGCCGGAC TACATGGAGG CCGTGAGCAT CCACTCCCAG CACAAGGCCG CGGCGCGCGC CTGGGTCGAC TGGTTTGTCG ACAAGTGCAC CTACGCTCAG GACCAGGGTC TGCTCCCGAC GTTGAAGACC GGGGCGATGC CGCCGGAGCT GGCCGCGTAC CAGAGTGCCG GCGTGCAGTT CATCGAGCTC GCGCAGAACG CCAACACGCA GATCTCCACC ATCGACAACG ATTCCGAGAT CGGTCTGCAG AAGCCGGACT ACCGGCAGCA CATCGTGGAC TTGGCGCGCG GCGCCGCCGG CGGGAGTCTG GACGGTTATT TCGCCGACCT GGACAAGAAA TGGGCGGCAG CCGTGAAGAC CGCCGCCGGG TCCTGA
|
Protein sequence | MRKRTTVLAA VITACALALA ACSGGAHGTG GSAAATATDP AGVSGDITVL THKTDLAADG TLARYAAEFN KIYPNVHVKF EPVVDYEGDV KIRLNSSDYG DVLMIPASVP VADYPKFFAP LGTPADLDQK YRFIDHGTYS GQVYGIAING NATGMVYNKT VWQQAGVTSW PTTPDQFLAD LQAIKTKTQA TPLYTVYHEG WPMTAWQSYL GELSCDPKAS DDLATDAAPW GPGKELNQID TMLYNVVHNQ LTEKDPTTTA WDAAKSGMGS GKIGTLALAS WAVSQMQLAA KTAGADPASI GFMPYPTQVG GHFCSVVSPD YMEAVSIHSQ HKAAARAWVD WFVDKCTYAQ DQGLLPTLKT GAMPPELAAY QSAGVQFIEL AQNANTQIST IDNDSEIGLQ KPDYRQHIVD LARGAAGGSL DGYFADLDKK WAAAVKTAAG S
|
| |