Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6971 |
Symbol | |
ID | 8338337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 8061540 |
End bp | 8062835 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644960051 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003117642 |
Protein GI | 256396078 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.487837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCTCCA CTAAGTACGC GGTCTGCGTG GCATTGGCCG CGGCCGTGTC GCTGGTTTTG GCCGGCTGCG GGAGTTCCAG CTCCAAAGCC GCCTCGGGCG GCGGCGGCAA GACGCTCGTG GTCTGGGACT ACGAAGCCAA CGACAGTGCC TCCGGCATCG CACGCGCCGA GGCGATCAAG GAGTTCCAGG CCGCCCATCC GGGGGTGACG GTCAAGTTCG AGGCGAAGAG CTTCGACCAG ATCCAGCAGA ACGCCGGGAT GATCCTCAAC TCCAACGACG TGCCCGACGT GATGGAGTAC AACAAGGGCA ACTCCACCGC CGGCCTGTTG TCCAAGCAGG GTCTGCTGAC CGATCTGAGC AGCCAGGCCG CCTCGCGCGG CTGGGACAAG ACGTTGAGTC CGTCGCTGCA GACCACCGCC AAGTACACCG GCGGCATCAT GGGCGGCAGC ACTTGGTACG GCGTGCCGAT GAACGGCGAA TTCCTCACCG TCTACTACAA CAAGGACCTG TTCGCGAAGT ACAACGTCCC GGTCCCCACC ACGCCCGATC AGTTCACGGC GGCGATGGCG ACGTTCAAGG GCGCCGGGGT CACCCCGCTG GCCATGAGCG GCGCGGACTA CCTCGGGGTG CACCTGTTCT ACGAACTGGC GCTGTCCAAG GCCGACCGCA CCTGGGTCAA CGACTACCAG CTGTTCAAGG GCAAGGTCGA CTTCCAGGGC CCGCAGATGT CCTACGCCGC GAACACCTTC GCCGACTGGG TGAAGAAGGG CTACATCAGC AAGGACTCCG CCGCGGTCAA GGCCCAGGAC GAGGCCAACG CCTTCGAGCA GGGCAAGATC CCGATGATGT TCTCCGGGAA CTGGTGGTAC GGCCAATTCC TGAGCGAGGT CAAGGGCATG CAGTGGGGCA CGTTCCTGTT CCCCGGCAAC ACGCTGCAGG TCGGCTCCAG CGGCAACCTG TGGGTGGTGC CCACCAAGGC CAAGAACAAG GACCTGGCCT ACGACTTCAT CGACACCACG CTCAGCAAGA ACGTCCAGAA CCTGATGGGC AACAGCGGCG GGGTGCCGGT GGCCGCGGAT CCGGCGGCGA TCACGAACCC CAGCAGCAAG GAACTGATCA CCGAGTTCGA CTCGATCACC GCCAAGGACG GCCTGGGGTT CTACCCGGAC TGGCCGGTCA CCGGCTACTA CGACACGCTC CAGCACGCGA TCCAGGAACT GATCAACGGA TCCAAGAGCC CGAGTTCGAT GCTCGACACC ATCGGCTCCG CCTACAAGCA GAACGCACCG CAGTAG
|
Protein sequence | MRSTKYAVCV ALAAAVSLVL AGCGSSSSKA ASGGGGKTLV VWDYEANDSA SGIARAEAIK EFQAAHPGVT VKFEAKSFDQ IQQNAGMILN SNDVPDVMEY NKGNSTAGLL SKQGLLTDLS SQAASRGWDK TLSPSLQTTA KYTGGIMGGS TWYGVPMNGE FLTVYYNKDL FAKYNVPVPT TPDQFTAAMA TFKGAGVTPL AMSGADYLGV HLFYELALSK ADRTWVNDYQ LFKGKVDFQG PQMSYAANTF ADWVKKGYIS KDSAAVKAQD EANAFEQGKI PMMFSGNWWY GQFLSEVKGM QWGTFLFPGN TLQVGSSGNL WVVPTKAKNK DLAYDFIDTT LSKNVQNLMG NSGGVPVAAD PAAITNPSSK ELITEFDSIT AKDGLGFYPD WPVTGYYDTL QHAIQELING SKSPSSMLDT IGSAYKQNAP Q
|
| |