Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5949 |
Symbol | |
ID | 8337311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 6868172 |
End bp | 6869332 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644959053 |
Product | solute-binding protein |
Protein accession | YP_003116648 |
Protein GI | 256395084 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAGG CGATACTCGC AATCACCGCA CTCGGCGCCG CGGTTGCGTT GAGCGCAGCT GGATGCAGTA GCTCTAAGAG CAGCAGCAGC GGTTCCACCA CGGGCGGTGG CAGCTCGACG ACCAGCGCCG CTGCGGGCTC CAGCAGCAGC AGTAGCAGCA GTGGCACGCC CACGTACAAG AACAACAAGG TCGGCATCCT GCTGCCGGAC ACCAACTCCT CGCCGCGGTG GGTCAACTCC GACCCCGACG AGCTGAAGAC GCAGTGCGCG CAGTACGGCC TGACCTGTGA CATCCAGAAC TCCAACGGTT CTGCCACGAC GATGACCTCG CAGGCGCAGT CGATGCTGAA CGAGGGCGTC GGCGTGCTGA TGCTCACCAA CCTGGACTCC GGCTCGGCCA AGGCGATCGA GGCGCAGGCG CAGGCCAAGG GCGTCGTCAC CATCGACTAC GACCGGCTCA CGCTCGGCGG CACCGCGCAG TACTACGTCT CCTTCGACAA CGTGGCCGTC GGCAAGGCGC AGGGCACCGC CCTGACCAAG TGCACCCAGG TCGCCGGCAA GACCGCGGTG AAGTACGTCG AGGAGGACGG CGCCGCGACC GACAACAACG CGACGCTGTT CAAGCAGGGC TACGACAGCG TGCTGAAGGC GCAGACCGGC TGGACCCAGG CCGGCGACCA GTCCGGCAAC TGGGACAACC CGGCCGGCAC CGCGCAGTCG GTGTTCCAGA AGCTGCTGCA GGGCGCTCCG GACCTGAACG CGGTCATGGT CGCCAACGAC GAGATGGCCA ACGCGGCCAT CACCGTCCTG AAGCAGCAGG GCCTCAACGG CAAGGTGGCT GTCTCCGGCC AGGACGCGAC CGCGACCGGT CTGCAGAACA TCCTCAACGG CGACCAGTGC TTCACGATCT ACAAGCCGGT CAAGGGCGAG GCCGACGTGG CCGTCAAGCT GGCCAGCCAG GTCCTGTCCG GCCAGAAGCC GACCGCGCCG GCCGTGGTCC ACGACCCGAC CGGCAACCGT GATGTCCCGT CCTACCTGGC GACCCCGGTC GTGGTGGACA AGTCCAACAT CACCCTGCCG TTCACCGACG GCTACCAGAA GGCCGCCGAC GTCTGCACCG GCGACTTCGC CGCCAAGTGC ACGGCGGCCG GCATCAAGTA G
|
Protein sequence | MRKAILAITA LGAAVALSAA GCSSSKSSSS GSTTGGGSST TSAAAGSSSS SSSSGTPTYK NNKVGILLPD TNSSPRWVNS DPDELKTQCA QYGLTCDIQN SNGSATTMTS QAQSMLNEGV GVLMLTNLDS GSAKAIEAQA QAKGVVTIDY DRLTLGGTAQ YYVSFDNVAV GKAQGTALTK CTQVAGKTAV KYVEEDGAAT DNNATLFKQG YDSVLKAQTG WTQAGDQSGN WDNPAGTAQS VFQKLLQGAP DLNAVMVAND EMANAAITVL KQQGLNGKVA VSGQDATATG LQNILNGDQC FTIYKPVKGE ADVAVKLASQ VLSGQKPTAP AVVHDPTGNR DVPSYLATPV VVDKSNITLP FTDGYQKAAD VCTGDFAAKC TAAGIK
|
| |