Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4518 |
Symbol | |
ID | 8335872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5144950 |
End bp | 5146797 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644957620 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003115222 |
Protein GI | 256393658 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.872953 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGAC CCAAACCACT GGTCGCGGCC CTAGCCGTCG CGACGATAGC GGCCACCGCG CTGTCGGCGT GCTCCAGCTC CTCGGCGAAG AAAACCAACG GTGGCACCGG TGGCGGTACC GGCGGGGTCT TCACCAGCAT CGACGCCAAC AACAAGATCA CCGCCGGCGC GCCGATGAAC CCGTACAACG CCGCGCCCAA CATGTTCCTG GGCTACAACA TCATGGAGCT GGGCTTCACC AAGAACGACC CCGCGGACCC CAACGCCCTG CTCCCGGGTC TGGCCGCCAG CTGGACCGCC TCGGACACCG GGCTCACCAT CCAGCTGCAG CCCGGCGCCA AGTGGTCCGA CGGCACCCCG GTCACCGCCG CGGACATCAA GACCTCGCTG GCCATCGCCT ACACGCAGGG CACGGCAGGT CCCGTGGCCG GCGCCGGCGG CACCGTCGTG GCCGGCAGCA ACTTCGAGGT CTCCGACGTC AAGGACCTCG GCGGCGGCAA GATCGAGATC GACCAGCAGC CCGGCGTGAA GAACCTGTAC TTCCAGCGCC TGGTGCTCAC CTCGACCATC GTCAACGACA AGGTCTACGG CAGCCAGCTC CCAGCGGACA TCTGGACCCA GATCGCCGCC GTGCAGGGCA CCGACGCCGC CGCGGCGTCC GCGGCGTCCA CCAAGCTGGC CGCCGAGGGC AAGACGATCG CCGCCTTCGC CCCGGCCAAG GACATCTCGG CCGGCCCGTT CGTGGAGACC CGGGTCAACC CCGGCGAGGC GCTGCTGGAC CGCAACCCCT ACTTCTACGC CGCGAGCAAG ATCTCGCCGA AGCAGGTCAT CCTGCGCAGC TACTCCGGCA ACCAGCAGAT CTGGGGCTAC ATGAACGGCG GCGAGCTGGA CTACGCCCCG TACACCTCGA TGCCCACCAA CATCCTGAAC CAGGTCCTCA AGGCCGGCTA CACCCGCATC GACGCCCCCA GCTACGTCAG CGCCTCGATC GCGTTCAACG AGAAGCAGGC GCCGTACAAC CTGACCCCGG TGCGCCAGGC GCTGGCCTAC GTCATCGACC GCGACGCCGT CACCAAGGTC GGCGAGCCGG TCGGCGGCAT CGCCGCCCCG ACCACCACCG GCCTGGTCGG CTCGCAGTCC GACACGATCT TGTCCGCCGA CCAGAAGGCG GCGCTGAACC CCTACAAGCC GGACCCGGCC AAGGCCGCGT CCCTGCTGCA GGGCGCAGGC TTCACCAAGG ACGCCTCCGG CCAGTGGCAC CTGCCCGACG GCACGCCGTG GAAGATCACG CTGCAGACCG TGAACGGCTT CTCCGACTGG ATCGCGGCCT CCACGATCGT GGCCAACGAG CTGACCCAGT TCGGCATCCC GACCACCGCG GCGATCACCG CCGACTTCGC CACGTACCAG AAGGAGATGG GCGCCGGTAA GTACGCGGTC GGCTGGTGGC TGGTCGCCCT GGGCCCGCAG ACGGACAAGG CCTACGCCCG CATCTACGGC TCCGCCGACG GCTTCAGTGT CGCCAACGGC CAGGCCACGC ACAACGACAG CGCGGCCGGC AACTGGGAGC ACACCCCGGC GACCTACACC GTCAACGGCC AGAGCATCAA CCCCGGCCAG CTCGCCGCGC AGCTGTCGGT GACCCCGGTC TCCGCCCAAG GGCCGATCAT CGCCCAGCTG GCAGCGGCCA CCAACCAAGA AGTGCCGATG ATCCAGATCT GGAACTACAC CCACGTGATG TTCACGCTGG ACAAGCGGTT CACGAACTAC CCGAAGACCG GGCAGGACGA TCTGCTGGCC AACCCGCCCG GCGTGTGGAT GATGCAGGGG TACGTGCAGG GCAAGTAG
|
Protein sequence | MSRPKPLVAA LAVATIAATA LSACSSSSAK KTNGGTGGGT GGVFTSIDAN NKITAGAPMN PYNAAPNMFL GYNIMELGFT KNDPADPNAL LPGLAASWTA SDTGLTIQLQ PGAKWSDGTP VTAADIKTSL AIAYTQGTAG PVAGAGGTVV AGSNFEVSDV KDLGGGKIEI DQQPGVKNLY FQRLVLTSTI VNDKVYGSQL PADIWTQIAA VQGTDAAAAS AASTKLAAEG KTIAAFAPAK DISAGPFVET RVNPGEALLD RNPYFYAASK ISPKQVILRS YSGNQQIWGY MNGGELDYAP YTSMPTNILN QVLKAGYTRI DAPSYVSASI AFNEKQAPYN LTPVRQALAY VIDRDAVTKV GEPVGGIAAP TTTGLVGSQS DTILSADQKA ALNPYKPDPA KAASLLQGAG FTKDASGQWH LPDGTPWKIT LQTVNGFSDW IAASTIVANE LTQFGIPTTA AITADFATYQ KEMGAGKYAV GWWLVALGPQ TDKAYARIYG SADGFSVANG QATHNDSAAG NWEHTPATYT VNGQSINPGQ LAAQLSVTPV SAQGPIIAQL AAATNQEVPM IQIWNYTHVM FTLDKRFTNY PKTGQDDLLA NPPGVWMMQG YVQGK
|
| |