Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7148 |
Symbol | |
ID | 8338516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8314242 |
End bp | 8316002 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644960229 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003117818 |
Protein GI | 256396254 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.118228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAT CCAGGCGCCG GTCACGGTCG GCCATCGCCG TCACCGTGCT GGCAACACTG ACGATCACCG CCGGGTGCTC GTCGTCGTCC GCCAAGAAAT CCGGCTCCGG TCAGCAGGAG GCCGCGCAGA ACGTGGCCAA GCAAGCCGTC GCGGTCGGCA CGGCTGCCGA CTCCCGGGGA CCGGATCCGG CAGTGTCCGG CGCCAAGTCC GGCGGGACTG TCACAGTGCT CGAACACTCG GACTTCAGCC ACCTGGACCC GGCCCGCGTC TGGTCCTCCA CCAACCAGAC CGCCGATCTG CTGCTGACTC GCCAGCTCAC CAGCTACCAG CAGGTCGGCA ACACCACCAA GCTGGTCGGA GACCTGGCCA CCGACACCGG GAGCAGCACC GACGGCAAGA CCTGGACCTA CCACCTCAAG GCCGGCCTGA AGTACGAGGA CGGCAGCACG ATCACCGCTC AGGACGTGAA GTACGGCATC GAGCGCACCT TCCAGAAGGA GCTGTCCGGC GGTCCGCAGT ACCTCCAGAT GTGGCTGACC GGCAAGACCG ACTACTCCTC CACCTACTCA GGGCCCTGGG GCGGCCAGGA CCTGCCGCAG ATCCAGACGC CCGACGCCAC GACGATCGTC TTCCACCTCG CGTCGGTGCA CGCCGACTTC CCGTTCGCGT TGGCGATGCA GGCTTACAGC CCGCTTCCCA AGGACAAGGA CGCCAAGTCC GCCCTGGACC AGCACCCCTT CTCCTCCGGG CCCTACAAGG TCGACAGCCA CGACATCGAC AAGGGCATGG TCCTGTCCCG CAACACCCAT TGGGACGCGA ACACCGACCC GGTCCGCCAC GCCTACCCCG ACCAGTGGAA GTTCGAGTTC GGGGCGCAGG ACGTCGACAT CAACCAGCGC CTGGCGGCCG CCAACGGCGC CGACAAGGAC GCGATGACCT TCAAGGTCAC CATCGGCTCG GACCTGGCAT CGCAGGTGAA CTCGAGCCCT GATCTGAAGG CTCGTCTGGT CAACCAGGTC ACGCCCTTCT CAGAGTTCTA CAACATCAAC ACCCGGCGCG TGACGGACGT CAAGGTCCGC GAAGCCCTGC TGGAGGCGTT CCCGCGCGCC CAGACGCGGC AGCTGCTCGG GGGACCGATC TACGGCGACT TCACCACCAC GATCCTGTCG CCGGTGACCA ACGGCTACCA GGACTACGAC CTGTACGGCG CCCCCGACAC CGGCGACCCA GCCAAGGCCA AGGCGCTGCT GGCGACGACA TCGACGCCGC ACCCGACCAT CGTGTACGCC TACCAGGACG ACACCGCCTG GCAGCAGGGC GCCGTCGCCA TCCAGCAGGC GCTGACCAAG GCCGGCTTCA CCGTCGTCAC CAAGGCGATC AGCGACAAGA ACTACTACGA CGAGACGCAG AAGACTGACA ACCAGTTTGA CGTCTACTGG GGCGGCTGGG GTCCGGACTG GCCCAGCGCC TCCTCGGTCA TCCCGCCGTT GTTCGACGGC CGGCAGATCA CCGACGGCGG CAGCGACAAC TCGCTGCTGA ACGACCCGAC GGTGAACGCC GAGATCGACC GGATCCAGTC CATGACCGAC CTGAGCCAGC AGAACACCGC CTGGGCCGCG CTGGACAAGA AGATCATGCA GGAGGTCCCC ATCATCCCCT GGGTCGATCC CCGGCAGGTC TCGCTCTATG GGCCCGGACT CGGTGGCGTC CACACCGGCT TCATCGGCAC CTGCTATCCG CTCGACGTCT ACGTCAAGTA G
|
Protein sequence | MSASRRRSRS AIAVTVLATL TITAGCSSSS AKKSGSGQQE AAQNVAKQAV AVGTAADSRG PDPAVSGAKS GGTVTVLEHS DFSHLDPARV WSSTNQTADL LLTRQLTSYQ QVGNTTKLVG DLATDTGSST DGKTWTYHLK AGLKYEDGST ITAQDVKYGI ERTFQKELSG GPQYLQMWLT GKTDYSSTYS GPWGGQDLPQ IQTPDATTIV FHLASVHADF PFALAMQAYS PLPKDKDAKS ALDQHPFSSG PYKVDSHDID KGMVLSRNTH WDANTDPVRH AYPDQWKFEF GAQDVDINQR LAAANGADKD AMTFKVTIGS DLASQVNSSP DLKARLVNQV TPFSEFYNIN TRRVTDVKVR EALLEAFPRA QTRQLLGGPI YGDFTTTILS PVTNGYQDYD LYGAPDTGDP AKAKALLATT STPHPTIVYA YQDDTAWQQG AVAIQQALTK AGFTVVTKAI SDKNYYDETQ KTDNQFDVYW GGWGPDWPSA SSVIPPLFDG RQITDGGSDN SLLNDPTVNA EIDRIQSMTD LSQQNTAWAA LDKKIMQEVP IIPWVDPRQV SLYGPGLGGV HTGFIGTCYP LDVYVK
|
| |