Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_8321 |
Symbol | |
ID | 8339700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 9645092 |
End bp | 9647023 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644961407 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003118985 |
Protein GI | 256397421 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.860873 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCA GAAATACCCG AAAAATCGTG GCGATAGCGG GTGTGCTCGC TGTCCTCGCG TCCGCCGCCG CGTGTGGGAG CTCGAAGAAG AGCTCCAACG GCAACGGCGC GACGAACCCG GTCACGTCGG CCAGCAGCTC GTCTGCACCG GCCAAGCAGG GCGGCGTCGC ACACGTCGCG GAGTGGCCGG CCGGCTCGAG CCCGGACGCG ATCTGGCCGT TCATGAGCAG CGAGCAGCTG AGCACTCAGA ACGCGGGCCA GTTCCAGTAC TACTTCTACC GCCCGCTGTA TTTCGTCGGC CTCAACGACA AGCTCGCGGT CAACTACGAC ATGGGTCCGG CGGAGAAGCC GACCTGGAGC GCGGACGGCC TGACCATCAC GGTTCCGCTG AAGTCGACCT GGAACTGGAG CAACGGCGAG AAGGTCACCG GCCAGGACGT CCAGTTCTGG CTGAACATGA TGAAGGCCGA GGAGAAGAAC TCGGGCTACT ACAGCCCGCC GAACGCGGCC GCGGCGGTCA ACTACCTGCC GGACAACGTC AAGTCCACGA CGGTCAGCGA CTCCAGCATC AGCATCACGT TCGACCAGCA GTACAACCAG AACTACATCG TCGGCAACGC CCTGCAGACG GTCACGCCGA TGCCGCTGGC CTGGGACGTC ACCGACGGCA ACGGGACCAA GGGCAAGTGT TCGACGGACA CCCTGACCTC CCCGACCCTG CAGGCCGACT GCGACGCGGT CTGGAAGTAC ATGAACGCGG CCGGCAAGGA CGTCAAGACT TTCGCCAGCA ACCCGCTGTG GAAGATCGTC GACGGCCCGT GGGTTCTGAA GGACTTCAAC GCCACCTCCG GCGGGTTCTC CATCGTGCCG AACACCAAGT TCACCGGCGA GCACAAGCCG GTCCTGGACG AGGTCGACTT CGTGCCGTTC CAGAGCCAGG ACGCGGAGTG GACGGCCCTG AAGGCCGGCT CGACCGCCGC GAACTCGCTG CAGATCGGCG TGTTCCCGAA CGCCGACTCC CCGCAGTACA ACGGTGACAA CCTGCAGGCG GGCAACCCGC TGCTCTCGGC CGGGTACGAC GTCGAGAAGG GTCCGCTGCT GGACTCGATC GGCTACTACC AGGTGAACTT CGGCTCCAAG AACCACGGGA ACCTGTTCAA GCAGCCGTAC TTCACCAAGG CGCTGCAGGA CGACATGGAC CAGACCGGCG CCATCAAGGG CCCGTACAAG GGGTGGGGCT ACCCGACCAC CGGCATCGTG CCCGGCTACC CCGACGGCAA CGTTCTGTCC CCGGCCGCCA AGGCCGCCGC GGCGACCTTC AACCCGACCG AGGCCAAGTC GCTGATGCAG GCGCACGGCT GGGATCTGTC GACCACCCCG GCCACCTGCA AGACCCCCGG TACCGGTGAC AACCAGTGCG GCGCGGGCAT CAACGCCGGC GACAAGGCGG AGTTCACGCT GGAGTACCCC TCGGCGCACT CGGCCATGGA CACCATGCTG GCCTCCTACA AGCAGACCGC GGCCCAGTCC GGCATCGGGA TCACCCTGAC CACCAAGACC CAGAACACCC TGGGCGGCGA GCTGGTCGGC TGCGACCCCA GCACCCCGGC GGGCTGCCAG TGGGACGCGA TTCTTTACGG CGGCTGGGTG TTCTCGCTGA ACCCGACCGC GGACTCGCTG CTGACCACCG GCGCCGGCTC GAACATCTTC GGGTTCTCCG ACCCGAAGTT CGACGCCGCG GTGGCCAAGA CCATCAAGAG CAGCGACCCG CAGGCCTGGT ACGACTACGA GGCCTACGCC TCCAGCATCT CCCTGCCGCT GATCTTCATG AACAACGACA TCTGGCCGTT CGCCGTGGCG AAGAACTTCC ACGACTCCGG TCAGGACGCG TTCCAGGGCT TCGAGCCTGA GTTCTGGTAC TACACCCAGT GA
|
Protein sequence | MAVRNTRKIV AIAGVLAVLA SAAACGSSKK SSNGNGATNP VTSASSSSAP AKQGGVAHVA EWPAGSSPDA IWPFMSSEQL STQNAGQFQY YFYRPLYFVG LNDKLAVNYD MGPAEKPTWS ADGLTITVPL KSTWNWSNGE KVTGQDVQFW LNMMKAEEKN SGYYSPPNAA AAVNYLPDNV KSTTVSDSSI SITFDQQYNQ NYIVGNALQT VTPMPLAWDV TDGNGTKGKC STDTLTSPTL QADCDAVWKY MNAAGKDVKT FASNPLWKIV DGPWVLKDFN ATSGGFSIVP NTKFTGEHKP VLDEVDFVPF QSQDAEWTAL KAGSTAANSL QIGVFPNADS PQYNGDNLQA GNPLLSAGYD VEKGPLLDSI GYYQVNFGSK NHGNLFKQPY FTKALQDDMD QTGAIKGPYK GWGYPTTGIV PGYPDGNVLS PAAKAAAATF NPTEAKSLMQ AHGWDLSTTP ATCKTPGTGD NQCGAGINAG DKAEFTLEYP SAHSAMDTML ASYKQTAAQS GIGITLTTKT QNTLGGELVG CDPSTPAGCQ WDAILYGGWV FSLNPTADSL LTTGAGSNIF GFSDPKFDAA VAKTIKSSDP QAWYDYEAYA SSISLPLIFM NNDIWPFAVA KNFHDSGQDA FQGFEPEFWY YTQ
|
| |