Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1178 |
Symbol | |
ID | 8332513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 1326870 |
End bp | 1328600 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644954325 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003111944 |
Protein GI | 256390380 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.412652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGCA ATACGTCGTC ACTGCTCGCG GTCGCCATCT CCATAGGCCT GGCGGCCACC GCGTGCTCCA GCACCAAGCA CACCTCGGAC TCCAGCACTG GGGGCGGCTC CAGCGCGTCG TCGTCGGCCG GGGGCTCGTC GACGGGAACC CAGACGAGCT ACAAGCAGGG CGGGACGCTC ACCATCTCGA ACGAGCAGGG CCAGACCTGG CCGTGTTCGT TCAACCCGTA CAACTCGACG TTCAACCTGG AGAGCCTCGG CTTCATCTAC GAGCCGCTCA TCTACACCAA CATCCTGCAG GACGCCAAGG AGACGCCGAT GCTGGCGTCG GACTACAAGT GGAACGCGGA CAAGACCCAG ATCACCTTCA CCATCCGCGA CGGGGTGAAG TGGAGCGACG GCCAGCCGCT GACCGCCGAC GACGTGGCCT ACACGTACAA CCTGATGAAG CAGACCCCGG CGCTGGACAA CTACTCGCTG TGGTCCGCGG CCGGCCTGAC GAGCGTCGCG GCCACCGGCA ACCAGGTCAC GATGACCTTC AAGCAGAACG CGCAGGTGTA CTTCTACACC TTCGCCACGC TGGTCGGCAT CGTCCCCAAG CACATCTGGT CCACCGGCGA CGCCGCGGCG CACCCGGACA CCTGGACCGA CCCGAACCCG ATCGGCTCGG GTCCGTACAC GGTCAAGTGC ACGCCGAACA ACATGGAGTA CAAGGCCAAC GCCAGCTACT GGCAGCCCGG CAAGCCCTAC GTGACCACCC TGGAGTACCC GGCCTACCTG GACAACGGCC CGGCGAACCA GGACCTGGCC AGCGGCAAGG CGCAGTGGGG CTCGCAGTTC ATCACCGGCA TCAAGTCCTT CTACCTGAAC AAGTCGCAGG ACAACCACAC CTGGTCCCCG CCGGTGCTCA ACGTCTCGAT CATCCCGAAC CTGGACCCCT CGCACGCGGC GACCAGCAAG CTCGGCGTCC GCCAGGCGAT CGCCTACGCG ATCGACAAGG CCAAGGTCTC GGCGATCGGT GAGGACGGCC AGCAGCTGAC GGGCAACCAG AGCGGCGTGG TCACCCCGAC CTTCGACAAG TTCAACGACG CGGCGGCGAT CTCCGCGGCC GGGTACGACA AGCCGGACAT GGCCAAGGCC GCGGCGGCGC TGCAGGGTGC CGGGTACAGC CCGAGCAACC CGCTGAACCT GACCATCATC TCGATCCAGG GCTACACCGA CTGGGACGCC TCGATCGCGA TCATCAAGGA CGAGCTCAAG CCGCTGGGCA TCAACCTCAC CGAGAGCTCG CTGACCAACC AGACGTACTA CGACAAGCTC TACAAGGGCG ACTTCGACCT GGCCTACGGC TCCCAGCCCT CCGCCGGACC GTCTCCCTAC ACCGAACTGC GCGCGTGGCT GCACTCGGCC AACACCGCGC CGCTGGGCCA GAGCGCGTCG GCGGGCAACT TCGAGCGGTA CAAGAACCCC GCCGTGGACA GCCTGCTCGA CCAGTACGCC ACGGCCGGCT CGGAGGACCA GCAGGTCTCG ATGATCAAGC AGGTGAGCCA GCACGTGCTG CAGGACCTGC CGTTCATCCC GGTGACCGAG TCCGCGGACT GGTTCCAGTA CAACACCAAG AACTTCGGCG GCTGGCCGAC GTCGGACAAC CCGTACGCGC AGCCCGCGGC CTACAACTAC CCGGACAACG AGCAGGTGCT GCTGCACCTG TACTACAAGC CGGCCCAGTA A
|
Protein sequence | MRRNTSSLLA VAISIGLAAT ACSSTKHTSD SSTGGGSSAS SSAGGSSTGT QTSYKQGGTL TISNEQGQTW PCSFNPYNST FNLESLGFIY EPLIYTNILQ DAKETPMLAS DYKWNADKTQ ITFTIRDGVK WSDGQPLTAD DVAYTYNLMK QTPALDNYSL WSAAGLTSVA ATGNQVTMTF KQNAQVYFYT FATLVGIVPK HIWSTGDAAA HPDTWTDPNP IGSGPYTVKC TPNNMEYKAN ASYWQPGKPY VTTLEYPAYL DNGPANQDLA SGKAQWGSQF ITGIKSFYLN KSQDNHTWSP PVLNVSIIPN LDPSHAATSK LGVRQAIAYA IDKAKVSAIG EDGQQLTGNQ SGVVTPTFDK FNDAAAISAA GYDKPDMAKA AAALQGAGYS PSNPLNLTII SIQGYTDWDA SIAIIKDELK PLGINLTESS LTNQTYYDKL YKGDFDLAYG SQPSAGPSPY TELRAWLHSA NTAPLGQSAS AGNFERYKNP AVDSLLDQYA TAGSEDQQVS MIKQVSQHVL QDLPFIPVTE SADWFQYNTK NFGGWPTSDN PYAQPAAYNY PDNEQVLLHL YYKPAQ
|
| |