Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7779 |
Symbol | |
ID | 8339155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 9019113 |
End bp | 9020906 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644960863 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003118444 |
Protein GI | 256396880 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.354248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTCA GCAGGAGGAC ACTCGCGCTG CAGACCGGAG CGATCGCGGC GGCGGTGGCC TTGGCCGCCA CCGCCTGCGG CAGCAGCAAG AGCGGCGGAG GATCCACCAC CGGTTCCGGT TCGGGCGCCG GCACGCCCAT CGCCGACCGG AACTCGGTCA ACGCCGCGAC CGTGAAGCAG GGCGGGAAGA TCACCTGGAC CATCGAGAAG ACGGTCCAGG ACTGGAACCC GCTCACCTCT CTGGGCAACA CGTTCGACTA CGCGCAGACC ACGAACGGCA TCTACCCGGA CGTCTACGTC CCGCAGCCGG ACTACTCGCT GGTGCTGAAC ACCGACCTGA TGGCCGGGGA CCCGGTGGTC ACCAACGCCA CCTCGACCGA GCCGCAGAAG ATCGTCTACA AGATCCAGCC GAACGCCAAG TGGTCCGACG GGACCCCGGT CACCGCCGAC GACTTCATCT ACCTGTGGCA GGCGCAGAAC GGCACCAACC CGAACGTCGA CGTGGCCAGC ACCACCGGCT ACAGCGACGT GGCCTCGGTG ACCGGCAGCG ACAACGGCAA GACCGTGACC GTCGCCTTCA AGCAGGACAA GCCCTTCTCG GACTGGAAGA GCCTGTTCAC CTCGATCCTG CCGGCGCACG TCGCCAAGCA GCACGGCGAC GTCGCGGCCT CCTTCACCTG GCTGGACGCC AACCCCCCGA CGGTCTCCGC CGGCCCGTTC GAGATCGCCC CCGGCGGCGT CTCGGCCGAC AAGAGCCTGA TCAAGACGAT CAAGAACCCG CAGTACTACG GCAAGCCGGC CAACCTCGAC GAGGTCGACT TCCGCGCGAT CACCGACTCC TCGCAGGAGC CGACCGCGCT GGCCAACGGC GAGGTGGACG GCATCTACCC GCAGCCGCAG CTGGACCTGG TGAACCGGGT CAAGAGCATC GCCGGCGTGG ACTACCACAT CAACCAGGGC CTGGTCTGGG AGCACATCGA CCTGAACCTG CGCAACAGCG CCTTCGGCGG CCCGGCCGAC GCCGACCAGA CCCAGCCGGC CAAGGTCGCG CTGCGCCAGG CGATGTTCAC CGCCTTCGAC CGGCTCGGCC TGCTGAACCG GACGATCAAG CAGTTCGACA GCGACGCGGC GGTGCTGAAC AACCGCATGG TGGTGCCCGG CCAGCCCGGC TACCAGGACA ACGCCTCCGC GATGTACCCG GAGTCCGGGG ACCTGAACAA GGCCAAGCAG CTGCTCACCA CGGCCGGCTA CAAGGGCGTG GGCACCGCGC TGGTGGACCC CAGCGGCAAG GCCGTCCCGG CGTTCAGCAT GCGCTACACC GTCGGCAACC AGCTGCGCCA GGACACCTGC AACCTGTTCG CGCAGGCCAT GAAGCAGCTG GGGATCACGG TCAACGTCAG CTCCACCGAC GCCCTGGGCA AGACCCTGAC CCAGTCCGAC GCGCAGCACA CGTACGACAT CATCGTCTTC GCCTGGGTGG ACACCCCGTT CCCCAACTCG GCGAACCAGC CGCTGTACAC CACCACGACG CAGGGCAACC CGCAGAGCAA CTACGGGTAC TACAGCAACG CGAACGTGGA CAAGTGGCTG GCCGACGCCA CGGTCAACCC CGACCAGACG GCCCGGGAGA AGGACCTGAA CCAGGCCGAC GCGCAGATCA CCAAGGACGC GTACACGCTG CCGCTGTACC AGAAGCCGAC GATGATCGCG TACAAGAACA CCCTGGGCAA CGTGCGGGAC AACCCGACGC AGATCGGCCC GACGTACAAC ATCGCGCAGT GGGGCCAGAA GTAG
|
Protein sequence | MGVSRRTLAL QTGAIAAAVA LAATACGSSK SGGGSTTGSG SGAGTPIADR NSVNAATVKQ GGKITWTIEK TVQDWNPLTS LGNTFDYAQT TNGIYPDVYV PQPDYSLVLN TDLMAGDPVV TNATSTEPQK IVYKIQPNAK WSDGTPVTAD DFIYLWQAQN GTNPNVDVAS TTGYSDVASV TGSDNGKTVT VAFKQDKPFS DWKSLFTSIL PAHVAKQHGD VAASFTWLDA NPPTVSAGPF EIAPGGVSAD KSLIKTIKNP QYYGKPANLD EVDFRAITDS SQEPTALANG EVDGIYPQPQ LDLVNRVKSI AGVDYHINQG LVWEHIDLNL RNSAFGGPAD ADQTQPAKVA LRQAMFTAFD RLGLLNRTIK QFDSDAAVLN NRMVVPGQPG YQDNASAMYP ESGDLNKAKQ LLTTAGYKGV GTALVDPSGK AVPAFSMRYT VGNQLRQDTC NLFAQAMKQL GITVNVSSTD ALGKTLTQSD AQHTYDIIVF AWVDTPFPNS ANQPLYTTTT QGNPQSNYGY YSNANVDKWL ADATVNPDQT AREKDLNQAD AQITKDAYTL PLYQKPTMIA YKNTLGNVRD NPTQIGPTYN IAQWGQK
|
| |