Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1289 |
Symbol | |
ID | 8332624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 1463641 |
End bp | 1465473 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644954436 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003112055 |
Protein GI | 256390491 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.172232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCT CCAAGCTGTT GGGAGGCGTC GCGCTCGCCG CGTGCGTCGC CCTGGCAGCC TCGGCGTGCT CCAGCTCGAA GACGAACTCC GGCTCGAGCG CGCCGACGAC TGCCGTCACC ACCCTGCCTT CGGGCAACAC CAGCGCCGCC ACCGGCTTCA ACGCCGCCGT GACCGGCATC GTCCGCCCGA CAGACGCCGA GGGCGGCACC ATGACACTGG TGGACCGCTC CGACTTCGAC TCGCTCGACC CGGGCAACAC CTACGACGCC TTCTCCTGGT CGATCATCGG CGACTGGGCT CGTCCGCTGA TGACCTACGC CCAGCAGCCC GGCAACGCCG GCGCCAAGGT CGTGCCGGAC CTGGCCGAGG GTCCCGGTGT CGTGTCCAAC AACGGCATGA CCTGGACCTA CAAGATCAAG CCGGGCGTGA AGTACCAGGA CGGCTCCGTG GTCACCTCCG CGGACGTCAA GTACGCGATC GAGCGCTCCA ACTGGGGCCA GGACACCCTG GTGAACGGTC CGGCGTACCT GCCGAACTTC ATCCAGGACA CCACGAAGTA CGCCGGTCCC TACAAGGACA AGAACCCGAA CGACGGCGTC TCCGGCATCA CCACGCCGGA CAACCAGACC ATCGTGTTCA ACCTGACCTC GCCGTTCTCG GACTTCGACT ACCTGATGGC GCTCCCGGGC TCGGCCCCGG TGCCGCGGGC GAAGGACACC GGCGCGGACT ACTTCAAGAC GCTGCTGTCG ACCGGTCAGT ACAAGGTCGA CAACTACCAG GTCGGCAACG AGCTGGACCT GTCGCCGAAC CCGAACTTCG ACAAGTCCAC GGACCCGGAC AAGCTGCACG TCGTCCGCGC CTCGAAGATC GTGGTCAAGC TCAAGCAGGA CAAGGCGACG ATCGACGACA CGCTGTTCGA CGGCTCGGCC AACGCCGACC TGACCGGCGT GGGCGTGCAG CCGGCCACCC AGAGCAAGAT CCTGGGCGAC CCGAAGCTCA AGGCCGACTC GGACTCCGCG TACGCGAACT CCACTGAGTA CTTCGCGATC AACGCGACTC AGAAGCCGTT CGACAACGTG AACTGCCGCC AGGCCATCGA GTGGATCATC GACAAGGCCA CGCTGCAGAC CGAGGCCGGC GGCTCGCAGG GCGGCGGCGA CATCGCCAGC ACCATCGACC CGCCGACGAT CCCGGGCTGG AAGGCCGGCG ACCAGTACCT GACGGCGGGC AACAAGGGCG ACGCCACCAA GGCGAAGGCC TCGCTGGCCC AGTGCAAGAC CGCTGAGCCG GACGCCTTCA ACGCCGACGG CAGCATCAAG GACACCTTCG AGATCATGAC TCGCGACAAC TCCACCAAGG AGGCGACCAT GGTCCAGACG ACGCAGACCA ACCTGAAGTC GATCGGCATC AACACCACCA TCGACACCAA GCCGTTCGAC AAGTACAACT CGCAGTTCGC CGGCAACAAG ACGTACGTGG ACCAGCACAA GGTCGCCATC AGCTTCATGA AGTGGGGCGC CGACTTCCCG TCGGGCTACG GCTTCATGTA CAGCCTGCTG GCCTCCTCCG CGATCCACCC GTCCGGCGGC TACAACCTGT CCTGGTACAA GGACGACGCG ATCGACCAGG GCTTCACCAA GGCTCTGGGC GAGAACGACC CGACCGCCCG TGGCGCCGAC TACGCGGCGA TCGACCACCA GGCGCTGGCC GACGCGCTGG TCGTCCCGCT GATCTGGGAC AAGAACCTGG TCTACCGGCC GGAGTCGACG ACCAACGTCA TCTTCAGCCA GGGCTACGGC ATGTACATCT TCTCCGCGAT GGGCGTGAAG TAA
|
Protein sequence | MNRSKLLGGV ALAACVALAA SACSSSKTNS GSSAPTTAVT TLPSGNTSAA TGFNAAVTGI VRPTDAEGGT MTLVDRSDFD SLDPGNTYDA FSWSIIGDWA RPLMTYAQQP GNAGAKVVPD LAEGPGVVSN NGMTWTYKIK PGVKYQDGSV VTSADVKYAI ERSNWGQDTL VNGPAYLPNF IQDTTKYAGP YKDKNPNDGV SGITTPDNQT IVFNLTSPFS DFDYLMALPG SAPVPRAKDT GADYFKTLLS TGQYKVDNYQ VGNELDLSPN PNFDKSTDPD KLHVVRASKI VVKLKQDKAT IDDTLFDGSA NADLTGVGVQ PATQSKILGD PKLKADSDSA YANSTEYFAI NATQKPFDNV NCRQAIEWII DKATLQTEAG GSQGGGDIAS TIDPPTIPGW KAGDQYLTAG NKGDATKAKA SLAQCKTAEP DAFNADGSIK DTFEIMTRDN STKEATMVQT TQTNLKSIGI NTTIDTKPFD KYNSQFAGNK TYVDQHKVAI SFMKWGADFP SGYGFMYSLL ASSAIHPSGG YNLSWYKDDA IDQGFTKALG ENDPTARGAD YAAIDHQALA DALVVPLIWD KNLVYRPEST TNVIFSQGYG MYIFSAMGVK
|
| |