Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2014 |
Symbol | |
ID | 8333358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2280283 |
End bp | 2281650 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644955164 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003112775 |
Protein GI | 256391211 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.383085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCCA CGGGTCTCCG TATCGCCTGC GCCTGCGCGG CTTCCGCGCT CGCCCTGACC GCCTGCTCGT CGTCGTCCTC AAGCGGCGGC GGCAGCGGCG GTAAAACGCT CAAGGTCGCC TACGTCAAGT TCGGCAACGA GATCCAGGTC GACGCGCACA TGCAGAAGGT GAAGGCGCAG TTCGAGGCCG CGCATCCGGG GGTCACGGTC AAGCTGGACC CGATCGCCGC GGCCGAGAAC GACTACTACA CTAAGATCGA CCTGATGATG GGCTCCGGTT CGACCGCGCC CGACGTGGTC TACGAGGACA CCTTCCTCAT CAACTCCGAC ATCAAGGCCG GCTACCTCGC CCCGCTCGAC TCCTACCTGG CGAGCTGGAG CGACTGGTCG CAGTTCCCGG ACACGGCCAA GTCCGCCGCG CGCGGCATCG ACGGCAAGAC CTACGGCGTC CCGATGGGCA CCGACACCCG CGCGCTGTAC TACAACAAGG CGATGCTGAC CAAGGCCGGC GTCGCGATGC CCTGGCAGCC CAAGTCGTGG CAGGACGTCC TCGCGGCCGC TAAAGCGGTC AAGGCCACCT CGCCCGGCGT GACGCCGCTG AACGTCTACG CCGGCAAACC CGACGGCGAG GGCAGCACCA TGCAGGGCTT TGAGATGCTG CTGTACGGCA CCAAGGACAC CCTGTACGAC AGCGCCAGCA AGAAGTGGAC CGCCCCGAGC AAGGGCTTCA CCGACTCCCT GCAGTTCCTC AAGGACGCCT ACAGCGGCGG CCTGACCCTC CCGCCGCAGA CGGAGCTGGA CCCCAACATC CCCAACGTGG TCAGCGGCCA GATGCTCCCG CAGGGCAAGC TCGCCATCGA CCTCGACGGC TCCTGGGTGA CCGGTACTTG GGGCACCTCC GGAGCCAAGC CCTGGCCGCA GTGGAACACC GTGATCGGCG AGGCGGCGAT GCCGACGCAG GACGGCTCCG GCAAGGGCAC GATCAGCATG TCCGGCGGCT GGACGCTGTC GGTGACCGCG AAGTCGAAGA ACAAGGATCT CGGGGCGCAG TTCGTCGAAC TGGCGCTGAA CAAGGAGAAC TCCGCCTCTT ACGACATAGC GGACAGCCAG ATCGCGGTGC GCAACGACGT CGCCGCCGAT CCCTCCTACC AGAGCACCAA CCCGACCACC GCGTTCTTCA CCGGCCTGGT CCCGGTCACG CAGTACCGGC CGGCGTACGC CGAGTACCCG AAGATCTCCG ACGCGATCCA GGTGGCGATG GAAGCGGTGA TCACCGGCCA GTCGTCCCCG GCGGACGCGA TGAAGGCGTA CACCGCCACG CTGAAGGGCA TCGTCGGCTC CGACAACGTG GCGGCGGGCG CCAGCTGA
|
Protein sequence | MLSTGLRIAC ACAASALALT ACSSSSSSGG GSGGKTLKVA YVKFGNEIQV DAHMQKVKAQ FEAAHPGVTV KLDPIAAAEN DYYTKIDLMM GSGSTAPDVV YEDTFLINSD IKAGYLAPLD SYLASWSDWS QFPDTAKSAA RGIDGKTYGV PMGTDTRALY YNKAMLTKAG VAMPWQPKSW QDVLAAAKAV KATSPGVTPL NVYAGKPDGE GSTMQGFEML LYGTKDTLYD SASKKWTAPS KGFTDSLQFL KDAYSGGLTL PPQTELDPNI PNVVSGQMLP QGKLAIDLDG SWVTGTWGTS GAKPWPQWNT VIGEAAMPTQ DGSGKGTISM SGGWTLSVTA KSKNKDLGAQ FVELALNKEN SASYDIADSQ IAVRNDVAAD PSYQSTNPTT AFFTGLVPVT QYRPAYAEYP KISDAIQVAM EAVITGQSSP ADAMKAYTAT LKGIVGSDNV AAGAS
|
| |