Gene Information |
Locus tag | Caci_0477 |
Symbol | |
ID | 8331804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 542651 |
End bp | 543949 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644953643 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003111270 |
Protein GI | 256389706 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
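The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Caci_0477.
length = gene_length_bp(542651, 543949)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values in this record, the sketch gives 1299 bp and 432 aa, matching the Gene Length and Protein Length fields.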
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.838392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACGA GAATGCGCAG CGCGATAGCC GTCTTCCTCG GCTTGGGCAC GATGCTCGCC GCCACCGGCT GTGCCGGCAG CAGCGGCGGC GGGGGCGGCG GTTCGTCCGA CGGCAAGGTG ACGGTGACCG TCTGGGAGAA CGCGACGAAC GGCCCGGACG GCCTGCAGTA CTTCCAGAGC GCCGCGAAGC AGTACCAGGC CCTGCACCCG AACGTCACGA TCTCGTTCCA GACGATCCAG AACGAGGCAC TCGACGGCAA GCTCCAGACC GCCCTCAACT CGAACAGCGC GCCGGACGTC TTCTTCCAGG TCGGCGGCGG CAAGATGCGG GCCCAGGTCG CCGCCGGCGA ACTCCAGCCG CTGAACCTCA CCGACGCGGA CAAGACCGAC GTCGGCGCGG CGGCCCTGTC CGGCAGCACG CTCGACGGCA AGGTCTACAT GATGCCGGTC GACACGCAGC CCGAGGGCAT CTACTACAGC AAGAACCTGT TCCAGCAAGC CGGCATCACC ACGACGCCCA CGACGATCGA CGAGCTCGAA GCCGACGTCG CCAAGCTCAA GGCAATCAAC GTCGCACCGA TCGCAGTCGG AGCCAAGGAC GCCTGGCCCG CCGCGCACTG GTACTACAAC TTCGCCCTCC GCGAGTGCAG CCAATCCGTC ATGGCGAGCA CCGCCAAGTC GCTCAAGTTC ACCGACCCAT GCTGGACCAC AGCCGGAAAC GCCCTGGCCA CATTCCTCAA GACCAACCCC TTCCCAGCCG GCTTCCTGAC AACCGCAGCC CAGCAAGGCG CCGGCTCCTC AGCGGGCCTA CTCGCCAATC ACAAGGCAGC CATGGAGCTC ATGGGCTCCT GGGACCCCGG CGTAATCGCC AGCTTGACCC CGGACCAGAA GCCGCTCCCC GACCTGGGCT GGTTCCCGTT CCCCGCAGTA GCCGGCGGCC AAGGCGACCC CTCCGCAATC ATGGGCGGCA ACTCCGCCTA CTCGCTGTCC AAGAAGGCAC CAAAGGAGGC CTTCGGCTTC CTGGAGTTCA TGCTGACCAA GGACCAGCAG GAGGCATACT CCAAGGCCTT CCAATCAATC CCGGTGAACC CGGCGTCCCA GGACGTCGTC ACCACCTCCT ACAACATCTC AGCACTGCAA GCCTTCAACA AGGCCGCCTA CTCAATGCAG TACCTCGACA CCCAGTTCGG CCTAAACGTC GGCAACGCCC TAAACACCGC CGTCGTCAAC CTCATGGCCG GCCGAGGCAG CGCCGCCGAA ATCGTCACGC AGGCCAACGC CGCCGCGGCG AAGGGCTGA
|
Protein sequence | MTTRMRSAIA VFLGLGTMLA ATGCAGSSGG GGGGSSDGKV TVTVWENATN GPDGLQYFQS AAKQYQALHP NVTISFQTIQ NEALDGKLQT ALNSNSAPDV FFQVGGGKMR AQVAAGELQP LNLTDADKTD VGAAALSGST LDGKVYMMPV DTQPEGIYYS KNLFQQAGIT TTPTTIDELE ADVAKLKAIN VAPIAVGAKD AWPAAHWYYN FALRECSQSV MASTAKSLKF TDPCWTTAGN ALATFLKTNP FPAGFLTTAA QQGAGSSAGL LANHKAAMEL MGSWDPGVIA SLTPDQKPLP DLGWFPFPAV AGGQGDPSAI MGGNSAYSLS KKAPKEAFGF LEFMLTKDQQ EAYSKAFQSI PVNPASQDVV TTSYNISALQ AFNKAAYSMQ YLDTQFGLNV GNALNTAVVN LMAGRGSAAE IVTQANAAAA KG
|
| |
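The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch that translates a CDS using NCBI translation table 11, the table listed in this record. Table 11 uses the standard amino acid assignments but allows alternative start codons such as GTG, which is read as methionine at the first position. That is why the gene begins with GTG while the protein begins with M. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical helpers, not part of any database API.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is always read as methionine
        if aa == "*":
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGACGACGA GAATGCGCAG CGCGATAGCC"))  # MTTRMRSAIA
```

The output matches the first ten residues of the protein sequence in this record. The same function can be applied to the full gene sequence, since it ignores the whitespace between the 10-base blocks.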