Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4984 |
Symbol | |
ID | 8336338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5701558 |
End bp | 5702856 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644958083 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003115685 |
Protein GI | 256394121 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000108283 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.000897425 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGACAA GACGTGTTCG GCTGGCCTGC CTCACCCTTT CCGCCATCTC CCTGCTCACC GTCGCGCTGA CGGGATGCTC CTCCTCGAGC GGATCGGGAT CCGGCGCCAA GAACGTCGCG CTGCGGATGA CCGTCTGGAG CAACGACAAA TCGCAGACAG CGCTGTTCAA CAGCATCGCG GACAGCTACC TCAAGACCCA TCCCGACATC AAGTCGATCA CGTTCGACTA CCTCCCGATC GGCAGCTACA CGACCGCGCT GACGACCCAG ATCGCCGGCA GCAGCCCGCC GGACATGGCC TGGATCCTGG AGCGGGACGC GCCGGACTTC GTCTCCTCCG GAGCGCTCAC CGACGTCTCG GCGGCCCTGC AGAACTCCCC CGGCTACCAG TACGGCGACC TGACCCCGGC AGCGACCAAG CTGTGGACGC AGAATGGGAA GCTCTACGCC TACCCCTTCT CCACGTCCCC GTTCGGGATG TTCTACAACA AGGACCTGCT GACCCAAGCC GGTGTGACCC AGACGCCGGA CCAGCTCGTC GCCGCCGGCC AGTGGACGTG GCAGAACGCC GAGAAGATGG CCGCCCAGGT CGCGGCGCAC ACCGACAAGC AGGGTCTGGT GATCCGGGAC TGGGACTACA AGACCTGGAT CGAGCTGGCG AGCATCTGGC GCGGCTGGGG CGCCGACGCC TGGTCGGCCG ACGGCAAGAC CTGTGACTTC GACGCCCCGC AGATGCAGCA GGCGATGACC TTCCTGCACA ACGCGATCTT CACCGACAAG GCGCTGCCCG CACCGGGCCA GACCGCTGAC TTCTTCGCCG GTGAGTCCGC CATGACCGTC ACTCAGATCA GCCGGGCTTC CCTGCTGGCC AAGCACCCGT TCAACTGGGG GATCGTGCCG CTGCCGTCTG GTCCGACCGG GTCGGCGCAG GTCATCGGAC AGGCCGGCAT CGGTGTGATG ACCAAGGGTT CGCACAAGCA GCAGGCGGCG GACTTCCTGG CCTACTTCAC CGACCCGGCC AACTCCGCCA AGCTCGCCCA GTACTTCCCG CCGGCTCGTC AGAGCCAGCT CAACACCACG ACCCTAGCCG CCGCGAATCC CCTGTTCACC CCGCAGCAAC TTCAGGATGT GGTCATCAAC GGCATCAAGA CCGGCTCGGT GCTGCCGTCC CATGAGAACA GCGCCAAGCT CGCCACCCTC GTGCAGAACG CCTTGGACCC GCTGTGGACG CCCGGAGCCA ACGTCGACTC AGTGCTCGCC GGGGTGTGCA AGGCGATCGA CCCGGCTCTG AGCCAGTGA
|
Protein sequence | MKTRRVRLAC LTLSAISLLT VALTGCSSSS GSGSGAKNVA LRMTVWSNDK SQTALFNSIA DSYLKTHPDI KSITFDYLPI GSYTTALTTQ IAGSSPPDMA WILERDAPDF VSSGALTDVS AALQNSPGYQ YGDLTPAATK LWTQNGKLYA YPFSTSPFGM FYNKDLLTQA GVTQTPDQLV AAGQWTWQNA EKMAAQVAAH TDKQGLVIRD WDYKTWIELA SIWRGWGADA WSADGKTCDF DAPQMQQAMT FLHNAIFTDK ALPAPGQTAD FFAGESAMTV TQISRASLLA KHPFNWGIVP LPSGPTGSAQ VIGQAGIGVM TKGSHKQQAA DFLAYFTDPA NSAKLAQYFP PARQSQLNTT TLAAANPLFT PQQLQDVVIN GIKTGSVLPS HENSAKLATL VQNALDPLWT PGANVDSVLA GVCKAIDPAL SQ
|
| |