Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6724 |
Symbol | |
ID | 8338088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 7754124 |
End bp | 7755815 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644959818 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003117411 |
Protein GI | 256395847 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACACAC CGCAGAGCCT CGGGCGCCGC GGATTCCTGC GCGGCGTCGG CGCCGCGGCG GCCCTCACCG CGGGCGGCTC GACGCTGGCC GCGTGCGGCA GCGGCAAGGC CGCGGCCGTG AACGAGGCGG GCAGCGCCGC GCAGGTCCAG CTGCCGACGT ACACGCCGCT GGCCAACGGT CCGACGCCGG ACCTGCCGGG CACCGACGCC GGCGTCCCGG CCGGGTTCTA CGACTACCCG GCGTCCCCGA CCGCGGCGTT CGCCAGCCCG CCGCTGTCCG GCGGCAAGTT CTCGGCGATG ACGCCGCTGT TCACCGCTCC CCCGCCGGCC CGCGGCTCGA ACCCGGCGTG GCAGGCGATG GAGAAGAAGC TCGGCGCGAG CGTCGACATC ACGATGGTGG TCGGCGACGA CTTCGACACC AAGCTCTCCA CCCTGATCGC CGGCGGCGGA CTGCCGGACC TGATCCAGTA CGACGGCCTC GGCGGCGTCC CGACCATCAG CAACCTGCCG CAGTTCCTGG ACTCGCAGAT CGCCGACCTG ACCGCGCTGA TCGGCGGCGA CAAGGTCAAG GAATACCCGC ATCTGGCCGC CATCCCGAAG GTGTTCTGGG AGCAGTGCAC GGTCGCCGGG AAGCTGTACT TCATCCCCAT CCCGCGCGGC ATCAGCGCCG GCGCCGGGCT GTACCGGCAG GACCTGTTCG CCGCCGCCGG AGTCACCAGC AACAAGGACA TCAAGAACTC CGACGACTTC TTCACGCTTC TCAAGGAGCT GACGAACCCG GGCAAGGACC GCTACGCGCT GGCCGGCAAC TCCGGCAACG GCGGCTATTC CGGGGCGATC TTCGAGCAGA TCTTCGGGGT CCCGAACAAG TGGCGGGTGG ACGGCGGCGG CAAGCTGACC GCGGACATCG AGACCGACGA GTTCCGGGCG GCGCTGGAGT TCATGGTGAA GGTAGCCAAG GCCGGCTGCT TCTATCCGGG CGCGCAGGGC TGGACCAAGG CCAAGATGGA GGACGCCTTC CAGTCCGGTA AGGCGGCCAT GATCTACGAC GGTCTCCCGG CGCTGTCCAC CAGCGTCTGG GCCACCGCGC AGAAGATCGA CCCGAACGCC AAGCTGATGC CGTTCGTCCC GTTCGGCGCC ACCGGCGGAC CCGGCGTCGC CTGGCAGGAC AACGTGGTCT TCGCCGGCAC GATGCTGAAG AAGGCGGACC CGGCGAAGCT GGCGGAGGTC CTGAAGCTCG CCGACTTCCT CGCCGCGCCG TTCGGCACCG AGGAGTACCT GCTCAAGACC TACGGCGTCG AAGGCGCGGA CTACACGCTG GACGCCAACC ACAACCCGGT GCAGACCGCC CAAGGCAAGA ACGACGCGAA CGTCACCTGG AAGTACGTCG CCGCGCCGCA GCTGGTCACC TACAACCCCG GTGTCAATGC TCTGACGGAC GCGGTCCACC AGGCCTACAC CGAGCTGGTG CCGATCGCCG TGCCGAACCC GACCGCGACG CTGTACTCGC CGACCTTCGG CAAGCAAGGC GTGGCGCTGT ACAAGGCGGT CACGGACACC GTGACGCAGG TGATCGGCGG GCAGTCGAGC ATGAGCGCCT TCGACAACGC GGTGAAGACC TGGCGCAGCG GCGGCGGCGA CCAGATGCGG TCGGAGTTCG AGCAGGCGTA CGCGAGCGCG CCGAAGAGCT GA
|
Protein sequence | MHTPQSLGRR GFLRGVGAAA ALTAGGSTLA ACGSGKAAAV NEAGSAAQVQ LPTYTPLANG PTPDLPGTDA GVPAGFYDYP ASPTAAFASP PLSGGKFSAM TPLFTAPPPA RGSNPAWQAM EKKLGASVDI TMVVGDDFDT KLSTLIAGGG LPDLIQYDGL GGVPTISNLP QFLDSQIADL TALIGGDKVK EYPHLAAIPK VFWEQCTVAG KLYFIPIPRG ISAGAGLYRQ DLFAAAGVTS NKDIKNSDDF FTLLKELTNP GKDRYALAGN SGNGGYSGAI FEQIFGVPNK WRVDGGGKLT ADIETDEFRA ALEFMVKVAK AGCFYPGAQG WTKAKMEDAF QSGKAAMIYD GLPALSTSVW ATAQKIDPNA KLMPFVPFGA TGGPGVAWQD NVVFAGTMLK KADPAKLAEV LKLADFLAAP FGTEEYLLKT YGVEGADYTL DANHNPVQTA QGKNDANVTW KYVAAPQLVT YNPGVNALTD AVHQAYTELV PIAVPNPTAT LYSPTFGKQG VALYKAVTDT VTQVIGGQSS MSAFDNAVKT WRSGGGDQMR SEFEQAYASA PKS
|
| |