Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2151 |
Symbol | |
ID | 8333496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2437299 |
End bp | 2438699 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644955301 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003112911 |
Protein GI | 256391347 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCAC GCTCCCACCG GTCCGTCACG GTTGTCATAG CCGTCGGACT GGTCTCCGGC CTGGCACTGG CCGGCTGCTC CTCCAGCTCC TCCAAGCCCT CGGCGGGCGC GTCGACCTCC TCGGCCGCCG GCACCTCGGC GCCGAGCACG TCCGCGGCTT CCTCCTCCGC CGCGGCCGCC GGGCCCGCGC TGCCGGACCT GTCCGGCAAG TCGATCCAGG TGCTGGCCGA GTGGTCCGGC CAGGAGCAGC AGGACTTCCA GAAGGTGATC GACGCCTTCA CCGCCAAGAC CCACGCCAAG GTCAGCTACC AGGGCGCCGG CGACCAGACC CCGACCGTCC TGCGCAGCAA GCTGGCCGGC GGCGGCGCCC CCGACGTGGC GCTGCTGGCC CAGCCCGGCG CCATCGCGCA GTTCGCGAAG GCCGGGCAGA TCAAGCCGCT GGGCGCCAAC GTGCTCTCGG AGATCGACGC CAACTACGAC CCGAGCTGGA AGAAGCTCGG CACGGTCAAC GGCCAGGTCT ACTCGATCAT GTTCAAGGCG GCGAACAAGT CGACCTTCTG GTACAACACC GCGCAGTTCT CGCAGGCCGG CATCACGCCG CCGAAGACCT GGGCCGACTT CCTCAAGGAC TGCCAGGCGC TCTCCGACGC CGGCATCACC CCGGTCTCGA TCGGCGGCGC CGACGGCTGG ACGCTCACCG ACTGGTTCGA GAACGTCTAC CTCTCCCAGG CCGGCGCGGA CAACTACGAC AAGCTCGCGC ACCACCAGAT CCCCTGGACC GACCCCACGG TGGTCCAGGC GCTGACCACG ATGAAGCAGC TGTTCGGCAA CGACCAGTTC ATGGCCGGCG GCAAGGCCGG GGCGCTGCAG ACCTCGTTCA ACGACTCGGT CACCCAGACC TTCAAGAGCC CGCCGAAGGG CGCGATGGTC TACGAGGGCG ACTTCTCCGG CTCGGTGATC ACCTCGACCA CCTCGGCCAA GCTGGGCACC GACGCCAAGT TCTTCGCCTT CCCGGCGGCC GGGTCGCTGA CCAACTTCGT GGACGGCGGC GGCGACGCCG CGCTGGCCAC CAACGACAAC CCGGCGACGA TGGCGTTCAT CCAGTTCCTG GCCTCCCCGG AGGCGGCCGA GGCGTGGGCG TCGGCCGGCG GCTTCGTCTC GCCGAACAAG AACGTCCCGA TGTCCTCCTA CCCCGACGAC ACCACCCGCG CCGAGGCGCA GATGCTCGTC AGCGCCGGCG ACGGCTTCCG CTTCGACATG TCCGACCAGG CTCCGGTCGG CTTCGGCGGG ACCAAGGGCG CCGGGGAGTG GAAGGACCTG CAGGACTTCC TGAGCAACGG CGACGTCAAC GGCACCGCCG CGCAACTGGA GAAGGACGCG GCGAAGGAGA CCTGGCAGTA G
|
Protein sequence | MGARSHRSVT VVIAVGLVSG LALAGCSSSS SKPSAGASTS SAAGTSAPST SAASSSAAAA GPALPDLSGK SIQVLAEWSG QEQQDFQKVI DAFTAKTHAK VSYQGAGDQT PTVLRSKLAG GGAPDVALLA QPGAIAQFAK AGQIKPLGAN VLSEIDANYD PSWKKLGTVN GQVYSIMFKA ANKSTFWYNT AQFSQAGITP PKTWADFLKD CQALSDAGIT PVSIGGADGW TLTDWFENVY LSQAGADNYD KLAHHQIPWT DPTVVQALTT MKQLFGNDQF MAGGKAGALQ TSFNDSVTQT FKSPPKGAMV YEGDFSGSVI TSTTSAKLGT DAKFFAFPAA GSLTNFVDGG GDAALATNDN PATMAFIQFL ASPEAAEAWA SAGGFVSPNK NVPMSSYPDD TTRAEAQMLV SAGDGFRFDM SDQAPVGFGG TKGAGEWKDL QDFLSNGDVN GTAAQLEKDA AKETWQ
|
| |