Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4419 |
Symbol | |
ID | 8335773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5019145 |
End bp | 5020515 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957522 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003115124 |
Protein GI | 256393560 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATAT CCCGTCGACG CGGCGTCCTG CTGGGCGCGA CCGCGCTGGC CGTGGCGCTG GCCGCGACCG CGTGCTCCAG CTCGGCGAGC AGCGGTGGCA GCGGCAAGAC CTCCTCCGGC TCCTCGCAGG CCGCCACCGT CACCGACGCC GACCTGCAGG CGGCGCTGAC CGCCGGCGGG AACCTGACGG TGTGGGCCTG GGAGCCGACC TTGAAGAAGG TCGTCGCCGA CTTCCAGACC AAGTACCCGA ACGTGCACGT CAACCTGGTC AACGCCGGGA CCGGCAACGA CGAGTACAAG GCGCTGCAGA ACGCGGTCCA GGCCGGCAAG GGCGTCCCGG ACGTCGCGCA CATCGAGTAC TACGCGCTGC CGCAGTTCGA GCTGACCAAG TCGGTGGCGA ACCTGGACGA GTTCGGCGCC GCCGCGCTGA ACGGCACGTT CACCCCCGGG CCGTGGAGCT CGGTCCAGGC CGCCGGCGGC GTCTACGGCC TGCCGATGGA CTCCGGACCG ATGGCGCTGT TCTACAACCA GACGGTCTTC ACCAAGTTCG GCATCACCAC GCCCCCGGCC ACGTGGGACG AGTACATCGC CGACGCCAAG AAGATCCACA CCGCCGACCC CAGCGTCTAC ATGACCAACG ACACCGGCGA CGCCGGCTTC ACCACCAGCA TGATCTGGCA GGCCGGCGGC AAGCCCTACT CGGTCAGCGG CACCACCCTC GGTGTGAACT TCGCCGGCGA CGCCGGCACG CAGAAGTTCG CGACCGCCTG GCAGCAGCTG CTGGACGGCC ACGACCTGGC GCCGATCAGC TCCTGGAGCG ACGCCTGGTA CCAGGGCATG GCCTCGGGCA AGATCGCCTC GCTGACCATC GGCGCCTGGA TGCCGGCCTC CCTGGAGTCC GGCGTGAAGT CCGGCTCCGG CCAGTGGCGC GTCGCCCCGA TGCCGCAGTG GACCGCCGGG GGCAAGGTCA CCTCCGAGAA CGGCGGCAGC TCCCTGGCCG TGATGAAGGC GAGCACCAAC CAGAAGCTGG CCTACGCGTT CCTGAAGTAC GCCACCGTGG ACGAGGGCGC GCAGACCCGC GTGGACAACG GCGCCTTCCC GGCCACGGTG AAGCAGCTGA ACTCCCCGGA CTTCCTGAAC AAGACCGACG CCTACTTCGG CGACCAGAAG ATCAACCAGG TGCTCGCACA GAGCGCCGCC GAGGTCGCCC CGGGCTGGTC CTACCTGCCC TTCCAGGTCT ACGCCAACAG CGTCTTCAAC GACACCGCCG GCAAGGCCTA CATCGGGTCC TCCTCCCTGG CCGACGGGCT GAAGGCCTGG CAGGACGCCT CGATCAAGTA CGCCAAGGAC CAGGGCTTCA CCGTCAAGTA G
|
Protein sequence | MTISRRRGVL LGATALAVAL AATACSSSAS SGGSGKTSSG SSQAATVTDA DLQAALTAGG NLTVWAWEPT LKKVVADFQT KYPNVHVNLV NAGTGNDEYK ALQNAVQAGK GVPDVAHIEY YALPQFELTK SVANLDEFGA AALNGTFTPG PWSSVQAAGG VYGLPMDSGP MALFYNQTVF TKFGITTPPA TWDEYIADAK KIHTADPSVY MTNDTGDAGF TTSMIWQAGG KPYSVSGTTL GVNFAGDAGT QKFATAWQQL LDGHDLAPIS SWSDAWYQGM ASGKIASLTI GAWMPASLES GVKSGSGQWR VAPMPQWTAG GKVTSENGGS SLAVMKASTN QKLAYAFLKY ATVDEGAQTR VDNGAFPATV KQLNSPDFLN KTDAYFGDQK INQVLAQSAA EVAPGWSYLP FQVYANSVFN DTAGKAYIGS SSLADGLKAW QDASIKYAKD QGFTVK
|
| |