Gene Information |
Locus tag | Caci_0625 |
Symbol | |
ID | 8331954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 729344 |
End bp | 730657 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644953777 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003111402 |
Protein GI | 256389838 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
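The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon, which encodes no residue


# Values taken from the record above.
length = gene_length_bp(729344, 730657)
print(length)                     # 1314
print(protein_length_aa(length))  # 437
```

Both results match the Gene Length (1314 bp) and Protein Length (437 aa) fields reported in the record.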
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.393195 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACCA GTAAGAACGT CCACCGCGCC GGCGTCATAG CCGCCGCGGT GGCCGTCTCG GCCGCGAGCC TGTCGGCCTG TGGCAGCTCC AGCAAGCCCG GCACGTCCTC CGGCGCCAAG TCCTCGGGCA GCACCTCGTT CACCTACTGG TCGATGTGGC AGCAGAACGA GCCGCAGGCC AAGGTGATCA AGAAGGCCGC CGACGACTTC ACCGCCGCCA CCGGTATCAA GGTGACCATC CAGTGGCAGG GCCGCGACGT CATCACCAAG CTGACCCCGA CGCTGCGCAC CGGCTCCTCG GCGGACCTGG TCGACCAGTC GGTGAACGCG CTCGGCGCGC TGGTCGCCAA GGACGAGACC ACCGACCTGA GCAGCCTGTA CGGCACCACG ATCCCCGGCG AGTCGCAGAC CGTCGGCCAG GTCATCCCCG ACAGCTACAA GCCGTTCCTG AACGACAAGA ACGGCAAGCC TTTCATCGCG CCTTACGAGG TCTCCTCCGA AGGCCTGTGG TTCGACGCGT CGAAGTTCCC CGCCCTCGCT GCCAACCCGC CGAAGACCTG GGACGATCTG CTCGCGCTGT TCGACAAGGC CAAGGGCATG GGGATGACGC CGGTCGCGGT CCCCGGCGAC GACAAGTACT GGGTGCTGCT GACCCTTCAG CGCGAGCTGG GTACTGACAC CCTGAAGAAG TTGGCGAGCG ACAAGACCGG CGCCGCCTGG GATGACCCGA AGGTCCTCAA GGCCGCGCAG CTGGTCCAAG CGATGCGCGA CAAGGGCGAC TTCCTCAAGG GCTACGAGTC CAGCAAGAGC ATCGACGAAG AGGGTGACTG GGCCAAGGGA CAGGCCGCCT TCTACCTGTC CGGCACGTGG GTTCCCTCCG AAGCCGCGCC GAACGCCGCT GCCGGCTTCA CATTCGACAG CATCCAGCTG CCGGCCCTGG ACTCGGGCAA CACCGACACC GGCATCAACT TCTTCGGCTT CGCGGTCCCC AAGACCGCCA AGCACTCCGA CGCCGCCGAG AAGTTCGTGG CCTTCTTCAT GAAGAAGGAG GAGCTCTCCG GCATCTCGAC CCAGGCGCTG AACCTCACAC CGCGCGCCGA CATCGCCCCG CCGGCCCAGC TGGCCTCGAT GAGCAAGGCG CTGACCGGGA CCGTCTACGC CGACCAGGCG CACCTGTCCG TGGACTACGC GGACTGGACC AACAAGATGC TGAACCCCGA GATGATCCGC CTGATCACCG GCCAGGACGA CGCCGCGAAG TTCGTCGCCA ACGGCCGGCA GGGCACGATC GATTACTGGA AGTCCGCGTC GTGA
|
Protein sequence | MRTSKNVHRA GVIAAAVAVS AASLSACGSS SKPGTSSGAK SSGSTSFTYW SMWQQNEPQA KVIKKAADDF TAATGIKVTI QWQGRDVITK LTPTLRTGSS ADLVDQSVNA LGALVAKDET TDLSSLYGTT IPGESQTVGQ VIPDSYKPFL NDKNGKPFIA PYEVSSEGLW FDASKFPALA ANPPKTWDDL LALFDKAKGM GMTPVAVPGD DKYWVLLTLQ RELGTDTLKK LASDKTGAAW DDPKVLKAAQ LVQAMRDKGD FLKGYESSKS IDEEGDWAKG QAAFYLSGTW VPSEAAPNAA AGFTFDSIQL PALDSGNTDT GINFFGFAVP KTAKHSDAAE KFVAFFMKKE ELSGISTQAL NLTPRADIAP PAQLASMSKA LTGTVYADQA HLSVDYADWT NKMLNPEMIR LITGQDDAAK FVANGRQGTI DYWKSAS
|
| |
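The GC content and the protein sequence can in principle be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library. It assumes the sequence is pasted as shown in the record (10-base blocks separated by spaces) and that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code, differing only in which codons may act as initiators. This gene starts with ATG, so no special handling of alternative start codons is included. `clean`, `gc_fraction` and `translate` are hypothetical helper names.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
fragment = "ATGCGAACCA GTAAGAACGT CCACCGCGCC"
print(translate(fragment))  # MRTSKNVHRA
```

The translated fragment matches the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.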