Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4993 |
Symbol | |
ID | 8336347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5712720 |
End bp | 5714018 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958092 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003115694 |
Protein GI | 256394130 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0450999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.012477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACTC ATCTGCCTCG CCCACGCCGT GCGGTCCTGA CCGCCGCGGT GGCGGTAGCC GCCCTCGTCC TGGCCGGCTG CGGCTCCACC AAGAGCTCCG GCGGTTCGGC CGGTGCCACC GGCTCGGCCG ACGACGGCAC CACCCTCACC CTGTGGACCC GCGCCGGCAC GCAAACCCAG ACCCAGGCGC TCGTGGACGC GTACAACGCG AGCCACAAGA ACAAGGTCAA GCTGACCGCT TTCCCGAACG AGCAGTACCC TGACAAGATC GCCACGTCGG CCGGCGCCAA GGCGCTGCCG GACATCTTCA CCTCCGACGT GGTGTTCGCG CCGAACTACG TCAGCCAGGG TCTGTGGACC GATATCACCG GCGACTTCAA CACCCTGCCG TTCAAGGACG CGGTGGCTCC CTCGCACGTC AAGGCCGCCA CCTCCGACGG CAAGATCTAC GCTGTCCCGC ACGCGCTGGA CCTGTCGGTC ATGTTCTACA ACAAGGCCTT GTACAAGCAG GCCGGGCTGG ACCCGAACAA GGGCCCTGCC ACGCTCCAGG AGTTCGCCGA CCAGGCGCGC GCCGTCGCCA AGCTCGGCGG CGGCAACCAC GGCACGTACA TCGGCGGCAA CTGCGGCGGT TGCGTGGAGT TCACGTTCTG GCCCTCGATC TGGGCCGACG GCGGCGCGGT GATGGACGAC GCGGGTACCA AGTCCTCGAT CGACAGCTCC CAGGTGCAGG CCGTGTTCAA GGTCTACCAC GACCTGTACG CCGACGGGAC CGCCGACCCG GCCTCCAAGC AGGAGAACGG CACCACCTGG CTCGGCGCGC TGCAGACCGG CAAGGTCGGC ATCGCGCCCG GACCCTCGTC CTGGCTGCCG CTGATCCGCT CCAAGGGCAT CGACGTCGGC GTGGCGCCGA TCCCCGGCGT GCGGGGCGGG AGCTCCACCT TCGTCGGCGG CGACGTGGCC GGCATCGCCT CGACCAGCTC GCACGAGAAG CAGGCCTGGG ACTTCCTGTC CTGGACCCTC GGCGACACCG CGCAGGTCGA CGTGGTGGCC AAGCAGGGCC AGATCATCGC CCGCACCGAC CTGGCGGACA ACCGGTACGC CGCCGCCGAC CCGGCGGTGC TGAGCATCAA CCAGGTCATG GCCAAGGGCA AGACGCCGTA CGCGCTGAAC TTCAACGCCA CCTACAACGA CCCGCAGAGC CCGTGGACGG CGGCGCTGCG CGGAGCCCTG TTCGGGGACG CGGCCAGTGC TCTGGCCTCC GGTCAGGACG CCATCACCAA ATCGCTCGAC CAGCACTGA
|
Protein sequence | MPTHLPRPRR AVLTAAVAVA ALVLAGCGST KSSGGSAGAT GSADDGTTLT LWTRAGTQTQ TQALVDAYNA SHKNKVKLTA FPNEQYPDKI ATSAGAKALP DIFTSDVVFA PNYVSQGLWT DITGDFNTLP FKDAVAPSHV KAATSDGKIY AVPHALDLSV MFYNKALYKQ AGLDPNKGPA TLQEFADQAR AVAKLGGGNH GTYIGGNCGG CVEFTFWPSI WADGGAVMDD AGTKSSIDSS QVQAVFKVYH DLYADGTADP ASKQENGTTW LGALQTGKVG IAPGPSSWLP LIRSKGIDVG VAPIPGVRGG SSTFVGGDVA GIASTSSHEK QAWDFLSWTL GDTAQVDVVA KQGQIIARTD LADNRYAAAD PAVLSINQVM AKGKTPYALN FNATYNDPQS PWTAALRGAL FGDAASALAS GQDAITKSLD QH
|
| |