Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4955 |
Symbol | |
ID | 8336309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5657201 |
End bp | 5658577 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958054 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003115656 |
Protein GI | 256394092 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.117227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGTGA AGTTCCGCGC CATCGGCGCG CCAGGCCCGG GCCGGCCGCG TCGGCGTGCC GGGCTGCTCG CGCTCGGAGC CCTGCTCGCC CTCGGCGCCG CCGCCTGCGG CACGAGCAGT GGAAAGGCCA ACACGCCGAG CACCGGTGGC AGCTTCAAGA CCGCGGCGCA GACCGGCGGG ACCCTGACGG TCTGGGTCGA CTCGGACCGG CTGGCCGCCG CGAAGCTGTA CCAGAAGGCC CATCCCGAGG TGAAGATGGA CATCGTCACC TACGACGGGG ACGCCAACGG CTCCAACTAC CTGCAGACCA AGGTGTCCCT GTTCAACCGG ACCAGCTCGG GCTGGCCGGA CGTGGTCTTC AGTTCGCAGA ACAACGAGAC CAGCTGGGCG GTGCAGGCCG GGTTCGCCGC ACCGCTGAAC AAGGGTCTGA TACCGCAGGC CACCCTGGAC GGCTGGGCCA CCGACGCCAA CGCCCCGTGC ACCGTGGACG GCACCGTCTA CTGCCTGCGC AACGACCTGT CGCAGACCGT GCTCTGGTAC AACGACAAGC TGATGAAGCA GTGGGGCTAT CAGGTCCCCA CGACCTGGGA GCAGTACCAG GCCCTCGGTG AGAAGGTCGC CGCCGAGCAC CCCGGCTACC TGGTCGGCGC CGCCGGCGAC ACGTTCGCTC CGGAGATCTA CCTGTGGGCG GGCAAGTGCG GCGCCAACCA GATCACCGGG CCCAAGGCGG TCACGGTCGA CGCCACCAGC GCGGCCTGCA CCAAGATGGC CACGCTGATG GACACCCTGA TCAAGGACAA GACCCTGTCG ACGTCGAGCG TGTTCAGCTC CGACTTCGAC AAGAACGAGG CTGACAAGAT CCTGATGATG CCCGGCCCGT CGTGGTACGG CGGAGCGCTG TTCCAGGGCA CGTTCAAGAC CCCGGCCGGG CAGCTCGGCG TGGCGCCGAT GCCGCAGTGG TCCGGCGACT CCAGCCCGTC GGTGGGCAAC GTGGGCGGCG GCACCTGGCT GGTCTCGGCG CACAGCAAGA ACCTGAAGGC CTCGACCGAC TTCGTGACCT GGGTCACCAC CTCGGATGAC TACCAGGGCA AGCTGGCGCC GGGCTTTCCG GCGTACACGG CGGCGGCCAA GACGTGGCTG GCGGCGCAGC AGTCCTCCGG GTACTACGCC GACGACATCA CCGCGCCGCT CACCGCCGCG GCGAACCAGG TCTGGGCGGG CTGGGGCTAC GGGCAGTTCA GCCAGGAGGC GGTCTGGGCC GCGACCGTCA CCCCCGGCGT CAACGCCGGC AAGACCATCG TCTCGCTGCT GCCGGCCTGG CAGGACTCGA TCGTGAACCA CGCCAAGGCC GACGGATACC AGGTGGCGAC GAAGTGA
|
Protein sequence | MTVKFRAIGA PGPGRPRRRA GLLALGALLA LGAAACGTSS GKANTPSTGG SFKTAAQTGG TLTVWVDSDR LAAAKLYQKA HPEVKMDIVT YDGDANGSNY LQTKVSLFNR TSSGWPDVVF SSQNNETSWA VQAGFAAPLN KGLIPQATLD GWATDANAPC TVDGTVYCLR NDLSQTVLWY NDKLMKQWGY QVPTTWEQYQ ALGEKVAAEH PGYLVGAAGD TFAPEIYLWA GKCGANQITG PKAVTVDATS AACTKMATLM DTLIKDKTLS TSSVFSSDFD KNEADKILMM PGPSWYGGAL FQGTFKTPAG QLGVAPMPQW SGDSSPSVGN VGGGTWLVSA HSKNLKASTD FVTWVTTSDD YQGKLAPGFP AYTAAAKTWL AAQQSSGYYA DDITAPLTAA ANQVWAGWGY GQFSQEAVWA ATVTPGVNAG KTIVSLLPAW QDSIVNHAKA DGYQVATK
|
| |