Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4922 |
Symbol | |
ID | 8336276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5611964 |
End bp | 5613016 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958021 |
Product | sugar ABC transporter substrate-binding protein |
Protein accession | YP_003115623 |
Protein GI | 256394059 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.180738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTCA GGCTCCTGGC CGCGGGTACC GCGGCGGTTC TGTCCGTCAG CGCGCTGACG GCGTGCTCCA GCAGCAAGAA CAGTAGCGGC GGCAAAGCGG TCGTCGGCGT GGACTACCCG CGTTCGGACA CCGACTTCTG GAACTCCTAC ATCCAGTACA CCCCGCAGTT CGCCTCCCAG CTCGGCCTGT CGATCAAGAC CAGCAACTCG CAGAACGACG TCGCCAAGCT CGCCGCGAAC GCGCAGGCGT TCATGGCCGA GGGCGTCAAG GGCGTGGTCA TGGCGCCGCA GGACACCGCG GCGATCGCGC CCACGCTCAG CCAGCTCGCC GCCAAGAAGA TCCCGGTGGT CTCGGTCGAC ACCCGGCCCG ACACCGGCTC GGTCTACATG GTGGTGCGCG CCGACAACCG CGCCTACGGC ACCAAGGCGT GCCAGTTCCT CGGCACGAAG CTGTCCGGCA AGGGCTCGGT GGTCATGCTG GAGGGCGACC TGTCCTCGAT CAACGGCCGC GACCGCACCG ACGCGTTCAA CGCCTGCATG AAGCAGAGCT TCCCCGGCAT CACGGTCTAC GGCGAGGCCA CCAACTGGGA CGCCGCCACC GCGGCGCAGA AACTGCAGAC CGACCTCACG GCGCACCCGG ACGTCAAGGG CGTCTACATG GAATCCAGCT TCGCGTTGTC CGGCACGCTT CAGCTGCTCC AGCAGAAGGG CCTGATGGCT CCGCCGAGCG ACCCCAAGCA CGTGTTCGTC GTCTCCAACG ACGGCATCCC CGAAGAGCTG AAGGACATCG CGGCCGGCAA GATCGACGCC ACCGTGTCGC AGCCCGCGGA CCTGTACGCC AAGTACGCGC TGTTCTATAT CCAGGCCGCG GTGCAGGGCA AGACCTTCAA GCCGGGCCCG ACCGACCACG ACAGCACCAT CATCCAGGTC CGCGCGGGCC TGCTGGAGGA CCAGCTCTCC GCCCCGCTGG TCACCGCCGA CGGCGGCGCA TACGGCGGCA TCGCGAGCCT GAAGAGCACC GACACCTCGC TGTGGGGCAA CCACCTCGGC TGA
|
Protein sequence | MRFRLLAAGT AAVLSVSALT ACSSSKNSSG GKAVVGVDYP RSDTDFWNSY IQYTPQFASQ LGLSIKTSNS QNDVAKLAAN AQAFMAEGVK GVVMAPQDTA AIAPTLSQLA AKKIPVVSVD TRPDTGSVYM VVRADNRAYG TKACQFLGTK LSGKGSVVML EGDLSSINGR DRTDAFNACM KQSFPGITVY GEATNWDAAT AAQKLQTDLT AHPDVKGVYM ESSFALSGTL QLLQQKGLMA PPSDPKHVFV VSNDGIPEEL KDIAAGKIDA TVSQPADLYA KYALFYIQAA VQGKTFKPGP TDHDSTIIQV RAGLLEDQLS APLVTADGGA YGGIASLKST DTSLWGNHLG
|
| |