Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4988 |
Symbol | |
ID | 8668282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 5505936 |
End bp | 5507180 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein |
Protein accession | YP_003340531 |
Protein GI | 271966335 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.601947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTTTCC GCCTACCCCT TGCCTGCCTC GCCGGCCTTG CCGTCATGAC TGCCGGCTGT AGTTCCTCCA GCCAGCCGGA GGCCTCAGGA AAGGCCACGA TCAGCTACGC CATCTGGGAC AAGAACGACC AGGCGAGCGC GGAGAAGATC ATCGCGGCCT TCCAGCAGGC CAATCCCAAT GTCGCGGTGA AGCTCGAGAT CACCCCGTGG GACCAGTACT GGACCAAGCT CCAGACGGCC GCGTCCGGCG GTGCCGCCCC CGACGTGTTC TGGATGAACA GCCTCAATGT CCGCATGTAC GCCAAGGGGG GAATCATCAC CCCGATCGAG GAGTCGAAGG CCCAGGGCCT TCCCCCGGCG GTCGTCGACG GGTACCGCTA CGACGGCAAG CTGTACGGCC TGCCGCACAA CGTGAGCATC CCGGCGCTCT GGTACGACAA GAAGCTCTTC GACGCCGCCG GAGTGGCCTA CCCCACCGCC GACTGGACCT GGGACGACGT CAAGGCTGCG GCCAAGAAGC TGACCGACCC GTCCAAGAAG CAGTTCGGCA TCCTCGCCCA CATGTGGGAC CAGGGCGCCT TCTACCCCAC GATGCTCCAG GCGGGCGGCC ACGTGCTGTC GCAGGACGGC AAGAAGAGCG GCTTCGACGA TCCCGCCTCG ATCCAGGGCC TGGAGTACTG GACCGGCATG ATCAAGGACA AGGTCGGCCC CGTGGCGGAG GTATACACCG ACACCGACCC CATCACGCTC TTCCAGTCTG GCAAGTACGG CATGCTGTAC GGCGGCGTCT GGTTCGCCCC CACCTTCTGG GCCAACCCCG AGATCCGCGA GCGGATCGAC GTGGCCCCGC TGCCCAAGGG CCCTGGCAAG GAAGCCGTCA TCCTGCTCGG CCTGGCCAAC GCCGTCTCGG CCAAGAGCGA GCATCCGAAG GAGTCGGCGG CCTTCGCCGA GTTCGTCGCC TCCGAGCAGG CCCAGAGGAT CCTCAGCGAC AGCGGCGGCG GCGCCCTCTC GCTCCGCGAC GGCACGCAGG AGGGCTGGTT CAAGGCCTTC CCCTCCTTCC ACCTGAAGGA GACCTACGAC GCTTCGATGC CGTACGGCGT GCCGTACCCG GTGTCACTGA ACACCGCGCA GTGGCAGGAC GTGCAGAACA AGCTGCTCGC CGAGGCCTGG GCAGGCAAGC GGCCGGTGGC CGACGTCGCC AAGGAGATCG CGACGCAGAT GAACGAGATC CTGGCCAAGG AGTAA
|
Protein sequence | MRFRLPLACL AGLAVMTAGC SSSSQPEASG KATISYAIWD KNDQASAEKI IAAFQQANPN VAVKLEITPW DQYWTKLQTA ASGGAAPDVF WMNSLNVRMY AKGGIITPIE ESKAQGLPPA VVDGYRYDGK LYGLPHNVSI PALWYDKKLF DAAGVAYPTA DWTWDDVKAA AKKLTDPSKK QFGILAHMWD QGAFYPTMLQ AGGHVLSQDG KKSGFDDPAS IQGLEYWTGM IKDKVGPVAE VYTDTDPITL FQSGKYGMLY GGVWFAPTFW ANPEIRERID VAPLPKGPGK EAVILLGLAN AVSAKSEHPK ESAAFAEFVA SEQAQRILSD SGGGALSLRD GTQEGWFKAF PSFHLKETYD ASMPYGVPYP VSLNTAQWQD VQNKLLAEAW AGKRPVADVA KEIATQMNEI LAKE
|
| |