Gene Information |
Locus tag | Sros_3956 |
Symbol | |
ID | 8667246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4404701 |
End bp | 4405951 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | ABC-type sugar transport system periplasmic component-like protein |
Protein accession | YP_003339609 |
Protein GI | 271965413 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
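The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length_from_cds` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4404701
end_bp = 4405951
reported_gene_length = 1251      # "Gene Length" field, in bp
reported_protein_length = 416    # "Protein Length" field, in aa


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def protein_length_from_cds(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported 1251 bp and 416 aa.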
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.282171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0187852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACATC CGGGTGCCCG TACCGCCGCA CTCGCCGCCG TCGCTGCGGC TGCCCTGCTG ACCTCGGCGG CCTGCGGCAG CGGCTTCGAC GGCCCCGCGG GCGGGAACAC CGAGCAGAGC GGCGGCCCGG CGGCCCTGCG GATCCTCATC GGCTCCTCCG GCGACGCCGA GACCGCCGCG GTACGCTCGG CGGCCGGCGC CTGGGCCAAG GCGACGGGCA ACACCGCGAC GGTCACTCCC GCCCAGGACC TCTCCCAGCA GCTCGGCCAG GCCTTCGCCG GGAGCGACCC CCCGGACGTG TTCTACGTGG ACGCCTCGCG CTTCGCCGAC TACGCGAGCG TCGGGGCGCT GGAGCCGTAC GGTGACAGGA TCTCCGACTC CGGGGACTTC TACCCGAGCC TGCGCACCAC CTTCAGCCAC GACGGCGTCT TCTACTGCGC GCCGAAGGAC TTCGCCACCC TGGCGCTGAT CGTCAACGAC GACCTGTGGA AGAAGGCCGG GCTGACCGGC GCGGACGTGC CCACCACCTG GGAGCAGCTC ACCTCGGCGG CGGAGAGGAT CAAGGCCGCG GGGGTCACCC CGCTGGTCGT CGGCGACACC CATGAGCGGA TCGGGGCCTT CATGGTGCAG GCCGGGGGCT GGATCACCAG CGACGACGGC AGGCGGGCCA CCGCCGACAG CGCCGCGAAC GTCACCGCCC TGCAGTACGT GCGGGGCCTG CTCAAGGGCG GGCTCGCCCG GTTCCCCAAG CAGCTCGACG CCGGATGGGG CGGTGAGGCC TTCGGCAGGG GCAGGGCCGC GATGACCGTC GAGGGCAACT GGATCAGGGG GGCGATGAGA GCGGACCACC CCGGCGTCGC CTACACCGTC CACGAGCTGC CGGCCGGGCC GGCGGGCAAA GGCACGCTGT CCTTCACCAC CTGCTGGGGC ATAGCCGCCA AGAGCAGGCA CAAGAAGCAG GCGATCAGCT TCGTCGAGGA GATGACCAGG GCCGGCCGGC AGATGGAGTT CGCCAGGGCG TTCGGCGTGA TGCCCTCCCG CCGGTCCGCC AGGGCCGCCT TCACCGGGGA GTTCCCGGAC GACACCCCGT TCGTGAACGG CGCCGACCAC GCCCACGGCC CGGTGAACAC CCCGAAGATG GCCAATGTGC TGGCCGACTT CGACGACGGC CTCCAGCAGC TCGCCTCCAC CGACCCGAAG ACGCTCCTGG CCCGCCTGCA GAAGAACACC CGGGCCGCGC TCGGCGACTG A
|
Protein sequence | MRHPGARTAA LAAVAAAALL TSAACGSGFD GPAGGNTEQS GGPAALRILI GSSGDAETAA VRSAAGAWAK ATGNTATVTP AQDLSQQLGQ AFAGSDPPDV FYVDASRFAD YASVGALEPY GDRISDSGDF YPSLRTTFSH DGVFYCAPKD FATLALIVND DLWKKAGLTG ADVPTTWEQL TSAAERIKAA GVTPLVVGDT HERIGAFMVQ AGGWITSDDG RRATADSAAN VTALQYVRGL LKGGLARFPK QLDAGWGGEA FGRGRAAMTV EGNWIRGAMR ADHPGVAYTV HELPAGPAGK GTLSFTTCWG IAAKSRHKKQ AISFVEEMTR AGRQMEFARA FGVMPSRRSA RAAFTGEFPD DTPFVNGADH AHGPVNTPKM ANVLADFDDG LQQLASTDPK TLLARLQKNT RAALGD
|
| |
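A GC percentage like the one in the GC content field can be computed directly from a nucleotide string. The following is a minimal sketch, assuming GC content means the share of G and C bases in the gene sequence; how the database actually derives and rounds its value is not stated in this record. `gc_percent` is a hypothetical helper name, and the example uses only the first three 10-base blocks of the gene sequence, so its result is not expected to match the whole-gene figure.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three 10-base blocks of the gene sequence, as laid out in the record.
prefix = "ATGAGACATC CGGGTGCCCG TACCGCCGCA"
print(round(gc_percent(prefix), 1))
```

To check the reported figure, pass the full Gene sequence field to `gc_percent`; the whitespace between blocks is stripped before counting.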