Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_8990 |
Symbol | |
ID | 8672332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 9939465 |
End bp | 9941183 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | SL44-1; basic proline-rich protein |
Protein accession | YP_003344364 |
Protein GI | 271970168 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.297465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAACAC CTGAGGATCC CAACGAGCCC GGCTCGCGGC GCGAGCCGCG GGACCGGCGA TCCGAGCGGG AGGAGGCGGG CCCGGAGCAG TCCTCGGGCG CCTCCGAAGA GGCGGCCGTT CCGAGCGGAC CGGACGATTC CGACTCCACG GTCTTCCGCC CGGAGGACCC TTCCACGTGG TCGGCGCCCG ATCCCGCCGA CCCTCTGCGC TCGTCGTCCG AGCCCACCCC TCCGACCGGC CGGTCCGATC AGCCCCCCGA GCCCCCTTCC GAGCCCTCCG CGAGGCCCGC TGAGCCCGCG CCCCCGACCG GGTGGCCGTC CGAGCCCACG CCCCCTTCCG AGCCCTCCGC GAGGCCCGCT GAGCCCGCGC CCCCTTCCGA GCCCTCCGAG CCCCCTTCCG AGCCAGAGGA GCCCGAGAAG CCCACGCCGG GGTCGCCCGA GCCACGCGGT CCCGGTGAGA CCCCGGGCTG GCCGCCGATC TCCACGCCCG AGGAGCCGTC GCTGCCGGGG CATCCGGATG TGCCCACCGC GCCGGAGGTC CCCAGGCCGG TCCAGGCGGA TCCGGACACC ACCAGCGTTT TCGAGTCCCC GGGCACTCCG GCCGCCGAGG CCCCGCCCCC CTATCCGGTC CCGGGCGGCA CGCCTGCCTA TCCGGGCTGG GAGAGCGCCT CGGAGACACC GGCCTCAGGC GGCCGGGACG ACAGGGACGG CATGGATGTG CCGCCCGGCG CCGCCCAGCC CTCGCGCTAC GATCCGCCGA CGTCCCCCGA GGGGTTCCCC GCGGCCTCGC ACCGGCAGCC CGAGCAGCCC GAGCAGCCCG AGCAGGCCGG GCTCCCCGAG CAGCCCGGTC GGTTTGAGCA GGCCGGGCCC TCCGAGCAGC CCGGTCGGTT TGAGCAGGCC GGGCCCTCCG AGCAGGACAG GCCGTCCGAG CAGGCCGGTC GGCCTGAGCA GGCCGGGCCA CCCGAGCAGC CTGGCGGAGG CTTCCAGGCG GAACCGCCGC CGGGACCGGG CGGTCCGTAC GGTGGCCCCT CGGCTCCGTA CGGCGGCCCC GCGGCCCCGT ACGGCGGCCC TTCGGCTCCG TACGGCGAGG CCGCCCCCGG TGCCCACCGC GGGCCGGGCG ACGAGGCTCC GACGGAGAAC ATCTCCGGCG CGGGCCCCTA CGGAACCCCT CCCTACACCG GTGCCCACGC GAGCCGGCCG GATGAGCCGC CGCGGCAGCC GCCGTACGCC CCACCGGCCG GCGGTCCGTC CTATCCGTCC TATCCGTCCG AGGGCCCGCC GCAGGGCGGC CTGTCCTACC CGTCGGGCCA GCCGTACCAG GCGGCGCCTC CGGGAGGCGC CTATCCGGGA GGTCCCGGCT ATCCGGGCGG CACGCCGTAC CCGGCCTATC CGGGTGACGA CCGGTCGCGC CAGCAGGGCG GCGGACTCGG CACCACCGCG CTCGTACTCG GCATCGTGAG CCTTTTCCTG CTCGTCGTGT GCGGCCTGGG GGCGCTGACG GCCATCATCG GCCTGATCAT CGGCATCGCC GCGGTCGTCA AGAACTCCAA CCGCGGCCGC GCCTGGGTGG GCATCGCCCT GAGCGTCCTG ACACTGATCA TCGCCGTGGT GGTGCTCAGC TGGTTCTACA GCAAGGTGGG CGACTGCCTG AACCTGCCGC CCGAGTTCCA GCAGCGCTGC ATCCAGGAGA AGTTCGGCGG GCAGTTCACC ACGTCGTGA
|
Protein sequence | MTTPEDPNEP GSRREPRDRR SEREEAGPEQ SSGASEEAAV PSGPDDSDST VFRPEDPSTW SAPDPADPLR SSSEPTPPTG RSDQPPEPPS EPSARPAEPA PPTGWPSEPT PPSEPSARPA EPAPPSEPSE PPSEPEEPEK PTPGSPEPRG PGETPGWPPI STPEEPSLPG HPDVPTAPEV PRPVQADPDT TSVFESPGTP AAEAPPPYPV PGGTPAYPGW ESASETPASG GRDDRDGMDV PPGAAQPSRY DPPTSPEGFP AASHRQPEQP EQPEQAGLPE QPGRFEQAGP SEQPGRFEQA GPSEQDRPSE QAGRPEQAGP PEQPGGGFQA EPPPGPGGPY GGPSAPYGGP AAPYGGPSAP YGEAAPGAHR GPGDEAPTEN ISGAGPYGTP PYTGAHASRP DEPPRQPPYA PPAGGPSYPS YPSEGPPQGG LSYPSGQPYQ AAPPGGAYPG GPGYPGGTPY PAYPGDDRSR QQGGGLGTTA LVLGIVSLFL LVVCGLGALT AIIGLIIGIA AVVKNSNRGR AWVGIALSVL TLIIAVVVLS WFYSKVGDCL NLPPEFQQRC IQEKFGGQFT TS
|
| |