Gene Information |
Locus tag | Sros_8343 |
Symbol | |
ID | 8671677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 9208541 |
End bp | 9211492 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003343734 |
Protein GI | 271969538 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
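The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive replicon coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 9208541
END_BP = 9211492
GENE_LENGTH_BP = 2952
PROTEIN_LENGTH_AA = 983


def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# 9211492 - 9208541 + 1 = 2952 bp; 2952 / 3 - 1 = 983 aa
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```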
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.715387 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCTTCC GGACCCCCGG AGCCGGTAGG GCAATGCGCT TGCCCCGCCG GCCACGACTG CTACTTCCTG TCGCGATCGC CATCGTCGCG CTCGTTGCGT TGTTCTTCCT CTTCGCCGGC ATCTTCACCG ACTACCTCTG GTACGACTCG GTGAACTACA CGACGGTCTT CTCCGGTGTG ATCGTCACAC AGATCGTGCT CTTCATCGTG GGCGCCGTGC TGATGGTCGG CCTCGTGGGC GGCAACATGC TGCTGGCCTA CCGGATGCGG CCGATGTTCG GCCCGGGGAT GTTCGGCGGC GCCAGCGGCG CCGACCGCTA CCGGATGGCC CTCGACCCGC ACCGCAAGCT GATCTTCCTG ATCGGCGTCG CCGTGCTCGC CCTGTTCTCC GGCTCGTCGT TCTCCAGCCA GTGGAAGACC TGGCTGGAGT TCACCAACGC CACGCCGTTC AAGGAGACCG ACGCGCTGTT CGGCATGGAC GTGTCGTTCT TCATGTTCAC CTACCCGTTC CTGCGGATGG TGCTGAACTT CCTGTTCACG GCGGTGGTGC TGTCGGCCAT CATGGCGGCG ATCGTGCACT ACCTGTACGG CGGCTTCCGG CTCCAGTCGC CGGGCGTGCA CGCCTCGCGG GCCGCCCGGG TGCACCTGTC GGTGCTGCTC GGCGTGTTCG TGCTGCTGAA GGCGGTCGCC TACTGGATCG ACCGGTACGG ACTGGTCTTC TCCGACCGGG GCTTCGCGTA CGGCGCGTCC TACACCGACG TCAACGCCGT GCTGCCGGCC AAGACCATCC TGGCGATCAT CGCCCTCATC TGCGCCGCCC TGTTCTTCGC CGGAGTCGTA CGGCCCGGCG GCATGCTGCC CGGCGTCTCC TTCGGACTGC TGGTGCTCTC GGCCATCCTG ATCGGCGGGG TCTATCCGGC CCTGGTCGAG CAGTTCCAGG TCAAGCCGAA CCAGCAGGAC AAGGAACAGG TCTACATCAA GCGCAACATC GACGCCACCA GGAAGGCCTA CGGCGTCGAC AAGTCCGAGG TCACCGACTA CACGGCGCAG GGTGACGCCA GCAAGGTCAA CGTCACCGCC GACAGTTCGA TCTCGGGCGT GCGGCTGCTC GACCCCAACC TGGTCGCCAA GACCTACCAG CAGAAGCAGC GCATTCGCGG CTACTACGAC TTCCATGACC CGCTCGACGT CGACCGCTAC CCGGATGAGT CGGGCAAGCT GCGCGACACC GTGGTGGCCG TGCGAGAGCT CACCGGCCCG CCGCGCGAGC AGGACAACTG GATCAACCGG CACCTGGTCT ACACCCACGG CTACGGCTTC GTGGCCGCGC CGGGCAACGA GGTCGACTCG CAGGGCCTGC CCAACTTCGA CGCCAAGGAC ATGCCGGTGA CCGGCCCCCT GGTGCAGCGG ACCGGGCTGA AGGAGTCGCG GATCTACTTC GGCGAGTCCC CGACCGCACC CGAGTACGTC GTGGTGGGCG GTGACAAGAA GCAGGAGCTC GACTACCCGG AGAGCGGCGG CACCGGCCAG CAGAACACCA CCTACGTGGG CAAGGGCGGC GTCCCGGTCG GCTCGTTCCT CAACCGGGTT CTCTACGCGG CCAAGTACGG CGAGAAGAAC CTGCTGCTGT CGAGCGACAT CAACGACAAG TCGAAGATCC TCTACGAGCG CAACCCGCTG GACCGCATCG CCAAGGTGGC GCCGTTCCTG AGCCTGGACG ACAACCCCTA CCCGGCGATC GTCGACGGCA GGGTGGTCTG GATCGCCGAC GCCTACACCA CCTCCAACGC CTACCCCTAC 
TCCGACAGCC GGAGCCTGGA GGCGATGACG CGGGACACCG TCACGGACCC GCGCCTGGTG GTGCAGCAGC CGCGCGACCA GATCAACTAC ATGCGCAACG CGGTCAAGGC CACGGTCGAC GCCTACGACG GCACCGTCAA CCTGTACGCC TGGGACGAGA CGGACCCGAT CCTGCAGACC TGGCGCAAGG CCTTCCCCGG GATCATCAAG CCGCAGAGCG AGATGGGCGA CGGTCTCAAG CAGCACCTGC GCTACCCGGA GGCGCTGTTC AAGGTCCAGC GTGACGTGCT GTCCCGCTAC CACATCGAGG ACCCGAACGC CTTCTACAGC GGTCAGGACT TCTGGAACGT CCCGAACGAC CCGTCGTCGG GCGAGCGCGA CGTCAAGCAG CCGCCGTACT ACCTGTCGGT CAAGATGCCC GACACGACGG CCCCGACGTT CTCGCTGACC ACCACGTTCG TGCCGCGCCA GGGTCCGAAC CTGGCGGCGT TCATGGCAGT GGACGCCACC CCGGGAGCGG ACTACGGAAA GCTCCGCATC CTGCGGATGC CCTCCAACAC CACGATCCCC GGCCCCGGCC AGGTGCAGAA CAACTTCCAG AACAAGTTCT CCGGCGAGCT CAACCTGCTC GGCCTCGGCC AGGCGAAGGT CCGCTACGGC AACCTGCTGA CCCTGCCGTT CGCCGGGGGC CTGGTCTACG TCGAACCGGT CTACGTGGAG ATCGCCGCGG CCTCCGGCCA GGAGCCGTAC CCGATCCTCC GCCGGGTCCT GGTCTCCTAC GGAGACAAGG TCGGCTCCGC CGACACACTG GAAGCCGCGC TGCAGCAGGT CTTCGGGGAA GGCGCCGCGC CGCCGGCCAA GCCTGACGTC ACCAAGCCGA ACACGCCCGC CCAGCCGAGC ACCGCGCTGA GCCAGGCGAT CGGCGAGGCG CAGCAGGCCT ACGAGAAGGC GCAGGCGGCC CTGGCGAAGA ACCCGCCGGA CTGGACCGCG TACGGCGAGG CACAGAAGGA GCTGGAGAAG GCCCTGGAGA AGTTGAAGGG CGTCGCGGTC CCGTCGGCCA CGGCCACTCC GGTCCCCTCG CAGAGCCCGA CTCCGGCCCC TTCCGGAAGC CCCACACCAG CGGCGACACC GACGTCCTCA CCGAGTCCAT AG
|
Protein sequence | MSFRTPGAGR AMRLPRRPRL LLPVAIAIVA LVALFFLFAG IFTDYLWYDS VNYTTVFSGV IVTQIVLFIV GAVLMVGLVG GNMLLAYRMR PMFGPGMFGG ASGADRYRMA LDPHRKLIFL IGVAVLALFS GSSFSSQWKT WLEFTNATPF KETDALFGMD VSFFMFTYPF LRMVLNFLFT AVVLSAIMAA IVHYLYGGFR LQSPGVHASR AARVHLSVLL GVFVLLKAVA YWIDRYGLVF SDRGFAYGAS YTDVNAVLPA KTILAIIALI CAALFFAGVV RPGGMLPGVS FGLLVLSAIL IGGVYPALVE QFQVKPNQQD KEQVYIKRNI DATRKAYGVD KSEVTDYTAQ GDASKVNVTA DSSISGVRLL DPNLVAKTYQ QKQRIRGYYD FHDPLDVDRY PDESGKLRDT VVAVRELTGP PREQDNWINR HLVYTHGYGF VAAPGNEVDS QGLPNFDAKD MPVTGPLVQR TGLKESRIYF GESPTAPEYV VVGGDKKQEL DYPESGGTGQ QNTTYVGKGG VPVGSFLNRV LYAAKYGEKN LLLSSDINDK SKILYERNPL DRIAKVAPFL SLDDNPYPAI VDGRVVWIAD AYTTSNAYPY SDSRSLEAMT RDTVTDPRLV VQQPRDQINY MRNAVKATVD AYDGTVNLYA WDETDPILQT WRKAFPGIIK PQSEMGDGLK QHLRYPEALF KVQRDVLSRY HIEDPNAFYS GQDFWNVPND PSSGERDVKQ PPYYLSVKMP DTTAPTFSLT TTFVPRQGPN LAAFMAVDAT PGADYGKLRI LRMPSNTTIP GPGQVQNNFQ NKFSGELNLL GLGQAKVRYG NLLTLPFAGG LVYVEPVYVE IAAASGQEPY PILRRVLVSY GDKVGSADTL EAALQQVFGE GAAPPAKPDV TKPNTPAQPS TALSQAIGEA QQAYEKAQAA LAKNPPDWTA YGEAQKELEK ALEKLKGVAV PSATATPVPS QSPTPAPSGS PTPAATPTSS PSP
|
| |
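The gene sequence above begins with TTG while the protein sequence begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) allows: TTG is one of its alternative start codons, and an initiating codon is read as methionine. A minimal sketch of how the two Sequence fields and the GC content field relate follows. It assumes the displayed gene sequence is already in coding orientation (it starts with TTG and ends with TAG even though the strand is "-"), and the helper names `clean`, `translate_cds` and `gc_percent` are hypothetical, not part of any database tool.

```python
# Codon-to-residue assignments in NCBI's T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code; it differs
# in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS given in coding orientation, stopping at the first stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]  # the initiating codon is read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)
```

Applied to the first two 10-base groups of the gene sequence, `translate_cds("TTGAGCTTCC GGACCCCCGG")` returns `"MSFRTP"`, matching the first six residues of the protein sequence; the trailing incomplete codon is ignored. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.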