Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4060 |
Symbol | |
ID | 8667354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4519352 |
End bp | 4522531 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003339711 |
Protein GI | 271965515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00663013 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0372283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCT ACCTGCGATT CCCGACGATC TTCGGCGACC GGGTCGTCTT CGCCGCCGAG GACGACCTGT GGATGGTGCC CGTCACCGGC GGACGGGCCT TCCGGCTGAC CGCCGGGGTG GCCGAGGCGG GCTACCCCCG GTTCTCCCCG TGCGGCGACC AGCTCGCCTT CGCCGGCCGC GAGGAGGGGC CGGAGGAGGT CTACGTGATG CCCGCCGACG GCGGGGCGGC CCGGCGGATC ACCTATCACG GGGCCCGCTC CACGGTCACC GGCTGGGATC CCGACGGCGC CGTCCTGTAC GCCAGCGACG AGTCCCAGCC CTTCGAGGGG CAGAAGTGGC TGCACCGGAT CCACCCGGAC GGGATTCCGG AGCGCCTGCC GTACGGCCCG GCCAACTCCA TCTCCTACGG CCCGCAGATC GTGCTGGGCC GCAACACCGC CGACCCGGCC CGCTGGAAGC GCTACCGGGG CGGCACCGTG GGCGACCTGT GGATCGGCAC CGAGGAGTTC CGGCGGCTCA TCGCCCTGCC GGGCAACCTG GCCTCGCCCT GCTGGGCCGG GGAGCGGGTC TACTTCATCT CCGACCACGA GGGTGTCGGC AACGTCTACT CCTGCACCGC GGACGGCGGG GACCTGCGCA GGCACTCCGA CCACGCCGAC TACTACGCCC GCAACCTGTC CGGTGACGGC CACCGGCTGG TCTACCACGC CGGAGCCGAG CTCTACCTGG TCGAGGACGG GGAGTCACAC CGTGTCGAGG TGTGCCTGCG CAGCTCCCGC ACCCAGCGCA ACCGCCGTTT CGCCGCGGCC GAGGACTTCC TCGACAGCGC CACGCTCAGT CCCGACGGCA GCGGCCTGGC CATCACCACC CGGGGCAAGG CGTTCTCCTT CGCCGACTGG GAGGGCCCGG TCCGCCAGCA CGGCGCGCCG TACGGCGTCC GCTACCGCCT GCTGACCTGG CTGAACGACG ACGAGCGGCT GATCGCGGCG GCCAGCGACG ACGGCGACCG CGAGGTGCTG TCCATACTCA CCGCCGACGG CAGCGCCGAA CCGGTCCAGC TCGACCACCT CGACACCGGG CGCGTCACCG CGCTGGAGGT CTCCCCCAAG GACGACAGGG TCGCGATCGC CAACCACCGC AACGAGCTGC TCGTGGTCGA CCTCACCGGG GGCACGGTGA CCGGCGCCCG GAGCACGGTG ATCGACGCCA GCAGGTTCGG CGCGATCGAG GACCTCGCCT GGTCTCCCGA CGGCCGCTGG CTGGCCTACG CCTGCCGTGA CACCGCGCAG ACCATGGCCG TCAAGCTGTG CCGGATCGAG ACCGGCGAGA CGTTCTTCGC CACCCGCCCG GTGCTGTGGG ACAGCGGCCC CGCCTTCGAC CCCGGCGGCG ACTACCTCTA CTTCATCGGC CAGCGCGTCT TCAACCCGGT CTACGACGAG CTCCAGTTCG ACCTGGGATT CCCCCTCGGC TCCCGCCCCT ACGCCATCGG GCTCCGCGCC GACGTCCGCT CCCCCTTCGT CCCCGAACCC CGGCCGCTCA AGGACGACGA CGATGACGAC GACGGTGACG GTGACGGTGA CGACGGGCAG GAGACCGAGG TGGTCATCGA CCTGGCGGGC ATCCAGGACC GCGTCGTCGC CTTCCCCGTC CCCGAGGGAC GCTACGACCG CATCGCCGGG ATCAAGGGCA AGGCCGTCTA CCTGACGTTC CCCGTGGAGG GCAGCCTCGG CGACGACTAC GCCGACTCCT CCGACGGCAC GCTGCAGGTC TACGACTTCG CCGGCCAGAA GCAGGAGACC CTGGTCGGGG ACGTCTCGGA GTTCCAGCTG GGCCGTGACG GCACCACCCT GCTCTACCAG GCCGGAAAAC GGCTGCGGGT GATCAAGGCG GGCGAGGCGC CCGAGGACGA CGACACGCCG AGCCGCGGCA GCGGATGGGT CGACCTGTCG CGGGTCAAGG TGTCCATCCG CCCGGAGGCC GAGTGGCGGC AGATGTTCCG CGAGGCCTGG CGGCTGCAGC GGGAGAACTT CTGGACCCAG GACATGGCCG GGATCGACTG GGAGGGTGTC TACCGGCGCT ACCTCCCGCT GGTGGACCGG GTCACCACCC GGGGAGAGTT CTCCGACCTG CTGTGGGAGC TGCTCGGCGA GCTCGGCACC TCCCACGCCT ACGAGAGCGG CGGCGCCTAC CCGTCCCGGC CGCACTACCG GCAGGGCAAG CTCGGCGTCG ACTGGTCCTT CGAGGACGGC CTCTACCGGG TCGCCCGGAT CGTCAACGGC GACCGCTGGG ATCCCGAGGT CACCTCGCCG CTCAACCGCC TCGGGGTGGA CGTACGGCCC GGCGACGTGG TGCTGGCCGT CAACGGCCAG CCCGTCGGCC CGTCGGCCGG CCCGGACGAA AGGCTGGTCA ACCAGGCCGA TCAGGAGGTC CAGCTCACCG TCAGGCGCGG GCAGGACAAG CGGACCTTCA ACGTGAAGGC CATCGGCGAC GAGCAGCCGG GCCGCTACCG CGACTGGGTG GAGGCCAACC GGACCCACTG CCACGAGCGC AGCGGCGGCC GGGTCGGCTA CCTGCACATC CCCGACATGG GGCCGGACGG CTACTCCGAG TTCCACCGCG GCTTCCTCAC CGAATACGAC CGGGAGGGCC TGATCGTGGA CGTCCGGTTC AACGGCGGCG GCCACGTGTC GGCCCTGCTG CTGGAGAAGC TCTCCCGCCG CCGCCTCGGC TACAACTTCC CGCGGTGGAG CGTGCCCGAG CCCTACCCCG ACGAGTCCCC CCGGGGTCCG ATGGTCGCGA TCACCAACGA GTGGGCCGGC TCCGACGGCG ACATCTTCAG CCACACCTTC AAACTGCTCG GCCTGGGCCC GCTGATCGGC AAGCGCACCT GGGGCGGGGT GATCGGCATC TGGCCCCGGC ACCAGCTCGC CGACGGCACG GTCACCACCC AGCCGGAGTT CTCCTTCGCC TTCGACGACG TGGGCTGGCG GGTGGAGAAC TACGGCACCG ACCCCGACAT CGAGGTGGAC ATCACCCCGC AGGACTACGC CCGCGGCGTG GACACCCAGC TCGACAAGGC GATCGAGGTC GCCCTGGAAC GCCTGCTCCT CCATCCCCCG CACACGCCCA ATCCGGCCGA CCGGCCGCGG CTCACGGTCC CGCGCCTGCC ACCTCGCTGA
|
Protein sequence | MSAYLRFPTI FGDRVVFAAE DDLWMVPVTG GRAFRLTAGV AEAGYPRFSP CGDQLAFAGR EEGPEEVYVM PADGGAARRI TYHGARSTVT GWDPDGAVLY ASDESQPFEG QKWLHRIHPD GIPERLPYGP ANSISYGPQI VLGRNTADPA RWKRYRGGTV GDLWIGTEEF RRLIALPGNL ASPCWAGERV YFISDHEGVG NVYSCTADGG DLRRHSDHAD YYARNLSGDG HRLVYHAGAE LYLVEDGESH RVEVCLRSSR TQRNRRFAAA EDFLDSATLS PDGSGLAITT RGKAFSFADW EGPVRQHGAP YGVRYRLLTW LNDDERLIAA ASDDGDREVL SILTADGSAE PVQLDHLDTG RVTALEVSPK DDRVAIANHR NELLVVDLTG GTVTGARSTV IDASRFGAIE DLAWSPDGRW LAYACRDTAQ TMAVKLCRIE TGETFFATRP VLWDSGPAFD PGGDYLYFIG QRVFNPVYDE LQFDLGFPLG SRPYAIGLRA DVRSPFVPEP RPLKDDDDDD DGDGDGDDGQ ETEVVIDLAG IQDRVVAFPV PEGRYDRIAG IKGKAVYLTF PVEGSLGDDY ADSSDGTLQV YDFAGQKQET LVGDVSEFQL GRDGTTLLYQ AGKRLRVIKA GEAPEDDDTP SRGSGWVDLS RVKVSIRPEA EWRQMFREAW RLQRENFWTQ DMAGIDWEGV YRRYLPLVDR VTTRGEFSDL LWELLGELGT SHAYESGGAY PSRPHYRQGK LGVDWSFEDG LYRVARIVNG DRWDPEVTSP LNRLGVDVRP GDVVLAVNGQ PVGPSAGPDE RLVNQADQEV QLTVRRGQDK RTFNVKAIGD EQPGRYRDWV EANRTHCHER SGGRVGYLHI PDMGPDGYSE FHRGFLTEYD REGLIVDVRF NGGGHVSALL LEKLSRRRLG YNFPRWSVPE PYPDESPRGP MVAITNEWAG SDGDIFSHTF KLLGLGPLIG KRTWGGVIGI WPRHQLADGT VTTQPEFSFA FDDVGWRVEN YGTDPDIEVD ITPQDYARGV DTQLDKAIEV ALERLLLHPP HTPNPADRPR LTVPRLPPR
|
| |