Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_2421 |
Symbol | |
ID | 8665707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 2624540 |
End bp | 2627674 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | non-ribosomal peptide synthetase/polyketide synthase |
Protein accession | YP_003338142 |
Protein GI | 271963946 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.232071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCA GGGAATGGGA GGCTCCCGCC TCCTTCGGCC AGGAACGCAT CTGGCTGGCC GACCAGCTGG ATCCCGGCTC GCCGGTGTTC AACCTGCCGT GCTCGCTGGA GATCCTGCAC CCGATCGAGG CCGATGAGGT CGTCGCGGCG CTGAAGACCG TCGTCGACCG GCACGAGGTG CTCCGCACCT CCTTCAGGAT CGTCGACGGC GCCCTCCGCC AGGTGGTGCA CGCCGAGGTG GACCTCGACG TGCAGGTGCA CGACCTGCGG TCGCTCTCCG CCGAGGAGCG CCGGACCCGG ATGTACGACC ACGCCCTGGC CGCCGCCAGG GCCGCGGTCC CCACCGACCG TGCCCCGCTG TGGCGGGCCA CGGTGACCCG GACCGGCGAC GCGCGCTGGG TGGTCGGCTT CGTGGTCCAC CACGCCGTGT TCGACGCCAC CTCGGCCGTC ATCCTCACGA CGGAGTTGAC CGAGCTGTGC GCCGCCTCGG CGGCCGGCCG GCCCGCCGCC CTGCCGGAGC TGCCCATCCA GTACGCCGAC TACGCGGCCT GGCAGCGCGG CCAGCTCACC GGCCAGGCCC TGGCCGAGCA GACGGGCTAC TGGCGGTCCC GGCTGGCGGG GCTGCCCGCC GTACACACCC TCCCCACCGA CCGGCCGCGG CCCGCCCAGC TCGGCTACGC CGGCGACGAG GTCCGCTTCA CGATCCCGGA GGAACTGCAG GAGCAGGCCG CCGCGGTGGG ACGCCGCTCG GCCGCGACGC CGTTCATGGT GTTCCTGACA GCCTACGTCG CGCTGCTGTC CCGGCTGTCC CGCGACGACG ACACCGTCGT CGGGGTGACC ATGAGCGGCC GGGACCTGCC GGAGGTCGCC GGACTGGTCG GCATGTTCAT CAACCAGATG GTGGTGCGGA CGGACACCTC GGGCGACCCG GCCTTCGCCG AGCTGCTCGG CCGGGTCCGC GCCACCCTGC TCGACGCCAT GGAGAACGGC CAGATCCCGT TCCAGGCGGT CGTGGAGGCG ATCGCGCCGC AGCGGGATCC GGGGGTGCAG CCGCTCTACC AGATCGGCTT CAACTACATC CCCGACTCCG GCATCGAGCC GATCCCCTAC AGCACCTCGA AGGACGACCT GGCGTTCGAC CTGACCACCG GCACCAGCCG GCTGCTGTAC CGCACCGACC TGTTCGACCG GGGCACCGCG GAGACGTTCG TCGCCCGCTA CCTGCGCGTC CTGGCCGCGG GCGCGGCCGA CCCGGAGACC CGGATCGGCG ACCTGCCGCT GATGGACGAG GACGAGCGCG CGTCGCTGCT GGAGGCGGCC GCGCAGGAGC CGCCGCGCGA GCACGCCACG GTGTGCCGCA TGGTGGAGGC ACAGGCCGCG CGCACCCCGG ACGCGACCGC CGTGATCGTC GGTGACCGGC GGCTGACCTA CGCGGAGCTG GAGGAGGCCG CCGGACGGGT GGCGGACCGG CTGCGGCGGT CGGGCGCCGG CCCCGAGTCG CTGGTCGCGG TGTACGCCGA GCCGTGCCTG GAGCTGCTGC CCGCCCTGCT GGGCGTGTGG AAGGCCGGAG CGGGTTACGT GCCGGTGGAT CCCGGCTACC CGGCGGAGCG GGTGGCGTAC ATGCTCGCCG ACTCCGCCGC CTCGGTGGTG CTCACCCAGC GGCACCTGGC GGGCGCCCTG CCGGCCGCCG GCGCGACCGT GCTCGCCGTC GACGACCCCG GCGAGTGGAC CGGGCAGCCG GCCGCCGGCT CCTCCCGGGA GGCCGCGCCG GAGAACGTCG CGTACGTCAT CTACACCTCG GGCTCGACCG GCACCCCCAA GGGCGTCGTG GTCGAGCACG GCTCGGTGGC CGCCTACCTG GCGTGGGCCG GCACCGCCTA CCCCGGCCTG GCCGGACAGG CGCTGCTGCA CTCGCCGATC TCCTTCGATC TGACCGTGAC CGGCCTGTTC GGGCCGCTGA CCGTCGGCGG CGCGGTGCGG TTCGCCGCCC TCGACGACGG CGTCGCGGCT GGAGACCGGC CGACGTTCCT CAAGGCCACC CCCAGCCACC TGGCGCTGCT GGCGGCGCTG CCCGACCACG CCGCCCCCAG CGCCGACCTG GTGCTGGGCG GCGAGGCGCT GCCCGCCGAC TGGGTCACCG CCTGGCGGGA GCGGCATCCC GGCGTCACGG TCGTCAACGA GTACGGTCCC ACCGAGGCCA CCGTCGGCTG CGTCGCGGCC CACGTGGCGC CCGGCGAGGC GCTGCCCGTC GACCAGGCGG GCGCGGTCGG GATCGGCCGT CCGGCCCCCG GCAACACCGC CTACGTGCTG GACGGCGGGC TGCGGCCGGC GCCCGCCGGT GTCGTCGGCG AGCTCTACGT CGCCGGCCCC CAGGTGACCC GCGGTTACCT GAACCGGCCG GGCGCCACCG CCGCCGCCTA CGTGCCGTGC CCCTACGGGC CCCCCGGCGG GCGGATGTAC CGCACCGGCG ACCTGGCGCG CCGGCGCGCC GACGGGAGCC TGGAGTTCGT CGGCCGCGCG GACGACCAGA TCAAGCTCCA CGGCTACCGC ATCGAACCCG GCGAGATCGA GACGGCCCTG CGGGCGCAGC CCGGCGTCCG CGACGCCGCC GTCAGCGTCC GCGAGGACAC CTCCGACGGC AGGCGGCTCG TCGCCTACCT GGTCGGCGAG GCGGACCTCA CGGCGGTCGG CGACGCGCTC GCCGCCACCC TGCCGGCCCA CATGATCCCG TCCGGCTACG TCACCCTCGA CTCCCTGCCC CTGACCGCCA ACGGCAAGCT CGACCACGCC GCGCTCCCCG CGCCGTCGGC GGCGCCGGAC CGCCGGTACG TCGCGCCCCG CACCGCCGCG GAGGAGCTGG TGGCGGAGGT GTTCGCCGAG CTGCTGGGGG TGGAGAAGGT CGGCGCCGAG GACGACTTCT TCGAACTCGG CGGCAACTCC CTGCTGGCGA TCCGGGCCAT CGCCAGGATT CGCGGCCAGA TCGAGGTGGA CATCCCGGTC CGCGGACTTT TCTCCTACGC CACCGTCGCC GACCTCGCGG CTGAAATCGA ACGGCGGCTC AACGAAGACC TCGACCAGCT CAGCGATGAG GATGTCGAGC GGCTACTCAC AGCGGAAGGT GACGGCCGGC TATGA
|
Protein sequence | MTAREWEAPA SFGQERIWLA DQLDPGSPVF NLPCSLEILH PIEADEVVAA LKTVVDRHEV LRTSFRIVDG ALRQVVHAEV DLDVQVHDLR SLSAEERRTR MYDHALAAAR AAVPTDRAPL WRATVTRTGD ARWVVGFVVH HAVFDATSAV ILTTELTELC AASAAGRPAA LPELPIQYAD YAAWQRGQLT GQALAEQTGY WRSRLAGLPA VHTLPTDRPR PAQLGYAGDE VRFTIPEELQ EQAAAVGRRS AATPFMVFLT AYVALLSRLS RDDDTVVGVT MSGRDLPEVA GLVGMFINQM VVRTDTSGDP AFAELLGRVR ATLLDAMENG QIPFQAVVEA IAPQRDPGVQ PLYQIGFNYI PDSGIEPIPY STSKDDLAFD LTTGTSRLLY RTDLFDRGTA ETFVARYLRV LAAGAADPET RIGDLPLMDE DERASLLEAA AQEPPREHAT VCRMVEAQAA RTPDATAVIV GDRRLTYAEL EEAAGRVADR LRRSGAGPES LVAVYAEPCL ELLPALLGVW KAGAGYVPVD PGYPAERVAY MLADSAASVV LTQRHLAGAL PAAGATVLAV DDPGEWTGQP AAGSSREAAP ENVAYVIYTS GSTGTPKGVV VEHGSVAAYL AWAGTAYPGL AGQALLHSPI SFDLTVTGLF GPLTVGGAVR FAALDDGVAA GDRPTFLKAT PSHLALLAAL PDHAAPSADL VLGGEALPAD WVTAWRERHP GVTVVNEYGP TEATVGCVAA HVAPGEALPV DQAGAVGIGR PAPGNTAYVL DGGLRPAPAG VVGELYVAGP QVTRGYLNRP GATAAAYVPC PYGPPGGRMY RTGDLARRRA DGSLEFVGRA DDQIKLHGYR IEPGEIETAL RAQPGVRDAA VSVREDTSDG RRLVAYLVGE ADLTAVGDAL AATLPAHMIP SGYVTLDSLP LTANGKLDHA ALPAPSAAPD RRYVAPRTAA EELVAEVFAE LLGVEKVGAE DDFFELGGNS LLAIRAIARI RGQIEVDIPV RGLFSYATVA DLAAEIERRL NEDLDQLSDE DVERLLTAEG DGRL
|
| |