Gene Sros_2421 details


Gene Information

Locus tag: Sros_2421
Symbol:
ID: 8665707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2624540
End bp: 2627674
Gene Length: 3135 bp
Protein Length: 1044 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: non-ribosomal peptide synthetase/polyketide synthase
Protein accession: YP_003338142
Protein GI: 271963946
COG category:
COG ID:
TIGRFAM ID:
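
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length (the listed gene sequence does end in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2624540
END_BP = 2627674


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon is excluded."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # matches the Gene Length field (3135 bp)
print(length_aa, "aa")  # matches the Protein Length field (1044 aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported for this locus.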


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.232071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCA GGGAATGGGA GGCTCCCGCC TCCTTCGGCC AGGAACGCAT CTGGCTGGCC 
GACCAGCTGG ATCCCGGCTC GCCGGTGTTC AACCTGCCGT GCTCGCTGGA GATCCTGCAC
CCGATCGAGG CCGATGAGGT CGTCGCGGCG CTGAAGACCG TCGTCGACCG GCACGAGGTG
CTCCGCACCT CCTTCAGGAT CGTCGACGGC GCCCTCCGCC AGGTGGTGCA CGCCGAGGTG
GACCTCGACG TGCAGGTGCA CGACCTGCGG TCGCTCTCCG CCGAGGAGCG CCGGACCCGG
ATGTACGACC ACGCCCTGGC CGCCGCCAGG GCCGCGGTCC CCACCGACCG TGCCCCGCTG
TGGCGGGCCA CGGTGACCCG GACCGGCGAC GCGCGCTGGG TGGTCGGCTT CGTGGTCCAC
CACGCCGTGT TCGACGCCAC CTCGGCCGTC ATCCTCACGA CGGAGTTGAC CGAGCTGTGC
GCCGCCTCGG CGGCCGGCCG GCCCGCCGCC CTGCCGGAGC TGCCCATCCA GTACGCCGAC
TACGCGGCCT GGCAGCGCGG CCAGCTCACC GGCCAGGCCC TGGCCGAGCA GACGGGCTAC
TGGCGGTCCC GGCTGGCGGG GCTGCCCGCC GTACACACCC TCCCCACCGA CCGGCCGCGG
CCCGCCCAGC TCGGCTACGC CGGCGACGAG GTCCGCTTCA CGATCCCGGA GGAACTGCAG
GAGCAGGCCG CCGCGGTGGG ACGCCGCTCG GCCGCGACGC CGTTCATGGT GTTCCTGACA
GCCTACGTCG CGCTGCTGTC CCGGCTGTCC CGCGACGACG ACACCGTCGT CGGGGTGACC
ATGAGCGGCC GGGACCTGCC GGAGGTCGCC GGACTGGTCG GCATGTTCAT CAACCAGATG
GTGGTGCGGA CGGACACCTC GGGCGACCCG GCCTTCGCCG AGCTGCTCGG CCGGGTCCGC
GCCACCCTGC TCGACGCCAT GGAGAACGGC CAGATCCCGT TCCAGGCGGT CGTGGAGGCG
ATCGCGCCGC AGCGGGATCC GGGGGTGCAG CCGCTCTACC AGATCGGCTT CAACTACATC
CCCGACTCCG GCATCGAGCC GATCCCCTAC AGCACCTCGA AGGACGACCT GGCGTTCGAC
CTGACCACCG GCACCAGCCG GCTGCTGTAC CGCACCGACC TGTTCGACCG GGGCACCGCG
GAGACGTTCG TCGCCCGCTA CCTGCGCGTC CTGGCCGCGG GCGCGGCCGA CCCGGAGACC
CGGATCGGCG ACCTGCCGCT GATGGACGAG GACGAGCGCG CGTCGCTGCT GGAGGCGGCC
GCGCAGGAGC CGCCGCGCGA GCACGCCACG GTGTGCCGCA TGGTGGAGGC ACAGGCCGCG
CGCACCCCGG ACGCGACCGC CGTGATCGTC GGTGACCGGC GGCTGACCTA CGCGGAGCTG
GAGGAGGCCG CCGGACGGGT GGCGGACCGG CTGCGGCGGT CGGGCGCCGG CCCCGAGTCG
CTGGTCGCGG TGTACGCCGA GCCGTGCCTG GAGCTGCTGC CCGCCCTGCT GGGCGTGTGG
AAGGCCGGAG CGGGTTACGT GCCGGTGGAT CCCGGCTACC CGGCGGAGCG GGTGGCGTAC
ATGCTCGCCG ACTCCGCCGC CTCGGTGGTG CTCACCCAGC GGCACCTGGC GGGCGCCCTG
CCGGCCGCCG GCGCGACCGT GCTCGCCGTC GACGACCCCG GCGAGTGGAC CGGGCAGCCG
GCCGCCGGCT CCTCCCGGGA GGCCGCGCCG GAGAACGTCG CGTACGTCAT CTACACCTCG
GGCTCGACCG GCACCCCCAA GGGCGTCGTG GTCGAGCACG GCTCGGTGGC CGCCTACCTG
GCGTGGGCCG GCACCGCCTA CCCCGGCCTG GCCGGACAGG CGCTGCTGCA CTCGCCGATC
TCCTTCGATC TGACCGTGAC CGGCCTGTTC GGGCCGCTGA CCGTCGGCGG CGCGGTGCGG
TTCGCCGCCC TCGACGACGG CGTCGCGGCT GGAGACCGGC CGACGTTCCT CAAGGCCACC
CCCAGCCACC TGGCGCTGCT GGCGGCGCTG CCCGACCACG CCGCCCCCAG CGCCGACCTG
GTGCTGGGCG GCGAGGCGCT GCCCGCCGAC TGGGTCACCG CCTGGCGGGA GCGGCATCCC
GGCGTCACGG TCGTCAACGA GTACGGTCCC ACCGAGGCCA CCGTCGGCTG CGTCGCGGCC
CACGTGGCGC CCGGCGAGGC GCTGCCCGTC GACCAGGCGG GCGCGGTCGG GATCGGCCGT
CCGGCCCCCG GCAACACCGC CTACGTGCTG GACGGCGGGC TGCGGCCGGC GCCCGCCGGT
GTCGTCGGCG AGCTCTACGT CGCCGGCCCC CAGGTGACCC GCGGTTACCT GAACCGGCCG
GGCGCCACCG CCGCCGCCTA CGTGCCGTGC CCCTACGGGC CCCCCGGCGG GCGGATGTAC
CGCACCGGCG ACCTGGCGCG CCGGCGCGCC GACGGGAGCC TGGAGTTCGT CGGCCGCGCG
GACGACCAGA TCAAGCTCCA CGGCTACCGC ATCGAACCCG GCGAGATCGA GACGGCCCTG
CGGGCGCAGC CCGGCGTCCG CGACGCCGCC GTCAGCGTCC GCGAGGACAC CTCCGACGGC
AGGCGGCTCG TCGCCTACCT GGTCGGCGAG GCGGACCTCA CGGCGGTCGG CGACGCGCTC
GCCGCCACCC TGCCGGCCCA CATGATCCCG TCCGGCTACG TCACCCTCGA CTCCCTGCCC
CTGACCGCCA ACGGCAAGCT CGACCACGCC GCGCTCCCCG CGCCGTCGGC GGCGCCGGAC
CGCCGGTACG TCGCGCCCCG CACCGCCGCG GAGGAGCTGG TGGCGGAGGT GTTCGCCGAG
CTGCTGGGGG TGGAGAAGGT CGGCGCCGAG GACGACTTCT TCGAACTCGG CGGCAACTCC
CTGCTGGCGA TCCGGGCCAT CGCCAGGATT CGCGGCCAGA TCGAGGTGGA CATCCCGGTC
CGCGGACTTT TCTCCTACGC CACCGTCGCC GACCTCGCGG CTGAAATCGA ACGGCGGCTC
AACGAAGACC TCGACCAGCT CAGCGATGAG GATGTCGAGC GGCTACTCAC AGCGGAAGGT
GACGGCCGGC TATGA
 
Protein sequence
MTAREWEAPA SFGQERIWLA DQLDPGSPVF NLPCSLEILH PIEADEVVAA LKTVVDRHEV 
LRTSFRIVDG ALRQVVHAEV DLDVQVHDLR SLSAEERRTR MYDHALAAAR AAVPTDRAPL
WRATVTRTGD ARWVVGFVVH HAVFDATSAV ILTTELTELC AASAAGRPAA LPELPIQYAD
YAAWQRGQLT GQALAEQTGY WRSRLAGLPA VHTLPTDRPR PAQLGYAGDE VRFTIPEELQ
EQAAAVGRRS AATPFMVFLT AYVALLSRLS RDDDTVVGVT MSGRDLPEVA GLVGMFINQM
VVRTDTSGDP AFAELLGRVR ATLLDAMENG QIPFQAVVEA IAPQRDPGVQ PLYQIGFNYI
PDSGIEPIPY STSKDDLAFD LTTGTSRLLY RTDLFDRGTA ETFVARYLRV LAAGAADPET
RIGDLPLMDE DERASLLEAA AQEPPREHAT VCRMVEAQAA RTPDATAVIV GDRRLTYAEL
EEAAGRVADR LRRSGAGPES LVAVYAEPCL ELLPALLGVW KAGAGYVPVD PGYPAERVAY
MLADSAASVV LTQRHLAGAL PAAGATVLAV DDPGEWTGQP AAGSSREAAP ENVAYVIYTS
GSTGTPKGVV VEHGSVAAYL AWAGTAYPGL AGQALLHSPI SFDLTVTGLF GPLTVGGAVR
FAALDDGVAA GDRPTFLKAT PSHLALLAAL PDHAAPSADL VLGGEALPAD WVTAWRERHP
GVTVVNEYGP TEATVGCVAA HVAPGEALPV DQAGAVGIGR PAPGNTAYVL DGGLRPAPAG
VVGELYVAGP QVTRGYLNRP GATAAAYVPC PYGPPGGRMY RTGDLARRRA DGSLEFVGRA
DDQIKLHGYR IEPGEIETAL RAQPGVRDAA VSVREDTSDG RRLVAYLVGE ADLTAVGDAL
AATLPAHMIP SGYVTLDSLP LTANGKLDHA ALPAPSAAPD RRYVAPRTAA EELVAEVFAE
LLGVEKVGAE DDFFELGGNS LLAIRAIARI RGQIEVDIPV RGLFSYATVA DLAAEIERRL
NEDLDQLSDE DVERLLTAEG DGRL