Gene Sros_0075 details

Gene Information

Locus tag: Sros_0075
Symbol:
ID: 8663338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 74314
End bp: 76551
Gene Length: 2238 bp
Protein Length: 745 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: putative terpene cyclase
Protein accession: YP_003335875
Protein GI: 271961679
COG category:
COG ID:
TIGRFAM ID:
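
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 74314
end_bp = 76551
gene_length_bp = 2238
protein_length_aa = 745

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)

print(span, codons - 1, remainder)  # prints: 2238 745 0
```

Under these assumptions the span equals the listed gene length, it divides evenly into 746 codons, and dropping the stop codon gives the listed 745 aa.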


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCTT TCACATTGCC CGAGTTCTAC ATGCCATATC CCGCCAGGAT CAATCCTCAT 
ATGGAGCGTT CCCGCGCGCA CAGTGCGGCC TGGGCCCGGC AGATGGGCAT GCTCGACGCA
CCCAAGCCCG GCGGCGGCGT CGTGTGGGAC GACGCCGAGC TGGCCAGGAT GGACTACGCG
CTGATGTGCG CCTACACCCA CCCCGACTGC GACGGCCCCA CCCTGGACCT GATCACCGAC
TGGTACGTCT GGGTGTTCTT CTTCGACGAC CACTTCCTTG AGCAGTTCAA GTACTCCCGC
GACCTGCTCG GGGCGAAGGC CTACCTCGAC CACCTCGAAC TGTTCATGAC CGCGGACGGA
GAGACGCCGC CGGAGCCGGC CAACCCGGCC GAGGCGGGGC TGAAAGACCT CTGGGAGCGT
ACGGTTCCGG CGATGTCGCA CGGGTGGCGG CAGCGCTTCA TCACCAGCAC GCACAACCTG
ATGGTGGAGT CGATGTGGGA GCTGGACAAC ATCGACCGCG GCCGGATCGC CAACCCGATC
GAATACGTGC AGATGCGGCG GAGGGTCGGC GGCGCACCCT GGTCGGCCAA CCTCGTCGAA
TACGCCGTCG GCGCGGAGAT CCCCGACGGT CTGGCCGGGA CGAGGCCGAT GCGGGTCCTG
TCGGACACGT TCTCGGACGC GGTGCACCTG CGCAACGACC TGTTCTCCTA CCAGCGCGAG
GTCCAGGAGG AGGGCGAGAA CTCCAACGCG GTGCTGGTCT TCGAGCGGTT CTTCGACTGC
CCGACGCAGG AGGCCGCCGA GCTCGTCAAC GACCTGCTGA CCTCCCGGCT GCAGCAGTTC
GAGAACACGA CGCTGATCGA GGTCCCGGCC CTGCTGGCCG AGAACACCGT GCCCGTGCAC
GAGCAGCTCG GGGTCGCCGC CTACGTCAAG GGTCTGCAGG ACTGGCAGTC CGGCGGACAC
GAGTGGCACG CGAGATCCAG CCGGTACATG AACGAAGGCG CCGCCTCGGG CCCCGCCGGT
GTGCTGAGAG GCCCGACCGG CCTGGGCACC TCCGCCGCCG TACCGACGCT CTCCCCGGCA
CGGCTGGGCC TGCGGCGCAG GTCCCAGCAG CAGTCCCACA GGCCGTTCCA GCCGGTGGGG
CATCTGCCGC TGCCGGATCT CTACATGCCC TACCCGGTCC GCACCAGCCC CCACCTGGAC
GCCGCGCGAC GCTACGCAGT CGGCTGGGCG CGGCGGATGG GCATGTTCGA CGCGATACCC
GGGGTGGAAG CCGGCGGGTT GTGGGACGAG CGGCGCTTCA TCGGCTTCGA CTTCGCCCAC
TGCGCCGCGA TGATCCACGC GGACGCGAGC CCCGAACAGC TCAACCTGTC CTCTGACTGG
CTGGCCTGGG GCACGTACGG TGACGACTAC TTCCCCGCGG TGTTCGGGGC GCCCCGCGAC
CTCGTGGCGG CGAAGCTCTG CAACGAACGG CTGTCGGCGT TCATGCCGCT GGACGCCGGG
GCCACCCCGG AGCCGACGAA CCCGATCGAG CGGGGGCTGG AGGACCTGTG GCGGCGCACC
GCGGAGCCGA TGAGCGTGCC CGCCCGGCAG CAGTTCCGCG AGGCGGTCGA GGACATGACC
GCCGGCTGGC TGTGGGAGCT GGTCAACCAG ACCCAGCACC GTGTCCCCGA TCCGGTCGAC
TACATCGAGA TGCGCCGCAA GACGTTCGGG TCGGACATGA CGATGAGCCT GGCCCGGCTC
GCGCACTCGG ACATGATGCC TGCGGAGATC TACCAGACAC GGGTCATGCG GGAGCTGGAC
ACCGCGGCGC AGGACTACGC CTGTTTCACC AACGACCTGT TCTCCTACCA GAAGGAGATC
GAGTTCGAGG GTGAGGTCCA CAACCTCGTC CTGGTCGTGG AGAACTTCCT GGAAGTGGAC
AGGTGGAAGG CCCGGGACGT CGTGGCCGAC CTGATGACAG CGCGGATGCA GCAGTTCGAG
CACATCGTCG CCAACGGCCT GCCGGCGCTG TTCGACGATT TCGCCCTCGA CGAGCAGGCC
CGCAGGATTC TCACCCGCCA TGCCGACGAC CTCAAGGAGT GGATGTCGGG AATCCTCGAA
TGGCACCGTA GGTGCGCGCG ATACACCGAG GCCGAGCTCC GGCGCAGCCG CCTTCCGGGA
GCGCCGGCGG GCTTCTCGCT TCTGCCCGCA GGGCTGGGCA CCTCGGCGGT GCGGGTCGGG
GCCGGCAGGC GGGGCTGA
 
Protein sequence
MQAFTLPEFY MPYPARINPH MERSRAHSAA WARQMGMLDA PKPGGGVVWD DAELARMDYA 
LMCAYTHPDC DGPTLDLITD WYVWVFFFDD HFLEQFKYSR DLLGAKAYLD HLELFMTADG
ETPPEPANPA EAGLKDLWER TVPAMSHGWR QRFITSTHNL MVESMWELDN IDRGRIANPI
EYVQMRRRVG GAPWSANLVE YAVGAEIPDG LAGTRPMRVL SDTFSDAVHL RNDLFSYQRE
VQEEGENSNA VLVFERFFDC PTQEAAELVN DLLTSRLQQF ENTTLIEVPA LLAENTVPVH
EQLGVAAYVK GLQDWQSGGH EWHARSSRYM NEGAASGPAG VLRGPTGLGT SAAVPTLSPA
RLGLRRRSQQ QSHRPFQPVG HLPLPDLYMP YPVRTSPHLD AARRYAVGWA RRMGMFDAIP
GVEAGGLWDE RRFIGFDFAH CAAMIHADAS PEQLNLSSDW LAWGTYGDDY FPAVFGAPRD
LVAAKLCNER LSAFMPLDAG ATPEPTNPIE RGLEDLWRRT AEPMSVPARQ QFREAVEDMT
AGWLWELVNQ TQHRVPDPVD YIEMRRKTFG SDMTMSLARL AHSDMMPAEI YQTRVMRELD
TAAQDYACFT NDLFSYQKEI EFEGEVHNLV LVVENFLEVD RWKARDVVAD LMTARMQQFE
HIVANGLPAL FDDFALDEQA RRILTRHADD LKEWMSGILE WHRRCARYTE AELRRSRLPG
APAGFSLLPA GLGTSAVRVG AGRRG
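
The gene and protein sequences above can be spot-checked against each other by translating a few codons. The sketch below is a minimal illustration, not the database's own method. It builds the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). It then translates the first 30 nucleotides and the final 18 nucleotides of the gene sequence. The final fragment is in frame because it is preceded by 37 full lines of 60 nucleotides, and 2220 is a multiple of 3. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the gene sequence, copied from above.
first = translate("ATGCAGGCTT TCACATTGCC CGAGTTCTAC")
last = translate("GCCGGCAGGC GGGGCTGA")

print(first, last)  # prints: MQAFTLPEFY AGRRG
```

Both results match the start (MQAFTLPEFY) and the end (AGRRG) of the listed protein sequence.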