Gene Sros_1518 details


Gene Information

Locus tag: Sros_1518
Symbol:
ID: 8664794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1604722
End bp: 1606068
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: peptidase M20
Protein accession: YP_003337254
Protein GI: 271963058
COG category:
COG ID:
TIGRFAM ID:
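The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch makes explicit. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record. The function names `gene_length` and `protein_length` are hypothetical, chosen for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1604722, 1606068)
    print(length)                  # 1347
    print(protein_length(length))  # 448
```

Both results agree with the Gene Length and Protein Length fields of this record.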


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0128232
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000123266
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGACTCCTG CCGAGATCGA AGGCGCCGTC GCCGCCGGCC TGCCCCAGGC GGTCGAAGAC 
CTCAAGCGGC TCGTCGCCAT CCCCTCGGTG GCCTTCCCCG GCCACCCGGA GGAGCCGGTG
CGCGCGGCCG CCGCGGCGAC CGAGGAGCTG CTGCGCAGCG CCGGACTGCC GCACGTCCGG
CAGATTCCGG TCGAGGGAAG CTTCCCCGCC GTCTACGCCG AGGCACCGGC CCCGCCCGGC
GCGCCGACCG TGCTGCTCTA CGCCCACTAT GACGTGCAGC CCGCGGGCGA CCCCGCGCTG
TGGCGCACCC CGGCCTTCGA GCCGACGGAG GTCGACGGGG CCATCCACGG CCGCGGCGCC
GCCGACGACA AGTCCGGCAT CATCTCCCAC GTCGCCGCGC TCCGGGCCTT CCGGGGAGAC
TTCCCGGTGG GCATCAAGGT GATCATCGAG GGCCAGGAGG AGTACGCCGG GGAGCGCCTT
GAGGCCTTCG TCGAGCAGAA CCCCGAGCTG CTCCGCGCCG ACGCGATCAT CGTCGCCGAC
TGCGGCAACC CGAGCGTGGG CGACCCGGCG GTGACCACCT CGCTGCGCGG CATGGGCGCC
TTCACCGTCG AGGTGCGCAC CCTGAAGGAG TCGCTGCACA GCGGCTCGTT CGGCGGCGCC
GCCCCGGACG CGCTCGCCGC GCTGATCCGG ATGCTGGCCG GCCTGCACGA CGACCACGGC
GACATCCGCG TCCCCGGCCT GCCACGCGGC AGCTTCCTCG GCTCCGGCCC CTCGGAGGAG
GAGTTCCGGG CCACGGCGGG CGTGCTCGAC GGCGTCTCGC TGGTCGGCTC GGGTTCGCTG
GCCGACCGCC TGTGGGCCTC CTACGCCATC ACGGTCACCG GCCTGGACGT GCCGACCGTC
TCCGGCGCCA TCAACGCGGT CCAGGCGGTC GCGCGCGCCC GGGTGACCGT ACGCGTGCCT
CCGGCGGGCG ACCCGAAGAC GACCGTGGAC GCCGTGGTCG ACTTCCTCCG TCAGGTTGCT
CCCTGGGGTG TCGAGGTCCA CGTCACCGAC TACGTGCTGG GCTCCGGCTA CCTCGCCGAC
TCCGGCGGAG CCGCCCGCGC CGCGCTGAAC CGGGCGATGG AGCACGCCTT CGGCCGTCCG
CCGCGCGACG TCGGCGCCGG CGGCTCGATC CCGCTGGTCT CCACGCTCGT CAAGCAGTTC
CCCGCCGCGT CGATCCTGCT GTTCGGCGCC GAGGACGACG ACGCCTCGAT CCACGCGCCC
AACGAGCGGG TCAACATCGA GGAGCTCCGC CGCACGGCCC TCGCGGAAGC GCTCTTCCTC
CAGGAGTACG GCTCCGCGAC GGTCTAG
 
Protein sequence
MTPAEIEGAV AAGLPQAVED LKRLVAIPSV AFPGHPEEPV RAAAAATEEL LRSAGLPHVR 
QIPVEGSFPA VYAEAPAPPG APTVLLYAHY DVQPAGDPAL WRTPAFEPTE VDGAIHGRGA
ADDKSGIISH VAALRAFRGD FPVGIKVIIE GQEEYAGERL EAFVEQNPEL LRADAIIVAD
CGNPSVGDPA VTTSLRGMGA FTVEVRTLKE SLHSGSFGGA APDALAALIR MLAGLHDDHG
DIRVPGLPRG SFLGSGPSEE EFRATAGVLD GVSLVGSGSL ADRLWASYAI TVTGLDVPTV
SGAINAVQAV ARARVTVRVP PAGDPKTTVD AVVDFLRQVA PWGVEVHVTD YVLGSGYLAD
SGGAARAALN RAMEHAFGRP PRDVGAGGSI PLVSTLVKQF PAASILLFGA EDDDASIHAP
NERVNIEELR RTALAEALFL QEYGSATV
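
The gene sequence starts with GTG while the protein starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG can serve as an initiation codon and is then read as methionine; the codon assignments otherwise match the standard code. The sketch below illustrates this in plain Python. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical, and `START_CODONS` deliberately lists only a subset (ATG, GTG, TTG) of the initiation codons that table 11 allows.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# uses the same assignments for internal codons.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognised first codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces/newlines of the listing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_row = ("GTGACTCCTG CCGAGATCGA AGGCGCCGTC "
                 "GCCGCCGGCC TGCCCCAGGC GGTCGAAGAC")
    print(translate(first_row))  # MTPAEIEGAVAAGLPQAVED
```

The first 60 bases of the gene sequence translate to the first 20 residues of the protein sequence listed above.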