Gene Sros_3604 details


Gene Information

Locus tag: Sros_3604
Symbol:
ID: 8666892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3998885
End bp: 4000189
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: peptidase M20
Protein accession: YP_003339280
Protein GI: 271965084
COG category:
COG ID:
TIGRFAM ID:
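
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(3998885, 4000189)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1305 434
```

Both values agree with the Gene Length and Protein Length fields listed above.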


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.682601
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCCG CCGAGACCGA GGTAGCCGAG CTCTGCGCCG AGCTCATCCG CGTCGACACC 
AGCAACTACG GAGACGGCAG CGGCCCCGGC GAGCGCGCCG CCGCCGAGAT CGTCATGGCC
AGGCTCGCGG AGGTCGGCGC CGAGGCGACC TACGTGGAGA GCGCGCCGGG CCGGGGCAAC
GTCGTGACCA GGATCGAGGG CTCCGACCCC GGCCTGCCCG CGCTGCTGGT CCACGGCCAC
CTGGACGTGG TGCCCGCCAA CGCCGCCGAC TGGACCGTCG ACCCGTTCGC CGGCGAGATC
CGGGACGGCT ACATCTGGGG CCGGGGCGCG GTCGACATGA AGGACATGGA CGCCATGATG
CTGGCGGTCC TGCGGCAGAT GGTGACCGAG GGCCGCAAGC CCAGGCGCGA CGTCGTCTTC
GCCTGGGTGG CCGACGAGGA GGCGGGCGGC GAGTACGGCG CCAAGTACCT CGCCAGCAAG
CACCCCGAGC TGTTCGACGG GGTCGACCAC GCCATCAGCG AGGTCGGCGG CTACTCCCTG
GAGGTCGACC CCTCGCTGCG GCTCTACCTG ATCGAGACCG CGCAGAAGGG CCTGGCCTGG
ATGCGCCTGG TGGCCGGCGG CACCGCCGGG CACGGCTCCA TGCTCAACCC CGACAACGCG
GTCACCGAGG TCGCCAAGGC GGTGGCCAGG CTCGGCTCGC ACGAGTGGCC GCTCAAGCTG
ACGCCGACCG TGCGGCGTTT CCTGTCGGAG GTGGCCGACG CCTTCGGCCT GCCCTTCGAC
CCGGAGGATC CGGCGCCGAT CATCGAGGCC ATCGGCCCGC TGGCCCGGTT CGTCGGCGCG
ACCCTGCGTC ACACGACCAA CCCGACCATG CTCGCCGCGG GCTACAAGGC GAACGTGATC
CCCGGCCAGG CCAGTGCCGT GGTGGACGGC CGGTTCCTGC CGGGCTTCGA GGAGGAGTTC
CTGTCCACGG TCGACGAGCT GCTCGGACCG GACGTCCGGC GTGAGTACAT CACCCACGAC
ATCGCGCTGG AGACCACGTT CGACGGCGAG CTGGTCGAAT CGATGATCGC GGCGCTGAAG
GCGGAGGACC CGACCGCGCG GGCGGTGCCG TACTGCATGT CGGGCGGCAC CGACAACAAG
ACGTTCTTCG CCGACCTGAA CATCAGGGGC TTCGGGTTCG TCCCGCTGCG GCTGCCCGCG
GAGATGGACT TCGCCGCGAT GTTCCACGGC GTCGACGAGC GGGTGCCGGT GGACGCCCTG
CAGTTCGGCA CCCGCGTCCT CGACCGGCTG CTGACGAACT ACTGA
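
A GC content figure like the one in the Gene Information section is the fraction of bases that are G or C. The sketch below is one way to compute it from a sequence printed in 10-base blocks like the one above; `gc_content` is a hypothetical helper, and the record does not say how its own percentage was rounded.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 7 of its 10 bases are G or C.
print(gc_content("ATGACCCCCG"))  # 0.7
```

Passing the full gene sequence as one multi-line string works the same way, since whitespace is stripped before counting.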
 
Protein sequence
MTPAETEVAE LCAELIRVDT SNYGDGSGPG ERAAAEIVMA RLAEVGAEAT YVESAPGRGN 
VVTRIEGSDP GLPALLVHGH LDVVPANAAD WTVDPFAGEI RDGYIWGRGA VDMKDMDAMM
LAVLRQMVTE GRKPRRDVVF AWVADEEAGG EYGAKYLASK HPELFDGVDH AISEVGGYSL
EVDPSLRLYL IETAQKGLAW MRLVAGGTAG HGSMLNPDNA VTEVAKAVAR LGSHEWPLKL
TPTVRRFLSE VADAFGLPFD PEDPAPIIEA IGPLARFVGA TLRHTTNPTM LAAGYKANVI
PGQASAVVDG RFLPGFEEEF LSTVDELLGP DVRREYITHD IALETTFDGE LVESMIAALK
AEDPTARAVP YCMSGGTDNK TFFADLNIRG FGFVPLRLPA EMDFAAMFHG VDERVPVDAL
QFGTRVLDRL LTNY
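
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the sequence is given in the coding orientation.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACCCCCG CCGAGACCGA GGTAGCCGAG"))  # MTPAETEVAE
```

The final nine bases of the gene sequence, AACTACTGA, translate to NY followed by a stop, matching the end of the protein sequence.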