Gene Sros_4047 details


Gene Information

Locus tag: Sros_4047
Symbol:
ID: 8667341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4505360
End bp: 4507390
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_003339698
Protein GI: 271965502
COG category:
COG ID:
TIGRFAM ID:
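
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4505360, 4507390)
print(length, protein_length_aa(length))
```

With the values from this record, the computed gene length and protein length match the listed 2031 bp and 676 aa.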


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0166577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00142198
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCTTCC GAACCGCCAT GGACATCGGC GGTACCTTCA CCGACGTCGT CCGCTACGAC 
GAACGGACCG GCCGCGTGGT GGCCTCGAAG GCGCCGACCA CACCGGGAAA CCTCGCCGAC
GGCGTGTTCT CCGCGCTCGG CCGGGTCGTG GACGACCCCT CCGAGATCTC CTTCTTCGTG
CACGGCACCA CCCAGGGGCT CAACGCGCTG CTGGAGCGCA AGGGGGCGCG GGTGCTGCTG
GTCACCGGGG AGGGGGCCCG GGACGTCTAC CGGATCGCCC GGGGCAACCG GGACCGGATG
TTCGACCTGC GCTACCGCAA GCCCGAGCCG CTGGTGCCCC GCTCGGACGT GACGGAGGTC
GCCGGACGGC TGGACTGGCG CGGCGAGGAA CTCGTCCCCC TGGACGAGGG CGCGGTCAGG
GCCGCGGCCC GGCGGGCCCG CGGCGAGAGC TTCGACGCGG TCGCGGTCTG CCTGCTGTTC
AGCTACGTCA ACCCCGCCCA CGAGATCCGC GCGGGCGAGA TCCTGGCCGA GGAGCTGGGC
GAGGACACCC TCGTCGTCCT CTCCCACGAG GTGGCCCGCG AATGGCGCGA GTACGAGCGG
ACGTCCTCCG CCGTGCTGGA GGCCTACACC GGACCGGTGG TCCGCCGTTA CCTCGCCGGG
ATCGAGGAGC GGTTCGCCGA GCGGGGCCTG ACCGTCCCGG TGCACGTCAT GCAGTCCTCC
GGAGGCCTGG TCAACGCCTC CCACGCGATG CGGCGCCCGC TGCAGACCCT GCTGTCCGGC
CCGGTCGGCG GCACCATGGG CGGCGTCGCG GCGGCCCGGC TCCTGGGCCG GCCCAACGCC
ATCTGCGTGG ACATGGGAGG CACCTCCTTC GACGTGTCCC TGGTGGTCGA CGGCAGGCCC
GACATCAGCA CCGAGGCGCG TGTCGAGGGC TTCCCCGTGC TGATGCCGAT CGTGAACCTC
CACACGATCG GCGCCGGCGG CGGCTCGATC GCCTACGCCG AGGCCGGTGC GCTGCGGGTC
GGCCCCGAGT CGGCAGGAGC CGTGCCCGGA CCGGCCTGCT ACGGCCGGGG CGGCGTCCGG
CCGACCGTCA CCGACGCCAA CGTGGTGCTC GGCAGGGTGG ACCCGTCCTG GTTCGCCGGC
GGGCTGATGT CCCTGGACGT CCATGCCGCC CACACGGCCG TGGCCGACCT GGGGCGCGAG
CTCCGCCTGG AGACGCTCCA GATCGCCGAG GGCATCTGCA GCGTGGCCAA CGCCAAGATG
GCCCAGGCCA TCCGGACCCT CACCGTGGAG CACGGGGTCG AGCCGCGCGA GTTCGCCCTG
GTCGCCTTCG GCGGCGCGGG CGCCATGCAC GCGGTCTTCA TCGCCCGCGA GCTCGGCATC
TCCGAGGTGG TCGTCCCCCG CTTCCCCGGC GCGTTCTCGG CCTGGGGCAT GCTGGAGGCC
GACGTCCGCC GCGACCTGAG CCATCCGTAC TTCCGCTCGG GCGGGGAGCT GGACGGCGCC
GACATGGCGT CCCGGCTGAA GGACCTGCAG GACCAGGCGC TGGAGGAGCT GGCCGGGCAG
GGCGTGGCCG GCGGCCGGAT GCGGATCGAG CACGCGGTGG ACATGCGCTA CGAGGGCCAG
GACTACACCC TGACCGTTCC CCTGCGGGAC GCCGCGGAGC CGGGCACGCC CGGCTTCCCG
GAGCGGATCG CGGCCCGCTA CGCCGACGCG CACACCAAGC GGTACGGCCA CGCCACCCCC
GAGGCGCCGG TGGAGTTCGT GACGCTCCGC AGCACCGGTT TCGGCGTCTT CCCCCGGACC
GCCGCCACCC ACGCCGCCCA GCCGGACGAG GGGACGCGGA CCGTACGAGA AGTGATCTTC
GACGGCGAGG CGCACCCCAC CCCCGTGCTG CGCCGCGGCG CGCTGGAGGG CGAGCTCACC
GGTCCGGCGA TCGTCGTCGA GGAGACGGCG ACCACGGTGA TCCCGCCGGG CTGCGTGGCC
TCGGTGGACG GCAACGGCTT TCTGATCATC AAGGTGGGAG GAGTCAAGTG A
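
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below normalize such a grouped sequence and compute its G+C fraction, which can then be compared against the GC content reported in the Gene Information section. The names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first twenty bases of the gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(grouped)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence above, as a small example.
print(gc_fraction("ATGAGCTTCC GAACCGCCAT"))
```

Pasting the full gene sequence into `gc_fraction` gives the whole-gene value as a fraction; multiply by 100 to compare with the percentage in the record.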
 
Protein sequence
MSFRTAMDIG GTFTDVVRYD ERTGRVVASK APTTPGNLAD GVFSALGRVV DDPSEISFFV 
HGTTQGLNAL LERKGARVLL VTGEGARDVY RIARGNRDRM FDLRYRKPEP LVPRSDVTEV
AGRLDWRGEE LVPLDEGAVR AAARRARGES FDAVAVCLLF SYVNPAHEIR AGEILAEELG
EDTLVVLSHE VAREWREYER TSSAVLEAYT GPVVRRYLAG IEERFAERGL TVPVHVMQSS
GGLVNASHAM RRPLQTLLSG PVGGTMGGVA AARLLGRPNA ICVDMGGTSF DVSLVVDGRP
DISTEARVEG FPVLMPIVNL HTIGAGGGSI AYAEAGALRV GPESAGAVPG PACYGRGGVR
PTVTDANVVL GRVDPSWFAG GLMSLDVHAA HTAVADLGRE LRLETLQIAE GICSVANAKM
AQAIRTLTVE HGVEPREFAL VAFGGAGAMH AVFIARELGI SEVVVPRFPG AFSAWGMLEA
DVRRDLSHPY FRSGGELDGA DMASRLKDLQ DQALEELAGQ GVAGGRMRIE HAVDMRYEGQ
DYTLTVPLRD AAEPGTPGFP ERIAARYADA HTKRYGHATP EAPVEFVTLR STGFGVFPRT
AATHAAQPDE GTRTVREVIF DGEAHPTPVL RRGALEGELT GPAIVVEETA TTVIPPGCVA
SVDGNGFLII KVGGVK
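
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes the gene sequence is given in the coding orientation, and it uses the fact that NCBI translation table 11 shares its codon-to-amino-acid assignments with the standard code (the two differ in which start codons are permitted). The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(grouped.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence; compare with the start of the protein.
print(translate("ATGAGCTTCC GAACCGCCAT GGACATCGGC"))  # MSFRTAMDIG
```

Translating the first thirty bases reproduces the first ten residues of the protein sequence, and the final codons of the gene (AAG TGA) correspond to the terminal lysine followed by a stop.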