Gene Sros_3981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3981 
Symbol 
ID8667275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4435679 
End bp4437268 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content75% 
IMG OID 
ProductX-Pro dipeptidyl-peptidase domain-containing protein 
Protein accessionYP_003339634 
Protein GI271965438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.595687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCGT ATCGCGTTCT CCTCCAACGT GACCTGCCGG TGCCGATGAC CGACGGGGTC 
ACCCTTCTGG CCGACCGCTA CGTGCCGGTC GGCGCCGTGC GCCCGCCCAC GGTCCTGGTC
CGCTCGCCGT ACGGCAGGCG CGGCGTGTTC GGGCTGGCCT TCGGCCGGGC GTTCGCCCGG
CGGGGGTTCC AGGTCGTCCT GCAGAGCTGC CGGGGCGGGT TCGGCTCCGG CGGCCTGCTC
GACCCGCTCG GCGACGAGCA CGAGGACGGG CTGGCCACGC TGGCCTGGCT GCGCGGGCAG
CCCTGGTACG GCGGGAGCCT GGCCATGCAC GGCCCCTCCT ACCTGGGCTA CGCCCAGTGG
GCGATCGCCC CGTTCGCCGG CCCCGACCTG AAGGCCATGG CCACCTCGGT GACCGCCTCC
CAGTTCCGCG ACGCCGCCTA CGTGGGCGGG GCGTTCGCGC TGGAGTCCTC ACTGATCTGG
ACCACGCTGA CCGCCTCGAT GGACACCCGG TTCGGCGGGG CCGGCGCGCT GCTGGCCCCC
CGCAGGACCC GGCGCGCGGC ACTGTCGGGA CGCCCCCTCG GGGAGCTGGA CGTGCTGTCG
GCCGGCAGGG GGCTGCCGTT CTTCCAGGAC CTGCTGGCCC ACCACGCCGA TCCGGCCGCC
TACTGGGGCA GGCGCGACTT CTCGGCCTCG GTGGGCGAGG TGGAGGCGGC GGTCACCATG
GTCGGCGGAT GGTACGACGT GTTCCTGCCG TGGCAGGTCA AGGACTACAC GACGATGCGG
GCGGCGGGGC GGCGGCCGTA CCTGACGATC GGCCCGTGGT ACCACGCCGA CATCCGGCAC
GGCCGGGTGG CCAACGCCGA CGCGCTGGCC TGGTTCAGGG CGCACCTGCT GGGCGACCCC
TCGGGGCTGC GGGAGCAGCC GGTCAGGCTG TACGTCACCG GTGCGGGCGA GTGGCGCGAC
TATCCCGACT GGCCGGTGCC CGGCATGCGG GAGCAGCGCT GGCACCTGCA GCCGGGGCTC
GCGCTCTCGA CCGGCAACCC GCGGGAGGGC GACCCCGACC GCTACCGCTA CGACCCCGCG
CACCCCACGC CCGTGCTGGG CGGGCCGGTC CTGCTGGGCA ACTCCGAGCC GCGCGACAAC
CGGCGCCTGG AGGCCCGGCG CGACGTGCTC GTCTACACCG GCCCCGAGCT GCGCGAGGAC
ACCGACATGA TCGGTCCGGT CTCCGCCGAC CTCTACCTCC GGTCGAGCAC CGAGCACGCC
GACGTGGTGG TGCGGGTCTG CGACGTGCAC CCGGACGGCG CGTCCTACAA CGTGTGCGAG
GGCGTGCGCC GCCTGTCGCC CGGCGCTCCC CCGGCCGGCT CCGACGGGAT CCGCCGCGTC
CGGGTGGACC TGTGGCCGGT CGGCCACCGC TTCCGGCGCG GCCACCGGAT CCGCCTGCAC
GTGGCCGGCG GCGCCTATCC CCGCATCGCC CGCAACCTCG GCACGGGAGA GCCGCTGGGC
ACCGGCCGCA CGATGGTCGC GGCCGACCAC GAGGTCTTCC ACGACCCCGC TCACCCCTCC
GCGGTCGTGC TGCCCCTCGT CCGCGGCTGA
 
Protein sequence
MPPYRVLLQR DLPVPMTDGV TLLADRYVPV GAVRPPTVLV RSPYGRRGVF GLAFGRAFAR 
RGFQVVLQSC RGGFGSGGLL DPLGDEHEDG LATLAWLRGQ PWYGGSLAMH GPSYLGYAQW
AIAPFAGPDL KAMATSVTAS QFRDAAYVGG AFALESSLIW TTLTASMDTR FGGAGALLAP
RRTRRAALSG RPLGELDVLS AGRGLPFFQD LLAHHADPAA YWGRRDFSAS VGEVEAAVTM
VGGWYDVFLP WQVKDYTTMR AAGRRPYLTI GPWYHADIRH GRVANADALA WFRAHLLGDP
SGLREQPVRL YVTGAGEWRD YPDWPVPGMR EQRWHLQPGL ALSTGNPREG DPDRYRYDPA
HPTPVLGGPV LLGNSEPRDN RRLEARRDVL VYTGPELRED TDMIGPVSAD LYLRSSTEHA
DVVVRVCDVH PDGASYNVCE GVRRLSPGAP PAGSDGIRRV RVDLWPVGHR FRRGHRIRLH
VAGGAYPRIA RNLGTGEPLG TGRTMVAADH EVFHDPAHPS AVVLPLVRG