Gene Sros_3908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3908 
Symbol 
ID8667198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4354609 
End bp4356612 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content72% 
IMG OID 
ProductXaa-Pro dipeptidyl-peptidase 
Protein accessionYP_003339568 
Protein GI271965372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.218311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGAT GGAAGACAGT CCTCATCGCC CTGGCGGTCG CCGTGCCCTC CGGCGCCCTG 
CCCGCCGCGG CCGCGCGATC CGCGACCGCC CCGATGGAGA GCAGCGCGGT CGCGCGATCC
GCGTCCGCCC CGATCCAGAA CAACGAGACC CAGCCCGTCT ACTCGCGCGC CGACGCCCTG
GCCGAGACCG TGTTCGTCGA GGTCGCCGGG GTCGACAGCG ACCGTGACGG CCGTCCGGAC
CGGGTCGCCG TCGACATCAT GCGGCCCAAG GAGACCGCCT CCGGGCTGAA GGTCCCCGTC
ATCATGGAGG CCAGCCCCTA CTACGCGGGC GGCAACGACG TGCCCAACCA CGTGGTGGAC
CTCGACGGGT CCGACCCCGC GGGCATGGAG TACCGGCGCA ACAAGTTCAA CGACCTGCGC
GACGAGATGT TCGGCCTCGA CGAGGACATG GCCACCTCCC GCGCGGCCGC GCCGCCCTTC
CGCGGCTACT ACGACAACTA CTTCGTGCCG CGCGGCTACG CGGTCGCGCT GGTGGAGAAC
CTCGGCTCCG GCCGCGCGAC CGGCTGCCCG ACGACCGGCC TGGCCAACGA GACCGCCGGC
CCCAAGGCCG CCATCGACTG GCTCAACGGC CGGGCCCGGG GCTTCGACGC GTCCGGGAAC
GAGATCCGGG CCGACTGGTC GACCGGCAAC GTCGGCATGA CCGGCGTCTC CTACAACGGC
ACGCTCTCCT ACGCCGTCGC CGCGACCGGC GTCAAGGGGC TCAAGACGAT CGTGCCGATC
GCGGCCATCT CCTCCTGGTA CGACTACTAC CGGGCCAACG GCGGCGTGAT CGCCGCCGGC
GGCTACCAGG GCGAGGACGC CGACGTGCTG GCCCGTTACG TCCTCACCAG GGAGAACGCC
GAGCAGGCCT GCGGCGCGCT GATGGACGAG ATCGAGCGCG ACCAGGACCG GGTCACCGGC
GACTACTCCG CCTTCTGGGA CGGGCGCAAC TACCTCAACG ACGTGAGCAA GGTCCGCGCC
AGCGTCTTCG TGGTCCACGG CCTCAACGAC TGGAACGTCA AGACCAAGCA GGCCGTCCAG
TGGTGGGACG CGCTCGACCG GCGGGATGTG CCGCGCAAGA TCTGGCTCCA CCAGGGCGCC
CACTTCAACC CCTTCTCCTT CGCCCAGCGC AACACCGAGT GGCTGCGCCA GCTCCACCAC
TGGTTCGACT TCTACCTGTA CGGCCTGCGC AACGGCATCA TGGACGAGCC GCAGGCGGAC
GTGGAGTCGG GGCCGGGCCA GTGGGCCCAG CACGCCTCGT GGCCGCTGCC CGGCACCCGT
GACGTACGGC TGCGCCTGGC GGCCGGGACG GGCGGCCAGA ACGGCACCCT GAGCCGGGAG
CGCTCCGGCG GCCGCGCCGT GGAGTCCTTC ACCGACCAGA ACGTCCGCAC GGCCGAGCAG
CTGGCCGAGA ACGTCACGGC CACCGACCCC AACCGGCTGG CCTACCTGTC GCCGGCGCTG
GCGACGGACG CCCGGCTCTC CGGCACGCCG AGCGTCTCGG TCAAGGCGGC GTTCGGCGGC
GGTCGCTCGC CGTACCTGAC GGCCCTGCTG GTCGACTACG GCACCGACGC CCGCGCCTAC
GGCGGTGTCA GCTACGGCCC CACCCCGGTC TGCTACGGCC AGGGCGTGCC GGGGGACACC
GGCTGCGCCC GCCTCGCCGA GCACGTCCCG GTGACCGCCC CCTACAAGAT CATCACCAGG
GGCTGGATCG ACGTCCGCAA CCGACACTCC GCCTCCCGGA CCGAACTGCT CCGCGAGGGC
CGCTTCTACG ACTTCGACTT CGACCTCCAG CCCACCGACT ACGTCGTCAA GGCCGGTCAC
CGGATCGGTG TCGTGCTCAT CTCCACCGAC CGCGACTTCA CGCTCCGGCT GCCCGCCGGG
ACCGGCGTCT CGGTCGAGCC CGGTGACAGC TCCGTCGAGC TCCCCCTGGT CGGTGGCCGA
TCGGCTCTCG GCCGGCTGTA CTGA
 
Protein sequence
MTRWKTVLIA LAVAVPSGAL PAAAARSATA PMESSAVARS ASAPIQNNET QPVYSRADAL 
AETVFVEVAG VDSDRDGRPD RVAVDIMRPK ETASGLKVPV IMEASPYYAG GNDVPNHVVD
LDGSDPAGME YRRNKFNDLR DEMFGLDEDM ATSRAAAPPF RGYYDNYFVP RGYAVALVEN
LGSGRATGCP TTGLANETAG PKAAIDWLNG RARGFDASGN EIRADWSTGN VGMTGVSYNG
TLSYAVAATG VKGLKTIVPI AAISSWYDYY RANGGVIAAG GYQGEDADVL ARYVLTRENA
EQACGALMDE IERDQDRVTG DYSAFWDGRN YLNDVSKVRA SVFVVHGLND WNVKTKQAVQ
WWDALDRRDV PRKIWLHQGA HFNPFSFAQR NTEWLRQLHH WFDFYLYGLR NGIMDEPQAD
VESGPGQWAQ HASWPLPGTR DVRLRLAAGT GGQNGTLSRE RSGGRAVESF TDQNVRTAEQ
LAENVTATDP NRLAYLSPAL ATDARLSGTP SVSVKAAFGG GRSPYLTALL VDYGTDARAY
GGVSYGPTPV CYGQGVPGDT GCARLAEHVP VTAPYKIITR GWIDVRNRHS ASRTELLREG
RFYDFDFDLQ PTDYVVKAGH RIGVVLISTD RDFTLRLPAG TGVSVEPGDS SVELPLVGGR
SALGRLY