Gene Information |
Locus tag | Sros_3908 |
Symbol | |
ID | 8667198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4354609 |
End bp | 4356612 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Xaa-Pro dipeptidyl-peptidase |
Protein accession | YP_003339568 |
Protein GI | 271965372 |
COG category | |
COG ID | |
TIGRFAM ID | |
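The coordinate and length fields in this record are consistent with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(4354609, 4356612)
print(length)                  # 2004
print(protein_length(length))  # 667
```

Both values match the Gene Length and Protein Length fields listed above.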
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.135659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.218311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCAGAT GGAAGACAGT CCTCATCGCC CTGGCGGTCG CCGTGCCCTC CGGCGCCCTG CCCGCCGCGG CCGCGCGATC CGCGACCGCC CCGATGGAGA GCAGCGCGGT CGCGCGATCC GCGTCCGCCC CGATCCAGAA CAACGAGACC CAGCCCGTCT ACTCGCGCGC CGACGCCCTG GCCGAGACCG TGTTCGTCGA GGTCGCCGGG GTCGACAGCG ACCGTGACGG CCGTCCGGAC CGGGTCGCCG TCGACATCAT GCGGCCCAAG GAGACCGCCT CCGGGCTGAA GGTCCCCGTC ATCATGGAGG CCAGCCCCTA CTACGCGGGC GGCAACGACG TGCCCAACCA CGTGGTGGAC CTCGACGGGT CCGACCCCGC GGGCATGGAG TACCGGCGCA ACAAGTTCAA CGACCTGCGC GACGAGATGT TCGGCCTCGA CGAGGACATG GCCACCTCCC GCGCGGCCGC GCCGCCCTTC CGCGGCTACT ACGACAACTA CTTCGTGCCG CGCGGCTACG CGGTCGCGCT GGTGGAGAAC CTCGGCTCCG GCCGCGCGAC CGGCTGCCCG ACGACCGGCC TGGCCAACGA GACCGCCGGC CCCAAGGCCG CCATCGACTG GCTCAACGGC CGGGCCCGGG GCTTCGACGC GTCCGGGAAC GAGATCCGGG CCGACTGGTC GACCGGCAAC GTCGGCATGA CCGGCGTCTC CTACAACGGC ACGCTCTCCT ACGCCGTCGC CGCGACCGGC GTCAAGGGGC TCAAGACGAT CGTGCCGATC GCGGCCATCT CCTCCTGGTA CGACTACTAC CGGGCCAACG GCGGCGTGAT CGCCGCCGGC GGCTACCAGG GCGAGGACGC CGACGTGCTG GCCCGTTACG TCCTCACCAG GGAGAACGCC GAGCAGGCCT GCGGCGCGCT GATGGACGAG ATCGAGCGCG ACCAGGACCG GGTCACCGGC GACTACTCCG CCTTCTGGGA CGGGCGCAAC TACCTCAACG ACGTGAGCAA GGTCCGCGCC AGCGTCTTCG TGGTCCACGG CCTCAACGAC TGGAACGTCA AGACCAAGCA GGCCGTCCAG TGGTGGGACG CGCTCGACCG GCGGGATGTG CCGCGCAAGA TCTGGCTCCA CCAGGGCGCC CACTTCAACC CCTTCTCCTT CGCCCAGCGC AACACCGAGT GGCTGCGCCA GCTCCACCAC TGGTTCGACT TCTACCTGTA CGGCCTGCGC AACGGCATCA TGGACGAGCC GCAGGCGGAC GTGGAGTCGG GGCCGGGCCA GTGGGCCCAG CACGCCTCGT GGCCGCTGCC CGGCACCCGT GACGTACGGC TGCGCCTGGC GGCCGGGACG GGCGGCCAGA ACGGCACCCT GAGCCGGGAG CGCTCCGGCG GCCGCGCCGT GGAGTCCTTC ACCGACCAGA ACGTCCGCAC GGCCGAGCAG CTGGCCGAGA ACGTCACGGC CACCGACCCC AACCGGCTGG CCTACCTGTC GCCGGCGCTG GCGACGGACG CCCGGCTCTC CGGCACGCCG AGCGTCTCGG TCAAGGCGGC GTTCGGCGGC GGTCGCTCGC CGTACCTGAC GGCCCTGCTG GTCGACTACG GCACCGACGC CCGCGCCTAC GGCGGTGTCA GCTACGGCCC CACCCCGGTC TGCTACGGCC AGGGCGTGCC GGGGGACACC GGCTGCGCCC GCCTCGCCGA GCACGTCCCG GTGACCGCCC CCTACAAGAT CATCACCAGG GGCTGGATCG ACGTCCGCAA CCGACACTCC GCCTCCCGGA CCGAACTGCT CCGCGAGGGC 
CGCTTCTACG ACTTCGACTT CGACCTCCAG CCCACCGACT ACGTCGTCAA GGCCGGTCAC CGGATCGGTG TCGTGCTCAT CTCCACCGAC CGCGACTTCA CGCTCCGGCT GCCCGCCGGG ACCGGCGTCT CGGTCGAGCC CGGTGACAGC TCCGTCGAGC TCCCCCTGGT CGGTGGCCGA TCGGCTCTCG GCCGGCTGTA CTGA
Protein sequence | MTRWKTVLIA LAVAVPSGAL PAAAARSATA PMESSAVARS ASAPIQNNET QPVYSRADAL AETVFVEVAG VDSDRDGRPD RVAVDIMRPK ETASGLKVPV IMEASPYYAG GNDVPNHVVD LDGSDPAGME YRRNKFNDLR DEMFGLDEDM ATSRAAAPPF RGYYDNYFVP RGYAVALVEN LGSGRATGCP TTGLANETAG PKAAIDWLNG RARGFDASGN EIRADWSTGN VGMTGVSYNG TLSYAVAATG VKGLKTIVPI AAISSWYDYY RANGGVIAAG GYQGEDADVL ARYVLTRENA EQACGALMDE IERDQDRVTG DYSAFWDGRN YLNDVSKVRA SVFVVHGLND WNVKTKQAVQ WWDALDRRDV PRKIWLHQGA HFNPFSFAQR NTEWLRQLHH WFDFYLYGLR NGIMDEPQAD VESGPGQWAQ HASWPLPGTR DVRLRLAAGT GGQNGTLSRE RSGGRAVESF TDQNVRTAEQ LAENVTATDP NRLAYLSPAL ATDARLSGTP SVSVKAAFGG GRSPYLTALL VDYGTDARAY GGVSYGPTPV CYGQGVPGDT GCARLAEHVP VTAPYKIITR GWIDVRNRHS ASRTELLREG RFYDFDFDLQ PTDYVVKAGH RIGVVLISTD RDFTLRLPAG TGVSVEPGDS SVELPLVGGR SALGRLY
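The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming whitespace is the only thing that has to be removed; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short input string is only the first three blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw):
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used here only as a short example.
raw = "ATGACCAGAT GGAAGACAGT CCTCATCGCC"
seq = clean_sequence(raw)
print(len(seq))  # 30
print(round(gc_percent(seq), 1))
```

Run on the full gene sequence, the same helper would be expected to give a value close to the GC content field in the Gene Information table, although that has not been checked here.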