Gene Information |
Locus tag | Sros_3981 |
Symbol | |
ID | 8667275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4435679 |
End bp | 4437268 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_003339634 |
Protein GI | 271965438 |
COG category | |
COG ID | |
TIGRFAM ID | |
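
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(4435679, 4437268)
print(length_bp)                  # 1590, matching "Gene Length"
print(protein_length(length_bp))  # 529, matching "Protein Length"
```

The strand field does not affect this arithmetic; for a gene on the minus strand the coding sequence is simply read as the reverse complement of the same interval.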
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.595687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCGT ATCGCGTTCT CCTCCAACGT GACCTGCCGG TGCCGATGAC CGACGGGGTC ACCCTTCTGG CCGACCGCTA CGTGCCGGTC GGCGCCGTGC GCCCGCCCAC GGTCCTGGTC CGCTCGCCGT ACGGCAGGCG CGGCGTGTTC GGGCTGGCCT TCGGCCGGGC GTTCGCCCGG CGGGGGTTCC AGGTCGTCCT GCAGAGCTGC CGGGGCGGGT TCGGCTCCGG CGGCCTGCTC GACCCGCTCG GCGACGAGCA CGAGGACGGG CTGGCCACGC TGGCCTGGCT GCGCGGGCAG CCCTGGTACG GCGGGAGCCT GGCCATGCAC GGCCCCTCCT ACCTGGGCTA CGCCCAGTGG GCGATCGCCC CGTTCGCCGG CCCCGACCTG AAGGCCATGG CCACCTCGGT GACCGCCTCC CAGTTCCGCG ACGCCGCCTA CGTGGGCGGG GCGTTCGCGC TGGAGTCCTC ACTGATCTGG ACCACGCTGA CCGCCTCGAT GGACACCCGG TTCGGCGGGG CCGGCGCGCT GCTGGCCCCC CGCAGGACCC GGCGCGCGGC ACTGTCGGGA CGCCCCCTCG GGGAGCTGGA CGTGCTGTCG GCCGGCAGGG GGCTGCCGTT CTTCCAGGAC CTGCTGGCCC ACCACGCCGA TCCGGCCGCC TACTGGGGCA GGCGCGACTT CTCGGCCTCG GTGGGCGAGG TGGAGGCGGC GGTCACCATG GTCGGCGGAT GGTACGACGT GTTCCTGCCG TGGCAGGTCA AGGACTACAC GACGATGCGG GCGGCGGGGC GGCGGCCGTA CCTGACGATC GGCCCGTGGT ACCACGCCGA CATCCGGCAC GGCCGGGTGG CCAACGCCGA CGCGCTGGCC TGGTTCAGGG CGCACCTGCT GGGCGACCCC TCGGGGCTGC GGGAGCAGCC GGTCAGGCTG TACGTCACCG GTGCGGGCGA GTGGCGCGAC TATCCCGACT GGCCGGTGCC CGGCATGCGG GAGCAGCGCT GGCACCTGCA GCCGGGGCTC GCGCTCTCGA CCGGCAACCC GCGGGAGGGC GACCCCGACC GCTACCGCTA CGACCCCGCG CACCCCACGC CCGTGCTGGG CGGGCCGGTC CTGCTGGGCA ACTCCGAGCC GCGCGACAAC CGGCGCCTGG AGGCCCGGCG CGACGTGCTC GTCTACACCG GCCCCGAGCT GCGCGAGGAC ACCGACATGA TCGGTCCGGT CTCCGCCGAC CTCTACCTCC GGTCGAGCAC CGAGCACGCC GACGTGGTGG TGCGGGTCTG CGACGTGCAC CCGGACGGCG CGTCCTACAA CGTGTGCGAG GGCGTGCGCC GCCTGTCGCC CGGCGCTCCC CCGGCCGGCT CCGACGGGAT CCGCCGCGTC CGGGTGGACC TGTGGCCGGT CGGCCACCGC TTCCGGCGCG GCCACCGGAT CCGCCTGCAC GTGGCCGGCG GCGCCTATCC CCGCATCGCC CGCAACCTCG GCACGGGAGA GCCGCTGGGC ACCGGCCGCA CGATGGTCGC GGCCGACCAC GAGGTCTTCC ACGACCCCGC TCACCCCTCC GCGGTCGTGC TGCCCCTCGT CCGCGGCTGA
|
Protein sequence | MPPYRVLLQR DLPVPMTDGV TLLADRYVPV GAVRPPTVLV RSPYGRRGVF GLAFGRAFAR RGFQVVLQSC RGGFGSGGLL DPLGDEHEDG LATLAWLRGQ PWYGGSLAMH GPSYLGYAQW AIAPFAGPDL KAMATSVTAS QFRDAAYVGG AFALESSLIW TTLTASMDTR FGGAGALLAP RRTRRAALSG RPLGELDVLS AGRGLPFFQD LLAHHADPAA YWGRRDFSAS VGEVEAAVTM VGGWYDVFLP WQVKDYTTMR AAGRRPYLTI GPWYHADIRH GRVANADALA WFRAHLLGDP SGLREQPVRL YVTGAGEWRD YPDWPVPGMR EQRWHLQPGL ALSTGNPREG DPDRYRYDPA HPTPVLGGPV LLGNSEPRDN RRLEARRDVL VYTGPELRED TDMIGPVSAD LYLRSSTEHA DVVVRVCDVH PDGASYNVCE GVRRLSPGAP PAGSDGIRRV RVDLWPVGHR FRRGHRIRLH VAGGAYPRIA RNLGTGEPLG TGRTMVAADH EVFHDPAHPS AVVLPLVRG
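
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch for working with them, assuming the blocks are purely a display convention: strip the whitespace, compute the GC fraction, and check that the CDS ends in one of the three stop codons of translation table 11. The helper names are hypothetical and not part of any database API.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Usage with the first two blocks of the gene sequence above;
# paste the full "Gene sequence" value to check the whole CDS.
fragment = clean_sequence("ATGCCCCCGT ATCGCGTTCT")
print(len(fragment), gc_fraction(fragment))
```

Applied to the full gene sequence, the same functions give a value that can be compared with the GC content field, and `ends_with_stop` should hold if the CDS is complete.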