Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4047 |
Symbol | |
ID | 8667341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4505360 |
End bp | 4507390 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_003339698 |
Protein GI | 271965502 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0166577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00142198 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCTTCC GAACCGCCAT GGACATCGGC GGTACCTTCA CCGACGTCGT CCGCTACGAC GAACGGACCG GCCGCGTGGT GGCCTCGAAG GCGCCGACCA CACCGGGAAA CCTCGCCGAC GGCGTGTTCT CCGCGCTCGG CCGGGTCGTG GACGACCCCT CCGAGATCTC CTTCTTCGTG CACGGCACCA CCCAGGGGCT CAACGCGCTG CTGGAGCGCA AGGGGGCGCG GGTGCTGCTG GTCACCGGGG AGGGGGCCCG GGACGTCTAC CGGATCGCCC GGGGCAACCG GGACCGGATG TTCGACCTGC GCTACCGCAA GCCCGAGCCG CTGGTGCCCC GCTCGGACGT GACGGAGGTC GCCGGACGGC TGGACTGGCG CGGCGAGGAA CTCGTCCCCC TGGACGAGGG CGCGGTCAGG GCCGCGGCCC GGCGGGCCCG CGGCGAGAGC TTCGACGCGG TCGCGGTCTG CCTGCTGTTC AGCTACGTCA ACCCCGCCCA CGAGATCCGC GCGGGCGAGA TCCTGGCCGA GGAGCTGGGC GAGGACACCC TCGTCGTCCT CTCCCACGAG GTGGCCCGCG AATGGCGCGA GTACGAGCGG ACGTCCTCCG CCGTGCTGGA GGCCTACACC GGACCGGTGG TCCGCCGTTA CCTCGCCGGG ATCGAGGAGC GGTTCGCCGA GCGGGGCCTG ACCGTCCCGG TGCACGTCAT GCAGTCCTCC GGAGGCCTGG TCAACGCCTC CCACGCGATG CGGCGCCCGC TGCAGACCCT GCTGTCCGGC CCGGTCGGCG GCACCATGGG CGGCGTCGCG GCGGCCCGGC TCCTGGGCCG GCCCAACGCC ATCTGCGTGG ACATGGGAGG CACCTCCTTC GACGTGTCCC TGGTGGTCGA CGGCAGGCCC GACATCAGCA CCGAGGCGCG TGTCGAGGGC TTCCCCGTGC TGATGCCGAT CGTGAACCTC CACACGATCG GCGCCGGCGG CGGCTCGATC GCCTACGCCG AGGCCGGTGC GCTGCGGGTC GGCCCCGAGT CGGCAGGAGC CGTGCCCGGA CCGGCCTGCT ACGGCCGGGG CGGCGTCCGG CCGACCGTCA CCGACGCCAA CGTGGTGCTC GGCAGGGTGG ACCCGTCCTG GTTCGCCGGC GGGCTGATGT CCCTGGACGT CCATGCCGCC CACACGGCCG TGGCCGACCT GGGGCGCGAG CTCCGCCTGG AGACGCTCCA GATCGCCGAG GGCATCTGCA GCGTGGCCAA CGCCAAGATG GCCCAGGCCA TCCGGACCCT CACCGTGGAG CACGGGGTCG AGCCGCGCGA GTTCGCCCTG GTCGCCTTCG GCGGCGCGGG CGCCATGCAC GCGGTCTTCA TCGCCCGCGA GCTCGGCATC TCCGAGGTGG TCGTCCCCCG CTTCCCCGGC GCGTTCTCGG CCTGGGGCAT GCTGGAGGCC GACGTCCGCC GCGACCTGAG CCATCCGTAC TTCCGCTCGG GCGGGGAGCT GGACGGCGCC GACATGGCGT CCCGGCTGAA GGACCTGCAG GACCAGGCGC TGGAGGAGCT GGCCGGGCAG GGCGTGGCCG GCGGCCGGAT GCGGATCGAG CACGCGGTGG ACATGCGCTA CGAGGGCCAG GACTACACCC TGACCGTTCC CCTGCGGGAC GCCGCGGAGC CGGGCACGCC CGGCTTCCCG GAGCGGATCG CGGCCCGCTA CGCCGACGCG CACACCAAGC GGTACGGCCA CGCCACCCCC GAGGCGCCGG TGGAGTTCGT GACGCTCCGC AGCACCGGTT TCGGCGTCTT CCCCCGGACC GCCGCCACCC ACGCCGCCCA GCCGGACGAG GGGACGCGGA CCGTACGAGA AGTGATCTTC GACGGCGAGG CGCACCCCAC CCCCGTGCTG CGCCGCGGCG CGCTGGAGGG CGAGCTCACC GGTCCGGCGA TCGTCGTCGA GGAGACGGCG ACCACGGTGA TCCCGCCGGG CTGCGTGGCC TCGGTGGACG GCAACGGCTT TCTGATCATC AAGGTGGGAG GAGTCAAGTG A
|
Protein sequence | MSFRTAMDIG GTFTDVVRYD ERTGRVVASK APTTPGNLAD GVFSALGRVV DDPSEISFFV HGTTQGLNAL LERKGARVLL VTGEGARDVY RIARGNRDRM FDLRYRKPEP LVPRSDVTEV AGRLDWRGEE LVPLDEGAVR AAARRARGES FDAVAVCLLF SYVNPAHEIR AGEILAEELG EDTLVVLSHE VAREWREYER TSSAVLEAYT GPVVRRYLAG IEERFAERGL TVPVHVMQSS GGLVNASHAM RRPLQTLLSG PVGGTMGGVA AARLLGRPNA ICVDMGGTSF DVSLVVDGRP DISTEARVEG FPVLMPIVNL HTIGAGGGSI AYAEAGALRV GPESAGAVPG PACYGRGGVR PTVTDANVVL GRVDPSWFAG GLMSLDVHAA HTAVADLGRE LRLETLQIAE GICSVANAKM AQAIRTLTVE HGVEPREFAL VAFGGAGAMH AVFIARELGI SEVVVPRFPG AFSAWGMLEA DVRRDLSHPY FRSGGELDGA DMASRLKDLQ DQALEELAGQ GVAGGRMRIE HAVDMRYEGQ DYTLTVPLRD AAEPGTPGFP ERIAARYADA HTKRYGHATP EAPVEFVTLR STGFGVFPRT AATHAAQPDE GTRTVREVIF DGEAHPTPVL RRGALEGELT GPAIVVEETA TTVIPPGCVA SVDGNGFLII KVGGVK
|
| |