Gene Information |
Locus tag | Sros_4699 |
Symbol | |
ID | 8667993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5224044 |
End bp | 5225672 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_003340290 |
Protein GI | 271966094 |
COG category | |
COG ID | |
TIGRFAM ID | |
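
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (which is consistent with the listed gene length) and an unspliced CDS whose final codon is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5224044
END_BP = 5225672


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1629
print(protein_length(length_bp))  # 542
```

Both results agree with the Gene Length and Protein Length fields of the record.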
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.401259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTCC TCAGCCGTTT GCACGGTGTG CGCCCCGCCC TGCGTCACCG CGTATGCGTG CGGCGCAACG TGGAGATCCC GGCCGCTGAT GGTGTGCGAC TGCTGGCCAC GCACTATTAC CCGGCAGGCC AGCGGCGGCC GCCGCTGGTG CTGCTGCGCA GCCCGTACGG CCGCGGCAAC GCCCTGGACC AGCTTCCAGC CCTACTGGCC GAGCGTGGCT ACCAAGTCCT GTACCAAAGC CTGCGCGGCA CGGCCGGCTC GGGCGGCAGC TTCGACGGCT TCGTCATCGA CCCAGCCGAC GCCGACGGCA CGCTGAGCTG GCTGCGCGCC CAGCCGTGGT TCGGCGGCGA GCTGGCCACC TGGGGTGCCA GCTACCTCGG CCTCGTCCAA TGGGAACTGG CCGCCCGCGA TATCCCGGAA TGGAAGATCG CGCTCGTCCA GGACGCGCCG TCCAGCTTCG CCGAACACTT CATGTACCCC GGCGGAGCGT TCGCGACAGG CAACGCGCTC GGCTGGGTGC AGCTGGTGGA GCGGATGTTC ACCAGCGGCC ACGGCATCGT CCGCCAAATG GCCGGCCTGG TCGGTGCCGC ACGGCGACTG TCGAGTGCCA CGCTCGCCCT CCCCCTGCAG GAAGCCGATC AGGCGCTGAC CGGGCATCCC GTGCTGTGGT TCCGGGATTG GGTCCACCAC GGCCCGGGCG AGCCGTACTG GGCGAGCACC GACCACCGCC ACAACGTCGC GCGCATGCCA CCGGTGGTGC ACCTGCAGGG GGGCTGGCAT GACTTCTTCC TGCCGGGCAT GCTGGCCGAT TACGCAGCCC TGTCGGCCGC CGGACGGAAC GTGCGGCTGC TGATCGGCCC GTGGACCCAT GGCCGCGGGC TCTACACCCG CGAAGGACTC AGCGACGCGC TCGCCATGCT GGATGCCGCC TTGCTTGGAC GGAAGGCGCC CTCGGGTGTG CGGCTGTTCG TCACCGGTGC GGGCCGTTGG GTCGACGTGC CCACGTGGCC GCCCGCGCAC AAGGCGACCC CCTGGTTCCT GCACCTGGGC GGCGGGCTGA GCCGTACCCG ACCCGACGGC CGGGCAGAGC CCAGCCGCTA CCGCTACGAC CCCGCCGACC CCACCCCCAC GGTCGGTGGC ACGCAGGTGG GCATGTCGGC GGGCGCGAAG GACAATCGCC GCATCGAGGC CAGAGCCGAC GTGCTCACCT TCACCACCGC GCCCCTTACG GAGGATGTCG AGGCCGTCGG CTCGGTCCGC GTCCGCCTGC ACGCACGCTC CGACAACCCG CACGTCGACT ACTTCGCCCG CATCTGCGAC GTCGATCCGC GCGGTAGGTC GGTCAATGTG TGCGACGGCA TCATCCGGCT GTACGAGTCC GGCGCAAGCG GGACGGGCGA TGTCGGGGTG GCCGACATCG CCCTGTGGCC GATGGCGCAC CGTTTCCGGC GCGGTCACCG CATCCGGCTG CAGGTCTCCA GCGGGGCCCA CCCGCGCTTC GGCCGTAACC CCGGCACGGG CGAACCGCTC GCGACCGGAC GCGAGCTGCG CGCCTCGGAG CACGAGATCT TCCACGACCA CAACAACCCG TCAGCCCTCT GGCTGCCGCT TACTTCAGGA GCATCGTGA
|
Protein sequence | MTFLSRLHGV RPALRHRVCV RRNVEIPAAD GVRLLATHYY PAGQRRPPLV LLRSPYGRGN ALDQLPALLA ERGYQVLYQS LRGTAGSGGS FDGFVIDPAD ADGTLSWLRA QPWFGGELAT WGASYLGLVQ WELAARDIPE WKIALVQDAP SSFAEHFMYP GGAFATGNAL GWVQLVERMF TSGHGIVRQM AGLVGAARRL SSATLALPLQ EADQALTGHP VLWFRDWVHH GPGEPYWAST DHRHNVARMP PVVHLQGGWH DFFLPGMLAD YAALSAAGRN VRLLIGPWTH GRGLYTREGL SDALAMLDAA LLGRKAPSGV RLFVTGAGRW VDVPTWPPAH KATPWFLHLG GGLSRTRPDG RAEPSRYRYD PADPTPTVGG TQVGMSAGAK DNRRIEARAD VLTFTTAPLT EDVEAVGSVR VRLHARSDNP HVDYFARICD VDPRGRSVNV CDGIIRLYES GASGTGDVGV ADIALWPMAH RFRRGHRIRL QVSSGAHPRF GRNPGTGEPL ATGRELRASE HEIFHDHNNP SALWLPLTSG AS
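
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons, and it does not handle alternative start codons (the gene here starts with ATG, so that case does not arise). The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced for this example.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last nine bases of the gene sequence above.
print(translate("ATGACCTTCC TCAGCCGTTT GCACGGTGTG"))  # MTFLSRLHGV
print(translate("GCATCGTGA"))                          # AS
```

The two printed fragments match the first block and the last two residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be rechecked in the same way.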