Gene Information |
Locus tag | Sros_3166 |
Symbol | |
ID | 8666454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3447357 |
End bp | 3448559 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | Zn-dependent dipeptidase microsomal dipeptidase |
Protein accession | YP_003338854 |
Protein GI | 271964658 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCCC CCAATCCCCG CTACCAGGGA TACCGGTCGT TCGACTATCT GGAGCCACAC GCCGACTTCA AGGTCTTCGA CCTCGCCCCC GAAATCGACC GTGTTCCGGC GTACGACCTG GGCCTGTCGG CCGAGCAGTC CGCGCGGGTC AGCCGCCTCC TGACCGAGCA CATGGCCATC TCGCTGCACG AGCACCCCAA GGTCCTGACC GCGGACGTCA CGCTGCTGCG CGACTACAAC CGGACCGGAC GCAACGTGCT CGGGTACGAG GGCCTGGCGC GTTCGGGCAT GACCGCGCTC TTCGACAACT TCATGAACGG CACCAACTGC GTCACCAGCG AGCACGGCTG GAAGTGGGAC GACGTCATCT ACGACCTCGG CCTGCGCTTC GCCGACATCG CCAAGCAGGA CTTCGTCGTC CTGGCCCGCA CGGTCGAGGA GATCGAGAAG GCCAAGGCGG GCGGGCAGCT CGCGCTGGTG GCCGGGCTGG AGGCGGCGAC GATGATCGAG AATGAGCTCG ATCGTCTGGA CATCCTGTAC GGCTTCGGGG TCCGTCAGAT CGGTGTCGCG TATTCGCAGG CCAACCAGTT GGGTTCGGGG TTGGCCGAGC GGGCCGATGC CGGTCTGACC AATTTCGGCC GTCGTGCGGT GGAGCGGATG AACCGGCTCG GTATGGCGAT CGACATCTCG CACTCGGGTG ACCGTACGTG TCTGGAGGTC ATCGAGCATT CGGCGGTGCC GGTCTTCATC ACGCATGCCG GTGCTCGTGC GGTGTGGCCG ACCAACCGGA TGAAGCCCGA TGAGGTGATC AGGGCGTGTG CCGAGCGTGG TGGTGTGATC GGTCTGGAGG CGGCTCCGCA CACCACGCTG TCGGAGGAGC ATCGCGAGCA CTCGCTGGAG TCGGTGATGG ATCACTTCAC CTACTGCGTG GACCTGGTGG GCATCGACCA CGTCGCCTTC GGCCCCGACA CCAACTTCGG TGACCACGTG GGGCTGCACG ACTCCTTCAC CGGTCACCTC TCGATCGGCC AGGCCCACGG ACACGTCGAG CACCCGCGCG TGCCGTATGT GGCCGGTATG GAGAACCCGG CGGAGAACTT CACCAACATC GTCGGCTGGC TCGTCAAGCA CGGCTACGGC GACGACGACA TCAGCAAGGT CATCGGCGGG AACATCCTGC GCGTACTCAA GGAAGTCTGG TGA
|
Protein sequence | MQPPNPRYQG YRSFDYLEPH ADFKVFDLAP EIDRVPAYDL GLSAEQSARV SRLLTEHMAI SLHEHPKVLT ADVTLLRDYN RTGRNVLGYE GLARSGMTAL FDNFMNGTNC VTSEHGWKWD DVIYDLGLRF ADIAKQDFVV LARTVEEIEK AKAGGQLALV AGLEAATMIE NELDRLDILY GFGVRQIGVA YSQANQLGSG LAERADAGLT NFGRRAVERM NRLGMAIDIS HSGDRTCLEV IEHSAVPVFI THAGARAVWP TNRMKPDEVI RACAERGGVI GLEAAPHTTL SEEHREHSLE SVMDHFTYCV DLVGIDHVAF GPDTNFGDHV GLHDSFTGHL SIGQAHGHVE HPRVPYVAGM ENPAENFTNI VGWLVKHGYG DDDISKVIGG NILRVLKEVW
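The length fields in this record can be cross-checked against each other. The sketch below is illustrative and not part of the source database. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers introduced here. The `gc_fraction` helper shows how the GC content field could be recomputed from the gene sequence, with the spaces between 10-base groups ignored.

```python
# Values copied from the record above.
START_BP = 3447357
END_BP = 3448559
GENE_LENGTH_BP = 1203
PROTEIN_LENGTH_AA = 400


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G or C bases, ignoring whitespace between sequence groups."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates and gene length should agree.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Gene length and protein length should agree.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

To recompute the GC content, pass the full text of the "Gene sequence" field to `gc_fraction` and compare the rounded percentage with the "GC content" field.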