Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_2852 |
Symbol | |
ID | 8666138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3099264 |
End bp | 3100340 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | proline dipeptidase, putative |
Protein accession | YP_003338552 |
Protein GI | 271964356 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.164066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.067191 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAACA TGACCGTCAC CCATGCTCGG CGCCGCGAGA GCCTCGCGGC ACTGCTCCCC GCTCACGAGG CGGACGCGAT CCTGGTCACC CGCGGCGTCA ACGTGCGCTA CCTGACCGGC CTGGCCAGCT CCAACGCGGC GGTGCTCGTC CGGGCCGACG CCCGTGCCAC GCTGGCCACC GACTCCCGCT ACGCCGAGAC CGCGCGGCGC TCCTGCTCCG ACATCGAGGT GGTCGAGGAG CGTGACGTCG CCGGATGCCT GGTGGTGATG GCCGACAGGG TCGCCGTCGA GGCCCACCAC ATGCCCGTCG CCGACTACTT CCGGCTGGGC GAGGACCTGC TGCGCCTGTC CGGATTGGTG GAGTCGGTGC GCCGGGTCAA GGACGAGGCC GAGATCGACC TCCTCCGCGA CGCGTGCGCG ATCACCGACC AGGCCTTCGC CGACGTGCTG CCCATGTTGC GGCCCGGGGT GACCGAGAGG GACATCGCCA GGGCGCTGGA GTCCCGGATG ATCGAGCTGG GCGCCGAGAA GCCCGCCTTC GACTCGATCG TGGCGAGCGG TCCCAACGGC TCGATCCCGC ACCACTCGCC CTCGGGCCGG CCGCTGGAGC GGGGCGATCT GGTCACGATG GACTTCGGGG CGCTGCACGA GGGCTACCAT GCCGACATGA CCCGGACGGT GGCGATCGGC GAGCCCGCCT CGTGGCAGCG CGAGCTGTAC GACCTGGTCC GCGCAGCCCA GCGCGCCGGC CGCCATGCCG TACGGCCCGG CGCCGCCCCC CACGAGGTGG ACGCCGCCGC CCGCGAGGTG ATCGCGCAGG CGGGGTACGG AGACTACTTC GGCCACGGCC TCGGTCACGG TGTCGGGCTG GAGATTCACG AGGTGCCGTT CCTGTCTCCG CTGAAGCCAG AGCCCGACCA TGAGCACGCT AGACTAGAGG ATCGAGTTCC GGTCACCGTT GAGCCTGGAG TTTACCTACC GGGCAGGGGC GGCGTCCGCA TCGAGGACAC GCTCGTGACG CGTGATGACG GACCGGAACT TCTCACACGG ACGACCAAAG AGCTGCTTGT CCTTTAA
|
Protein sequence | MRNMTVTHAR RRESLAALLP AHEADAILVT RGVNVRYLTG LASSNAAVLV RADARATLAT DSRYAETARR SCSDIEVVEE RDVAGCLVVM ADRVAVEAHH MPVADYFRLG EDLLRLSGLV ESVRRVKDEA EIDLLRDACA ITDQAFADVL PMLRPGVTER DIARALESRM IELGAEKPAF DSIVASGPNG SIPHHSPSGR PLERGDLVTM DFGALHEGYH ADMTRTVAIG EPASWQRELY DLVRAAQRAG RHAVRPGAAP HEVDAAAREV IAQAGYGDYF GHGLGHGVGL EIHEVPFLSP LKPEPDHEHA RLEDRVPVTV EPGVYLPGRG GVRIEDTLVT RDDGPELLTR TTKELLVL
|
| |