Gene Sros_3689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3689 
Symbol 
ID8666977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4085633 
End bp4086925 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content68% 
IMG OID 
Productputative prolyl aminopeptidase 
Protein accessionYP_003339358 
Protein GI271965162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.798401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.360636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAT CACGCCGATA CGGCACCGTG CTCACTGACC ACACCTTCAC CGTCCCTCTT 
GATCATGACC GGCCCGATGA CCAGCAGATC CAGGTGTATG CGCGCGAAGT GCGCGCCGCT
GGAAAAGCGG GCCGGGACCT TCCTTGGCTG TTGTTCCTGG GCGGCGGCCC CGGCGCGGCC
GCCCCCCGCC CACTCGGCGG CGAAGGCTGG CTACACCGGG CCTTGCAGGA CTACCGTGTG
CTGCTGCTGG ACCAGCGAGG CACCGGCAGA TCTACCCCCG TGGACCGCCG TTTCCTGGCC
CAGGTCGGCG ACCCGGACGC CCAGGCGCAC TACCTGTCGC ACTTCCGCGC CGACGCGATC
GTGCGGGATG CCGAGATTAT CCGCCGAACA CTCATCGGGG ACCGGCCGTG GAGCGTGCTG
GGGCAAAGTT TCGGTGGCCT GTGCACGGTC ACCTACCTCT CCTTTGCTCC GCAGGGGCTG
GCCGAGGCGT TCATCACCGG CGGGCTACCC GGCGTGCGTG CTACCGCCGA CGACGTCTAT
CGCGCGCTGT ATCCCCGGGT CGTCGCCAAG AACGCTGAAC ACTTCGACCG CTTCCCGGGC
GACGGCGAGC AAGCCCGTAC CGTCGCCCGG TACCTGCGGG ACCATCAGGT CGTCCTGCCG
AGCGGCAGGC CGCTGACCGT CGGGACCTTT CAGTCCCTGG GCAACCTCCT CGGCGGCAGC
GACGGCAGCC GCCGCCTGCA CTATCTCCTC GAAGACCCCT TCACCGGCGG GACCGAACCA
TCAGACGCCT TTCTGTCCGA GGTCGACTGG GAACTGTCCC GAATCGCCGG GGGACCGCTG
TACTCCCTGC TGCATGAGGC GACCTACGCC CAAGGGGAAG GCGCCACCCA CTGGTCGGCT
CAACGCATCC GGGGCGAGTT CCCCGCCTTC GACGCCACCG CAGCTCTCGA CTCGGGCGCG
CCCGTCCTGT TCACCGGCGA GATGATCTAC CCCTGGATGT TCGACACCGA CCCGGCGCTA
CGGCCGTTCC GCCAGGCCGC ACACCTGATC GCCGAACGTA CGACCTGGCC CGCCCTCTAT
GACACCGGCC GCCTGCGCGC CAACACCGTG CCCGTCGCCG CCGCCATCTA CTACGACGAC
ATGTACCTCG ACCGTGACCT GTCGATCAGC ACCGCCCAGA CCATCCACGG CCTGAGGCCC
TGGATCACCA ACGAATACCA ACACAACGGC CTACGCACCA GCAACGGAGC CGTCCTTGAC
CACCTCATCG CCCTGATACG CGAAACACCA TGA
 
Protein sequence
MTTSRRYGTV LTDHTFTVPL DHDRPDDQQI QVYAREVRAA GKAGRDLPWL LFLGGGPGAA 
APRPLGGEGW LHRALQDYRV LLLDQRGTGR STPVDRRFLA QVGDPDAQAH YLSHFRADAI
VRDAEIIRRT LIGDRPWSVL GQSFGGLCTV TYLSFAPQGL AEAFITGGLP GVRATADDVY
RALYPRVVAK NAEHFDRFPG DGEQARTVAR YLRDHQVVLP SGRPLTVGTF QSLGNLLGGS
DGSRRLHYLL EDPFTGGTEP SDAFLSEVDW ELSRIAGGPL YSLLHEATYA QGEGATHWSA
QRIRGEFPAF DATAALDSGA PVLFTGEMIY PWMFDTDPAL RPFRQAAHLI AERTTWPALY
DTGRLRANTV PVAAAIYYDD MYLDRDLSIS TAQTIHGLRP WITNEYQHNG LRTSNGAVLD
HLIALIRETP