Gene Sros_7338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7338 
Symbol 
ID8670658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8097070 
End bp8099142 
Gene Length2073 bp 
Protein Length690 aa 
Translation table11 
GC content72% 
IMG OID 
ProductProlyl oligopeptidase 
Protein accessionYP_003342767 
Protein GI271968571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAC AGCCATACCC TCCAGCCAGC CGTGATGAAC TCGTCGAGGA CCTGCACGGC 
ACCCCCGTCC CCGACCCCTA CCGATGGCTG GAGGACCCCG ACGATCCGGC GGCCAAGACG
TGGCTGGCGG AGGAGGAGGC GCTTTTCGCG GCCGAGATGG GCGGGCTTCC GGGCCGGGAG
GCGTTCAAGG CGCGGATCGC CGAGCTGCTC AGGTCCGGCT CGATCGGCGC GCCGGTCTGG
CGCGGGGAGC GCGGGTTCTT CATGCGGCGC ACCCCGGAGC AGGAGCACGC GGTCCTCTAC
ACCGTCGCCC CCGACGGCAC CGAGCGGGCG CTGCTGGACC CGATGGCGCT CGACCCGACC
GGCCTCACCA CGCTCGACTC CTGGCAGCCC GACAAGGAGG GCCGCCTGCT CGCCTACCAG
ATCTCGGTGG GCGGCGACGA GGAGTCGAGC CTCTACCTGA TCGACGTCGT GCTGGGCGAG
CGGATCGAGG GGCCGATCGA CCGCTGCCGC TACTCCCCCG TGGCCTGGCT CCCCGGCGGG
GAGGCGTTCT ACTACGTGCG CAGGCTGCCC GCCACCGAGG TGCCCGACGG GGAGGAGCAG
TTCCACCGCC GGGTCTACCT GCACCGGATC GGCACCTCCA CCGAGGAGGA CGTCCTGATC
TTCGGTGACG GCATGGAGAA GACCAACTAC TACGGCGTCG CGGTCTCCCG CGACGGCCGC
TGGCTGCAGA TCTCCGCCTC CCGCGGCACG GCGCCCCGCA ACGACCTGTG GGTGGCCGAC
CTCGTGGCCT CCTCTCCCGA GTCGCCGGAG CTCGTCGTCG TCCAGGAGGA CGTCGACGCC
CAGAGCGCCG TGCACTTCGG CCGCGACGGC AGGCTCTACG TGTTCACCGA CCGCGACGCC
CCGCGCGGGC GGATCTGCGT GACCGATCCC TCCACCCCCC AGTTCGAGCA CTGGCGCGAC
CTGATCCCGC AGGACCCGGA GGCCGTGCTG TCGGACTTCG CGATCCTCGA CGACCTGGAC
CGGCCGGTCA TGCTGGTCGG CTGGACCCGC CACGCGATCA GCGAGATCTC CGTCCACGAC
CTGGTCACCG GCGAGCGCGT CGGGGAGGTG CCGACGCCCG GCCTGGGCAC GATCGGCGGC
ATCAGCGAGC GCCCCGAGGG CGGCCACGAG GCCTGGTTCG GCTACACCGA CAACACCACC
CCGCCGACCA TCCAGCGCTA CGACGCCCGC ACCGGTGAGA CCACCCTCTG GGCCTCCTCC
CCCGGCGCCG TCGAGGTGCC CGCCGTCGAG ACCGAGCAGG TGACCTACCG CTCGGCCGAC
GGCAGCGAGG TGCACATGCT GGTGATCTCC AAGCCCGGCG CCGAGGGCCC GCGCCCGACC
ATCCTGTACG GCTACGGCGG CTTCGGCATC TCGATGACCC CCGGCTACTC GGCGTCGATC
CTGAGCTGGG TCGAGGCGGG CGGCGCCTAC GCCATCGCCC AGCTGCGCGG CGGTGGCGAG
CAGGGCGAGG AGTGGCACCG GGCGGGAATG CTCGCCAACA AGCAGAACGT CTACGACGAC
CTGCACGCGG CGGCCGAGCA CCTGATCGCC ACCGGCGTCA CCACCACCTC CCGGCTGGCC
ATCTCCGGCG GCTCCAACGG CGGCCTCCTG GTCGGCGCGG CCCTGACGCA GCGCCCCGAC
CTGTACGCCG CGGTCGTCTG CTCGGCCCCG CTGCTCGACA TGGTCCGCTA CGAGCTGTTC
GGCCTCGGCG CGACCTGGAA CGTCGAGTAC GGCTCCGCCG AGAAGCCCGA CGAGTTCGCC
TGGCTGTACG CCTACTCGCC CTATCACCGG GTCCGCGAGG GCGTGTCGTA CCCGGCGACG
CTGTTCACCG TCTTCCAGTC CGACACCCGG GTCCACCCGC TGCATGCCTG GAAGATGTGC
GCCGCCCTCC AGCACGCCCA GTCCTCCGAC CGGCCGATCC TGCTCCGCAA CGAGACCGAG
GTCGGCCACG GCGCCCGCGC GGTGAGCAAG ACCGTCGAGC TCGCCGCCGA CCAGCTCACC
TTCCTCGCCC ATCACACGGG GCTGACGTCG TAG
 
Protein sequence
MTRQPYPPAS RDELVEDLHG TPVPDPYRWL EDPDDPAAKT WLAEEEALFA AEMGGLPGRE 
AFKARIAELL RSGSIGAPVW RGERGFFMRR TPEQEHAVLY TVAPDGTERA LLDPMALDPT
GLTTLDSWQP DKEGRLLAYQ ISVGGDEESS LYLIDVVLGE RIEGPIDRCR YSPVAWLPGG
EAFYYVRRLP ATEVPDGEEQ FHRRVYLHRI GTSTEEDVLI FGDGMEKTNY YGVAVSRDGR
WLQISASRGT APRNDLWVAD LVASSPESPE LVVVQEDVDA QSAVHFGRDG RLYVFTDRDA
PRGRICVTDP STPQFEHWRD LIPQDPEAVL SDFAILDDLD RPVMLVGWTR HAISEISVHD
LVTGERVGEV PTPGLGTIGG ISERPEGGHE AWFGYTDNTT PPTIQRYDAR TGETTLWASS
PGAVEVPAVE TEQVTYRSAD GSEVHMLVIS KPGAEGPRPT ILYGYGGFGI SMTPGYSASI
LSWVEAGGAY AIAQLRGGGE QGEEWHRAGM LANKQNVYDD LHAAAEHLIA TGVTTTSRLA
ISGGSNGGLL VGAALTQRPD LYAAVVCSAP LLDMVRYELF GLGATWNVEY GSAEKPDEFA
WLYAYSPYHR VREGVSYPAT LFTVFQSDTR VHPLHAWKMC AALQHAQSSD RPILLRNETE
VGHGARAVSK TVELAADQLT FLAHHTGLTS