Gene Sros_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1984 
Symbol 
ID8665266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2134376 
End bp2136094 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content69% 
IMG OID 
ProductTrypsin-like protein serine protease typically periplasmic containing C-terminal PDZ domain-like protein 
Protein accessionYP_003337715 
Protein GI271963519 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGCAC GATTCACCGC CGTGACCGGC CTGGTGCTGG CCGGTGTGCT GGCCCTGCCT 
GGTGTCGCCA TGGCAAAGCC GAAAGATCTG GATATTCCAG TAGGAACATG GCTGGCGGCC
AGGACGCATC CGGCCGTGCA GCTGACCTCG GTGTCCTACA CGGCCGACGT GGCCGTGCCG
ACTCCGGTGG CGAACCTGGA GGCGATCAGA GGGCTGACCC AGCAGGGGGT CGCCGCGGTC
CAGTCGGGCA AGATCTCGTC GGACGAGAAC AGCATCTACC GGTGGGTCGT CCAGCAGATG
GCCAAGGAGC CGGGGCGCTA CTTCTCGCCG GGGGCGCCGG AGCGGGTCGT GGAGGCGACG
GCCGGCGGCC TGTGCACGGG CTGGTGGGTC ACGCCGGACG GTTACATGGT GACGGCGGCC
CACTGCGTCG GCCAGGAGGA GAGCGAGCTG GCGCAGACCT TCGCCACCCA GGCCCTCACG
AAGATCAACG AGAAGGACGC CGCCGACCTG GTTGCCAGCC TCGGTGACAT CGCCTCCGAC
GACGAGATCG TCCGGACGGC CGCAAAGATC TTCCAGGTCT GGAACGCGGA CAACATCAAG
ATCCGCAACG TCCAGAGCTC CCTGTCCCTG CTGCAGAGCC TTCCCGGCGG CGGCGTCGAC
AAGACCGCCA AGGCGGTACC GATCGAGCTG GTGGCCAAGG GGACGGTCTA CCCGGGCAAG
GATGTCGCGA TCCTCAAGGC CAACGGGCAG AACAACCTGC CGACCGTCCC GCTGGGCCAG
GACTCGGACG TGCGGGTGGG CGACACCCTC TACATCAGCG GGTTCCCCGG CACGGTGACG
CAGACCTCGA TCTTCAACAT CGAGTCCAAG CTCGACCCGG CCTTCACCGA GGGCCCCTAC
AACGCCAGCC GTCAGACCCC CGAGGGCGTG CCGTACATCC AGACCCAGGC CCCGTCCTAC
CCGGGCAACT CGGGTGGTCC GGTGTTCAGC AAGGACGGCA ACGTCATCGG CATCCTGGTC
GGTGGCCTGA TCCAGCAGGA CGGCGGCTCC ACCGAGGGGG AGAGCTTCGT GCTGCCGGTC
AGCATCGTCA GGGAGAAGCT GAACGAGAAG AACATCAAGT CGGCCGAGTC GGTGACGACG
AAGGCCTACA ACGAGGCGCT CGACCTGTTC TTCAAGAACC ACTACTCCGA CGCGCTGCCC
AAGTTCCGTG AGGTCCAGGC GCTGCAGCCG AACCACCCGT ACGTCGCCAA GTACATCACC
GACTCCCAGC AGGCCATCAC CGCCGGCAAG GACGAGAGCT CCTCGTCGAT CCTGCCGTGG
GTGCTGTGGG GCGGCGGAGG CCTGCTCGTC CTGTTCGTGC TCGGCACGCT GGGCGCGGTG
CTCAGAGGCA AGCAGCGGTC GAAGGTCCCG CCGTCCTCCT TCCCGCCCGT GCCGTACGGC
GCGCAGCCCG GCTACCTGCC GCCGGGACAG GGCCAGTACG GCCACCCGGC CCAGCACGGC
CAGCCGTACG GCGTGCCGCA GCAGCAGCCC TACCCGCAGC GGCCGCCCTA TCCGCTCCCG
GCCGCACCCG AGGACACCCG GGCGGTCCAC CCGCAGTCGC CGTACGGGGC GCCGCAGCAG
AAGCCCGGCC CCGGGGCGAA CACCCGGATC GCCGGACTGG AGGCGGAGCT GGAACAGCTG
CGTCGCAACA TGGGGCAGCG CCCGCCCGAC CAGCGCTGA
 
Protein sequence
MWARFTAVTG LVLAGVLALP GVAMAKPKDL DIPVGTWLAA RTHPAVQLTS VSYTADVAVP 
TPVANLEAIR GLTQQGVAAV QSGKISSDEN SIYRWVVQQM AKEPGRYFSP GAPERVVEAT
AGGLCTGWWV TPDGYMVTAA HCVGQEESEL AQTFATQALT KINEKDAADL VASLGDIASD
DEIVRTAAKI FQVWNADNIK IRNVQSSLSL LQSLPGGGVD KTAKAVPIEL VAKGTVYPGK
DVAILKANGQ NNLPTVPLGQ DSDVRVGDTL YISGFPGTVT QTSIFNIESK LDPAFTEGPY
NASRQTPEGV PYIQTQAPSY PGNSGGPVFS KDGNVIGILV GGLIQQDGGS TEGESFVLPV
SIVREKLNEK NIKSAESVTT KAYNEALDLF FKNHYSDALP KFREVQALQP NHPYVAKYIT
DSQQAITAGK DESSSSILPW VLWGGGGLLV LFVLGTLGAV LRGKQRSKVP PSSFPPVPYG
AQPGYLPPGQ GQYGHPAQHG QPYGVPQQQP YPQRPPYPLP AAPEDTRAVH PQSPYGAPQQ
KPGPGANTRI AGLEAELEQL RRNMGQRPPD QR