Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_1984 |
Symbol | |
ID | 8665266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 2134376 |
End bp | 2136094 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Trypsin-like protein serine protease typically periplasmic containing C-terminal PDZ domain-like protein |
Protein accession | YP_003337715 |
Protein GI | 271963519 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGCAC GATTCACCGC CGTGACCGGC CTGGTGCTGG CCGGTGTGCT GGCCCTGCCT GGTGTCGCCA TGGCAAAGCC GAAAGATCTG GATATTCCAG TAGGAACATG GCTGGCGGCC AGGACGCATC CGGCCGTGCA GCTGACCTCG GTGTCCTACA CGGCCGACGT GGCCGTGCCG ACTCCGGTGG CGAACCTGGA GGCGATCAGA GGGCTGACCC AGCAGGGGGT CGCCGCGGTC CAGTCGGGCA AGATCTCGTC GGACGAGAAC AGCATCTACC GGTGGGTCGT CCAGCAGATG GCCAAGGAGC CGGGGCGCTA CTTCTCGCCG GGGGCGCCGG AGCGGGTCGT GGAGGCGACG GCCGGCGGCC TGTGCACGGG CTGGTGGGTC ACGCCGGACG GTTACATGGT GACGGCGGCC CACTGCGTCG GCCAGGAGGA GAGCGAGCTG GCGCAGACCT TCGCCACCCA GGCCCTCACG AAGATCAACG AGAAGGACGC CGCCGACCTG GTTGCCAGCC TCGGTGACAT CGCCTCCGAC GACGAGATCG TCCGGACGGC CGCAAAGATC TTCCAGGTCT GGAACGCGGA CAACATCAAG ATCCGCAACG TCCAGAGCTC CCTGTCCCTG CTGCAGAGCC TTCCCGGCGG CGGCGTCGAC AAGACCGCCA AGGCGGTACC GATCGAGCTG GTGGCCAAGG GGACGGTCTA CCCGGGCAAG GATGTCGCGA TCCTCAAGGC CAACGGGCAG AACAACCTGC CGACCGTCCC GCTGGGCCAG GACTCGGACG TGCGGGTGGG CGACACCCTC TACATCAGCG GGTTCCCCGG CACGGTGACG CAGACCTCGA TCTTCAACAT CGAGTCCAAG CTCGACCCGG CCTTCACCGA GGGCCCCTAC AACGCCAGCC GTCAGACCCC CGAGGGCGTG CCGTACATCC AGACCCAGGC CCCGTCCTAC CCGGGCAACT CGGGTGGTCC GGTGTTCAGC AAGGACGGCA ACGTCATCGG CATCCTGGTC GGTGGCCTGA TCCAGCAGGA CGGCGGCTCC ACCGAGGGGG AGAGCTTCGT GCTGCCGGTC AGCATCGTCA GGGAGAAGCT GAACGAGAAG AACATCAAGT CGGCCGAGTC GGTGACGACG AAGGCCTACA ACGAGGCGCT CGACCTGTTC TTCAAGAACC ACTACTCCGA CGCGCTGCCC AAGTTCCGTG AGGTCCAGGC GCTGCAGCCG AACCACCCGT ACGTCGCCAA GTACATCACC GACTCCCAGC AGGCCATCAC CGCCGGCAAG GACGAGAGCT CCTCGTCGAT CCTGCCGTGG GTGCTGTGGG GCGGCGGAGG CCTGCTCGTC CTGTTCGTGC TCGGCACGCT GGGCGCGGTG CTCAGAGGCA AGCAGCGGTC GAAGGTCCCG CCGTCCTCCT TCCCGCCCGT GCCGTACGGC GCGCAGCCCG GCTACCTGCC GCCGGGACAG GGCCAGTACG GCCACCCGGC CCAGCACGGC CAGCCGTACG GCGTGCCGCA GCAGCAGCCC TACCCGCAGC GGCCGCCCTA TCCGCTCCCG GCCGCACCCG AGGACACCCG GGCGGTCCAC CCGCAGTCGC CGTACGGGGC GCCGCAGCAG AAGCCCGGCC CCGGGGCGAA CACCCGGATC GCCGGACTGG AGGCGGAGCT GGAACAGCTG CGTCGCAACA TGGGGCAGCG CCCGCCCGAC CAGCGCTGA
|
Protein sequence | MWARFTAVTG LVLAGVLALP GVAMAKPKDL DIPVGTWLAA RTHPAVQLTS VSYTADVAVP TPVANLEAIR GLTQQGVAAV QSGKISSDEN SIYRWVVQQM AKEPGRYFSP GAPERVVEAT AGGLCTGWWV TPDGYMVTAA HCVGQEESEL AQTFATQALT KINEKDAADL VASLGDIASD DEIVRTAAKI FQVWNADNIK IRNVQSSLSL LQSLPGGGVD KTAKAVPIEL VAKGTVYPGK DVAILKANGQ NNLPTVPLGQ DSDVRVGDTL YISGFPGTVT QTSIFNIESK LDPAFTEGPY NASRQTPEGV PYIQTQAPSY PGNSGGPVFS KDGNVIGILV GGLIQQDGGS TEGESFVLPV SIVREKLNEK NIKSAESVTT KAYNEALDLF FKNHYSDALP KFREVQALQP NHPYVAKYIT DSQQAITAGK DESSSSILPW VLWGGGGLLV LFVLGTLGAV LRGKQRSKVP PSSFPPVPYG AQPGYLPPGQ GQYGHPAQHG QPYGVPQQQP YPQRPPYPLP AAPEDTRAVH PQSPYGAPQQ KPGPGANTRI AGLEAELEQL RRNMGQRPPD QR
|
| |