Gene Sros_0871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0871 
Symbol 
ID8664143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp891897 
End bp895133 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content70% 
IMG OID 
ProductSubtilisin-like protein serine protease-like protein 
Protein accessionYP_003336624 
Protein GI271962428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAC GCAGGAAGAG ATCATTTCCC CTCACGCAAA GGCAGATCAT GAGTTTAGGT 
AAGTCGTTCC TCGCCGCCGC CATGGCAGGA ACGATCGTGG GCATCACACC GGCACCAGCG
CCGGCGCAGA CACCGCAGCC TCCCTCCCCG CCGGCGGACG GGGTGACCCT GATCACCGGT
GACCGTGTCG TGGTCACCGG GCACGGGCAC CGGGTGGAGC CCGGACCCGG CCGGCAGGAG
GTCGGCTTCA CGAGCCAGGT ACGTGAGAAG CACCTGTACG TGATTCCGTC CGACGCTCAG
CCCCTGGTCG CTCAGGGGGT GCTCGACCGG CGGCTGTTCG ACGTCACCCA ACTGCTGCAG
TGGCGGTACG GGGACGCCGA GATCCGCGAC ATCCCGCTGA TCACGCGGTC GGACGCGGGC
CCGGCCCCTG CGCTCAGGGG CGCGCAGGGT ACCCGGCGGC TCGCCGGCCT TGGCATGACC
ACGCTCCGCC TGCCCAAGAG CGACGCGGCT CGGGCGTGGA AGGAGATGAC GGGCGGCGGC
CGTACGCCGG CCGCGGGCAC GACGAAGATC TGGCTGGACG GCCGCCGGTC CTTCAGCCTC
GACCGGAGCA CCGAACAGAT CGGCGCCACC GAGGCGTGGA AGCAGGGGAT GACCGGCGAG
GGCGTCACGG TCGCCGTCCT CGACTCCGGC TACGACCCCG GCCATCCGGA CCTGAAGGGC
GTGGTGGCGC AGGAACGCAA CTTCAGCGAG GAGCCCGACA TCCGCGACAA CCACGGTCAC
GGCACCCATG TCGCCTCGAC CGTCGCGGGT AACGGCGAGA AGTACCGGGG AGTGGCTCCC
GGCGCACGGC TGGCCATCGG AAAGGTGGGC GACAGGTTCG GCGCCTCCGA GTCCGCCATC
CTGGCGGGCA TGGAGTGGGC CTCCCTTGAG GTCAAGGCCA AGGTCGTCAA CTTCAGCATG
GGTGCCCCCG ACCAGCCCGA GATCGACCCC GTGGAGCAGG CGGTGAACAC GCTGTCGGCG
GAGACGGGCA CGCTGTTCGT CGTCGCCGCG GGGAACGACG GCGGGAGGCG GCCGGTGAGC
AGCCCCGCCA GCGCCGACGC CGCCCTCGCG GTCGGCGCCG TCGACAGGCA GGATCGGGTG
GCCGGGTTCT CCAGCACCGG CCCCCGCGTA GGCGACCACG CCGTCAAACC GGACCTCACC
GCACCGGGGG TCGCCATCGT CGCGGCGGCG GCCGAGGGCA CCGCCGACGG CGCTCACGTC
GAGATGAGCG GCACCTCCAT GGCCGCACCG CACGTGGCCG GGGCTGCGGC CATCCTCGCC
CAGCGGCACC CCGGCTGGAC CGGGCAGCAG CTCAAGGCCG CTCTGGTCGG CAGCGCCGCT
CCCTCGTCCG GCGCCACGCC GTTCCAGCAG GGCACCGGAC GGGTGGACGT GGTCCGCGCC
CTGAAGCAGC AGGTCGTGGC CCAGACGGCG GGCACCTGGG CCGTCTTCCC CTGGGACGGT
CCGGACGGAC GCAAGAAGAC CGGGACCGTC ACCTACACCA ACTCCGGCGA CGCCCCGGTC
AGCCTCGACC TGACCGTCGA GGGAGAAGTG CTCGAACTCG GCACCCGGCG GCTCGACGTG
CCCGCCGGAG GACAGGCATC GGTCACGCTC AGCATCGACG CGAGCGGCAA GGCCCCCGGC
GACTACGCCG GGACGATCAC CGCCACCTCG GGCGACAGCG TGATCCGCAC CCTGGCGGGT
GCGTACGTCG AGCCCGAGTC CTACGACGTC ACCATCGCCG CCATCGGCAA GCAGGGCCAA
CCGGTCGACC CCTGGTCGGC TGAGATCTAT GACGCGAAGA CGGGGGCCGT CACCGAGCCG
TTCTTCCGGA ACGGCATGGC CACGGTACGG CTCCCCAAGG GCGACTGGGA CCTCTACACC
TGGATCGCAG AGAGGATCGA CGGGAAGCTG AACGTCACCG CCGCCAACTC CCCGCTGAAG
GTCGATGGAG GCAGCCGCCG GCTGACGGTG GACGCACGCC AGGGCAAGGC GACCAAGGTC
ACACTCGACG ATCCGACCGC CACGCCTCGG CGTGGTTTCG ATCTCGGGAT GGCCCACGGC
GCATGGAATT CGTGGTCGTC GACGAACATG GACGCCAACA CCGAGCTCTT CGTCGTGCCG
GTTCACCGGC CAGGCCTGAC CTACACGTTG AGAACCACGT GGCTGAGCAA GGACGTGTCT
CCCAGTCCCT ACGTCTACGA CCTCGTCGAC CGTCGCACCG ACGGCGTTCC CGAGAATCCC
GTCTACGACG CCAGGCAGAA GGATCTGGCG AAGGTCTCCG CGACCTACCG GGCCTCGGGA
GTGGCGGCCT TGGGGACGCC GATGGCCGGA CTACAGGTCG GGGGCTTCCT GGGCTCGTTC
CTGGCACCAC TGGTCGGTGA CATCCCCCTG CCCGGCACGC TCATCCACTA CCGGACCCCC
GGGCTGACCT ACGAAAGCGG ACTTCAGGTC GGCACCTCCC TGACCTTCGA CGGCGGCAAG
CTCATGAAGC GCGGGCAGAC CAGTGAAGTC TGGAACACCG CGGTCACCGG TCCGTCGTTC
CTGCTGCCCG GCGGCAGCCG TACCGGCGAC AAGCTGACCT TCTCCGCGGT GGGGCTGTTC
ACCGACGGGG GCCCGGGAAG AACGGGCTCG GACACCGCCG CCACCGGCAC CGCCACCCTC
GCCAGAGACG GGAAGGTGCT GGCCAAGGCC GACATCGCCG ACTGCGAGGT CTACCGGAGG
GAAGGGTGCG AGCTCCACGC CGACCTTCCC GCCGGGTCCG GCGCCTACAC GCTGACCGCG
TCGATGCGCA GGCAGGTTCC GCACTCGACA CTCTCCACCG GCGTGGAGTC CGTATGGAGG
TTCCGGTCCG CGACCACGGC GAAGGGGCTG CCACTGCCGC TGACGGCGGT CCGTTACAGC
CCCGCCGGCC TGGATGAGTC CAATCGCGCC AAGCCGGGCA GCGTGACCCG TCTTCCCCTG
TGGATCGAGC GCAACCCCGG CTCCACCGGG GCGGCGATCG AGTCGGTCCA GGTGGAGATG
TCCATCGACG ACGGGGCGAA GTGGCGTCGT ATCCCGATCG TCCGCACCGG CTCGGGCTGG
ACCGCCGCGC TGCCGAACCC GCGCACGCCC GGATTCGTCT CCCTCCGCGC GGTGGTGACC
GACACGGCGG GCACCGGCCT GACCCAGACG ATCACTCGCG CCTACGCCGT CGGCTGA
 
Protein sequence
MAERRKRSFP LTQRQIMSLG KSFLAAAMAG TIVGITPAPA PAQTPQPPSP PADGVTLITG 
DRVVVTGHGH RVEPGPGRQE VGFTSQVREK HLYVIPSDAQ PLVAQGVLDR RLFDVTQLLQ
WRYGDAEIRD IPLITRSDAG PAPALRGAQG TRRLAGLGMT TLRLPKSDAA RAWKEMTGGG
RTPAAGTTKI WLDGRRSFSL DRSTEQIGAT EAWKQGMTGE GVTVAVLDSG YDPGHPDLKG
VVAQERNFSE EPDIRDNHGH GTHVASTVAG NGEKYRGVAP GARLAIGKVG DRFGASESAI
LAGMEWASLE VKAKVVNFSM GAPDQPEIDP VEQAVNTLSA ETGTLFVVAA GNDGGRRPVS
SPASADAALA VGAVDRQDRV AGFSSTGPRV GDHAVKPDLT APGVAIVAAA AEGTADGAHV
EMSGTSMAAP HVAGAAAILA QRHPGWTGQQ LKAALVGSAA PSSGATPFQQ GTGRVDVVRA
LKQQVVAQTA GTWAVFPWDG PDGRKKTGTV TYTNSGDAPV SLDLTVEGEV LELGTRRLDV
PAGGQASVTL SIDASGKAPG DYAGTITATS GDSVIRTLAG AYVEPESYDV TIAAIGKQGQ
PVDPWSAEIY DAKTGAVTEP FFRNGMATVR LPKGDWDLYT WIAERIDGKL NVTAANSPLK
VDGGSRRLTV DARQGKATKV TLDDPTATPR RGFDLGMAHG AWNSWSSTNM DANTELFVVP
VHRPGLTYTL RTTWLSKDVS PSPYVYDLVD RRTDGVPENP VYDARQKDLA KVSATYRASG
VAALGTPMAG LQVGGFLGSF LAPLVGDIPL PGTLIHYRTP GLTYESGLQV GTSLTFDGGK
LMKRGQTSEV WNTAVTGPSF LLPGGSRTGD KLTFSAVGLF TDGGPGRTGS DTAATGTATL
ARDGKVLAKA DIADCEVYRR EGCELHADLP AGSGAYTLTA SMRRQVPHST LSTGVESVWR
FRSATTAKGL PLPLTAVRYS PAGLDESNRA KPGSVTRLPL WIERNPGSTG AAIESVQVEM
SIDDGAKWRR IPIVRTGSGW TAALPNPRTP GFVSLRAVVT DTAGTGLTQT ITRAYAVG