Gene Sros_4602 details

Gene Information

Locus tag: Sros_4602
Symbol:
ID: 8667896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5117119
End bp: 5121141
Gene Length: 4023 bp
Protein Length: 1340 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Subtilisin-like protein serine protease-like protein
Protein accession: YP_003340206
Protein GI: 271966010
COG category:
COG ID:
TIGRFAM ID:
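
The coordinate and length fields in this record can be cross-checked against each other. The short sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5117119
end_bp = 5121141
gene_length_bp = 4023
protein_length_aa = 1340


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should hold if the record is internally consistent.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks pass: the span from start to end is 4023 bp, which corresponds to 1341 codons, or 1340 amino acids plus one stop codon.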


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.823259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0573734
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCTGG GCACCCTGGT GGCGCCGGGC ACGGCCGCCT TCGCCCAGAA ACAGGAGCCC 
GCCCCCGCCG CCCCGGCGAC GGGACCGGTC ACCGGCCCGA TCGCGCTGAC CGGACTGGGC
CAGGGTGACC GCACGGTCAC CCTGATCACC GGGGACAAGG TGAGGTTGAG CGACGCCGGC
GGCGGCAAGT ACGCCGTGCG CCCCGAGGCG AGCGTCCGCC CCGACGGCAC CAGCGCCCAG
CTGTCCACGC TGGCCACCCC GAAGGGCGTG TACGTCACCC CCAGCGACGC CGTGCCCGCC
ATCAACGCCG GCCGCCTGGA CCGGGAGCTG TTCAACGTCA CCTACCTGGC CGAGAACGGC
TACACCGACG ACAAGACCAA GCAGCTGCCC GTCATCGTGC AGTACCCGCG TCAGCACACG
GACAAGGCCG TCGCCTCGGC CGCCAAGGCC ATCCCGGCCA GCGAGCCCGT GCTCACCCTG
GAGAGCGTCA ACGCCTCGGC GCTCCAGGTC ACCAAGGCCG AGGCGGGCAC GTTCTGGAAC
GCGGTGCGGG CCGTACCGGA CAAGAAGAGC GCGAACGGGA TCGCCGGGCC GGCCAACGTG
CCGGACACGC TGCGCGGCGA CATCGCCAAG GTGTGGCTGG ACGGCAAGGT GAAGGCCGAC
CTCGCCGAGA GCGTCCCGAT GATCGGCGCG CCGCAGGCGT GGGCCGGCGG CCACGACGGC
GCCGGCGTGA AGGTCGCTGT CCTGGACACC GGCGTCGACG CCAAGCACCC CGACCTGGCC
GACAGGATCG TCGACAGCAG GTCCTTCATC CCCGGCCAGG AGGTGCAGGA CGGCCACGGC
CACGGCACCC ACGTCGCCTC CACCATCGCC GGCTCCGGCG CGGCGGGGGG CGGCAAGCAC
AAGGGCGTCG CGCCCGGCGC CCAGCTGATC GTCGGCAAGG TGCTGGCCAA CGAGGGCTCC
GGCAGCGACT CCCAGATCAT CGAGGGCATG GAGTGGGCTG CGGCCTCCGG CGCCAAGGTG
ATCAGCATGA GCCTCGGCGG CGGCGCCTCC GACGGGACCG ACCCGATGAG CCAGGCGGTC
AACGTGCTCA GCGCCTCCAC CGGCGCGCTG TTCGTGATCG CCGCGGGCAA CGCCGGCGCG
TCGGGCGCCG AGACCGTCGC CACCCCCGGC ACGGCCGACG CCGCGCTGAC CGTGGCCGCC
GTCGACAAGA GCGACGCGTG GGCCACTTTC TCCAGCCAGG GACCGCGTGT CGGCGGTGGC
CTCAAGCCGG ACATCGCCGC TCCTGGCGTG GCCATCGCCG CTGCGCGCGC CGCCGGCACG
ACCATGGGCA CCCCGCTCGA CGAGCACTAC ACGGCCGCGA ACGGCACCTC GATGGCCACC
CCGCACGTCG CCGGCGCCGC CGCCATCATG ACGCAGCAGC ACCCGGACTG GACGGGCCCG
CAGATCAAGG CCGCGCTCAT GTCGACCGCC AAGGACGACG CGCTCAGCGT GTACAAGCAG
GGCGCCGGCC GGGTCGACGT GGCCAGGGCC TACACCCAGC AGGTCTTCGC CGTCACCACC
GGGGCCGACT TCGGCGCCGT CGAGAGCGAC GCCGCCCCCG TGACCCGGGA GCTGACCTAC
ACCAACCTGG GCGGCCGGCC GGTCACCCTG ACCCTGACCC CCGGCCTGCG CAAGTCGGAC
GGCAGCGCGG TCGAGGGCGG GCTGAGCATC GCCGAGACGA CGCTCACCGT GCCGGCCGGA
GGCACCGCGA CCACCACCGC CACCGTCGAC CCCAAGACCC TGGCCGACCT GGACAACTAC
ACCGGCGCCG TCACCGCGAC CGCCGACGGC GTCCAGCTGC GCATCCCCGT AGGCGTGGTG
CGTGAGGTGC CCAAGGCCAC GCTCACCATC CACACCCTGG GCCGCGACGG CAAGCCGCGC
AGTCCTTTGG CCCAGGACAC CATCGACGTG TCCGGCGACA AGGGCGTCCT CGGCGGTGTC
GCCCTGACCG CCGAAGGGAC CACGGTCACC CGCGTCCCGC AGGGTGTCAT CAGCGTGACG
CAGGTGCTGA GCTGGGTCGG CGACGACGAA AGGGGCAACC TGGCGTTCCT GTCCGTGCCC
GAGCTCACCG TCACCGGGGA CACCGAGATC ACGCTCGACG CCCGCAAGCT GACCGAGATC
CGATTCACCA CCCCGCAGCC GGCCGAACCG TTGAGCAACG TCCCCTACCT GGCGTATCAG
CGGACCGTGA GCAACGGCAC CCCCTACATG GGATTCACGT GGCCCGACCG CACCTGGTCC
CGGCTGTGGG CCCTGCCCAC GGAGAAGGTC ACCAAGGGCG CCTTCCGCTT CCACACCCGC
TTCACTCTCG GCAGGCCTGA GGTCGAAATG AGCATACGCG GGCGGGGCGG GCTCACCCTG
CACCCGGTCT CCGCGCTGCA CGGCATCAAC ACCTACGGCG TGCACGAGCA GTACGACGGC
TTCCCCGACT TCCGGCCGTT CACCGGCACC CGCGACCTGG AGGTCGTCGA CGTCGGCGAG
GGCAGGCCCG AGGACATCGC GGGCCGCGAC CTGCGCGGCA AGCTCGTCCT GATGGAAGCG
CCGATGTCAG AGGGTTTGTC CGGCCCCATG TGCGGAGTCC AGATCGAGCG GATCGGCCCG
ATCCGCGACG CCGGCGCGGC CGCGATCGCG TACTTCCCGC AGCCCGGGAC CGGTTGCGCG
ATCCCGCTGA GCATCACGCA GATCCCGTTC ACCGGAGAGG CCAAGCCCAT CGGCGTCCCC
ACCGTGTCCC TGCCCTCCCG TGAGGGGGTC GGCCTGCGCG ACCGGCTCGC CGGCGGCAAG
CCGGTCACCC TCCGGGTGAC CGGCACCGAG GAGTCGCCCT ACACCTACAC CTTCGCCCCC
TACGAAGAGG GCCGGGTGCC CCGCTCGCTG CACTACACCT TCGCCGAGCG CGACCTGGCA
CGGATCGACG TGGACACCCA CACCGTCGCC CCCGCGCGCT ACAACGACTG GCGGTACGCG
CAGAAGCCCG ACGACGTGAT GCCGATGTCG ACCTCCGTCT CCGCGTGGGG CGGCCCCCGG
CTCACCCTGC AGGAACGCCG CGACTGGGTC GGACCGCTCG ACTCCGGGGT TCTCTGGTCG
CACGGCATGG AGGAGATGCG CAGTACACCA GCTGATGCCC GGGTTCCGCA GTGGGCCCTG
GAGGTGTTCG ACAAGCCGGT CCGCACCCGG CAGTCCTGGC TGACCACCCC GTTCACGCCC
GGCGTCGCGA CCGGCTCCGA CAAGGTCTAC AAGCTCGCCA AGCCGGGCGC GAACCTCGGC
TGGTTCTTCC GGTGCATGCT CTGCGTCCAG GGCGACCGTC TGTGGGCGGA GTTCGAGCCC
AGCTACGGCT CGCCGGGCAC CAGGAAGTAC AACGGCGGCT ACTGGCCGAC GGACGACCTG
ACCAAGCCGG GCTTCGACGT CCGCCTCTGG CAGGACGGCA AGGAGATCCC GCGCACCAGC
ACCGGCGGCA CCACCATCCT GCCGGTCTTC ACCCTGCCGG AAGGGCCCGG CAGCTACCGG
CTGACCGCCA AGAACGACCG GCACGACGCC GAATGGACCT TCACCGCCCC GGTCAAGGCC
GAGCGACTGC CCGGATCCTT CTGCTCCCTC GAAGCGCTGT ACGGCACCGC GGAGCCCTGC
AAGCCCGCCC CGGTCGTGTT CGTCAGCTAC GACCTGGGCG ACACCCTGGA CGCCGCCAAC
AGCGTCCGCG CGGGCCGTAC GCACACCTTC ACCGTCTCCC CCTACCACTC CCCCTCCGCC
TCCAAGATGC CTGACATCGC CGGCCTGAAG CTGTGGGCCA GCACCGACGA CGGCGCCACC
TACACCCCCG TCTCCGTCAA GCGCGACAAG GACGGGACCT ACACCGCCAC CACCCGCTAC
CCCGCCCTCC AGCAGACCAA GGGTGCCGTG ACCCTCAAGG TCGAGGCCTG GGACAAGGCG
GGCAACACCG TCAAGCAGAC CACCGTCCGC GCCTTCAACC TCCGCGGCCA CGCCGCCGGA
TAA
 
Protein sequence
MVLGTLVAPG TAAFAQKQEP APAAPATGPV TGPIALTGLG QGDRTVTLIT GDKVRLSDAG 
GGKYAVRPEA SVRPDGTSAQ LSTLATPKGV YVTPSDAVPA INAGRLDREL FNVTYLAENG
YTDDKTKQLP VIVQYPRQHT DKAVASAAKA IPASEPVLTL ESVNASALQV TKAEAGTFWN
AVRAVPDKKS ANGIAGPANV PDTLRGDIAK VWLDGKVKAD LAESVPMIGA PQAWAGGHDG
AGVKVAVLDT GVDAKHPDLA DRIVDSRSFI PGQEVQDGHG HGTHVASTIA GSGAAGGGKH
KGVAPGAQLI VGKVLANEGS GSDSQIIEGM EWAAASGAKV ISMSLGGGAS DGTDPMSQAV
NVLSASTGAL FVIAAGNAGA SGAETVATPG TADAALTVAA VDKSDAWATF SSQGPRVGGG
LKPDIAAPGV AIAAARAAGT TMGTPLDEHY TAANGTSMAT PHVAGAAAIM TQQHPDWTGP
QIKAALMSTA KDDALSVYKQ GAGRVDVARA YTQQVFAVTT GADFGAVESD AAPVTRELTY
TNLGGRPVTL TLTPGLRKSD GSAVEGGLSI AETTLTVPAG GTATTTATVD PKTLADLDNY
TGAVTATADG VQLRIPVGVV REVPKATLTI HTLGRDGKPR SPLAQDTIDV SGDKGVLGGV
ALTAEGTTVT RVPQGVISVT QVLSWVGDDE RGNLAFLSVP ELTVTGDTEI TLDARKLTEI
RFTTPQPAEP LSNVPYLAYQ RTVSNGTPYM GFTWPDRTWS RLWALPTEKV TKGAFRFHTR
FTLGRPEVEM SIRGRGGLTL HPVSALHGIN TYGVHEQYDG FPDFRPFTGT RDLEVVDVGE
GRPEDIAGRD LRGKLVLMEA PMSEGLSGPM CGVQIERIGP IRDAGAAAIA YFPQPGTGCA
IPLSITQIPF TGEAKPIGVP TVSLPSREGV GLRDRLAGGK PVTLRVTGTE ESPYTYTFAP
YEEGRVPRSL HYTFAERDLA RIDVDTHTVA PARYNDWRYA QKPDDVMPMS TSVSAWGGPR
LTLQERRDWV GPLDSGVLWS HGMEEMRSTP ADARVPQWAL EVFDKPVRTR QSWLTTPFTP
GVATGSDKVY KLAKPGANLG WFFRCMLCVQ GDRLWAEFEP SYGSPGTRKY NGGYWPTDDL
TKPGFDVRLW QDGKEIPRTS TGGTTILPVF TLPEGPGSYR LTAKNDRHDA EWTFTAPVKA
ERLPGSFCSL EALYGTAEPC KPAPVVFVSY DLGDTLDAAN SVRAGRTHTF TVSPYHSPSA
SKMPDIAGLK LWASTDDGAT YTPVSVKRDK DGTYTATTRY PALQQTKGAV TLKVEAWDKA
GNTVKQTTVR AFNLRGHAAG