Gene Sros_5286 details


Gene Information

Locus tag: Sros_5286
Symbol:
ID: 8668580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5800933
End bp: 5803239
Gene Length: 2307 bp
Protein Length: 768 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: M6 family metalloprotease domain protein
Protein accession: YP_003340797
Protein GI: 271966601
COG category:
COG ID:
TIGRFAM ID:
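
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5800933
end_bp = 5803239
gene_length_bp = 2307
protein_length_aa = 768


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Residue count for a CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(start_bp, end_bp))            # 2307
print(expected_protein_length(gene_length_bp))  # 768
```

Under those assumptions both values agree with the listed gene length (2307 bp) and protein length (768 aa).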


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.826763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.474282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTAAGC TCATCACCAT GCTCGCGGCC GGGCTGGTCG GGACGGCCCT GGGTGCCTCC 
GCGCTGCCCG GTCTCGCCTC GGCGGACCCG GGCCCCAAGC TTCCCGCCGC CAAGCCCGGT
GACGGCCGGC ATTCGGATGC GACCAGTCCG TTCGCGATCG AAGAGCAGTC TCTTCGGACC
AGAGCCATGA AGCAGGCGCT GTCCGCCCCC CAGGCACGCG CGCTCCCCTC AGGCAAGGTC
AAGGTCGGCG ACCGCTACGT CGAACTCGCG CTCGAACGCA AGGACAAGAT TTTCACCGTG
CTCGCCGAAT TCGGCGACAA AATCGACAAC ACGACCCTGC ACGGAGGAAC GATCCGCTAC
GGCGGCGTAC CCGGCCCGCT GCACAACCGG ATCCCCAAGC CCCAGCGGAG CTACGACAAC
CACACGCTCT GGCAGCCGGA CTTCAACCGG GCCTACTACC AGGACATCTA TTTCAACGGC
TCAAAGGGGG CCAACTCGCT CCGTAACTTC TACCGGCTCC AGTCCGCCGG CCGGTACGAC
TTCGACGGCT ACGTCTCCGA CTGGGTGAAG GTTCCCTACA ACGAGTCCCG TTACGGCACG
CCGCGCTGCG ACAGCGGCCT GGACTGCGAC CTCCCCCTGT TCGATTTCGT CAGGGACTCG
GCCAACGCCT GGTACGACGC CGAGCGCGCC AAGGGCCGCA GCGTCGAGGA CATCACGGCC
GAGCTCAAGT CCTACGACGT CTGGGACCGC TACGACCACG ACTTCGACGG CGACTTCGAC
GAGCCCGACG GCTACCTCGA CCGGTTCCAG GTGATCCACG CCGGCGTCGA CGAGACGTGG
GGCGGCGGCG CGCAGGGCGC AGACGCGCTG TGGGCGGTCA ACCACGACGC CTACTGGAAC
ACGCGCGGAA GCTCCGGCCC CGCAGGCAAC CTGCGAGGCG GCACCCAGAT CGGCGACACC
GGCGTCTGGG TCGGCAGGTT CCTGACCGCC GGCGAGAACA GCGGCGTCGG CCTGATCGCC
CACGAGTACG GCCACGACCT GGGCTTACCC GACCTGTACG ACGGTGGCGG GAGCAACAGC
GTCCAGTTCT GGTCGCTCAT GTCCAGTGCC TCGTACTTGA GCAGGAAGAA CGGGCAGAGC
GGCGAGTACC CCGGAGACCT GGACGCCTGG AGCAAACTGC GGCTCGGCTG GCTGGCCTAC
GACAGGGCCA AGGCCGCCAC GGCGTCCACG CACACCCTCG GCGTGAGCTC CTACAACACC
GCAGACCCGC AGGCCGTCAT CGTCGATCTG CCGCCGCGCC GGGTCGACAC CGAGTTGGTC
CAACCCGTCC AGGGCTCCCA CCAGTGGTGG AGCGGCAGGG GGGACTACCT CAACGAGACG
CTCACGCGCC AGATCGACCT CACCGGCGTG ACGTCGGCGG CCCTGAACGC CAGAGTCTGG
TATCAGATCG AGCAGGATTT CGACTATCTC TACGCCGAGG TCTCCGAGGA CGGCAAGGTC
TGGACACCGA TCGGCGGCAC CGTCGGCGGG CAGCCGATCC CGAGCGTCAA CGGCGTCCCC
GGTATCACCG GCCTCAGCGG CTGGACGGAC CTGAGCCTCC CACTGGACGC CTACGCGGGC
AAGAAGATCC AGTTCCGGTT CCGCTACTTC ACCGATACCA ACACCACCGA GAACGGCTTC
ATCGTCGACG CCATCACCGG CGCCGTCACC GACGACGCGG AGAACGGCGA CAACGGCTGG
ACCGCGGCGG GCTTCAGCCG CGTCGGCAAG ATCGCGACCA AGGAGCACCC CCGCTCCTAC
ATAGCCGAGA ACCGGCGCTA CACCGGCTAT GGCGCCTACC TCAGAACCGG TCCCTACAGC
GCCGGCTTCG CCAACAACCC CGCCCGGCGC GAGATCTATG AGCACTACCC CTACCGGGAG
GGCGTCCTCG TCTGGCTCTG GGACACCTAC TACACCGACA ACGCCACCCG TAACCACCCC
GGCGAGGGCA TGATCCTGCC GATCGACGCC CACCCGATCC CTCTGCTCTG GAAAGACAAC
AGCATGGTCA ACGACCGGAT CCAGGGCTTC GACGCCCCCT TCGGCCTGTC TCCCACCGGC
CGTTTCGCAC TGCACAGGGA CAGCGCCAAG ACGGTCTTCC CCTCGCTCCC CGCCACCCCG
GCCTTCAACG ACCGCAGCGG CGTTTACTGG TACAGCACCA ACCCCTTGCG CGGCGTCCGG
CCGCCCGACA GCAACACCGA GATCAAGGTG ACCCGGGAAG CCTCCGGGGG CCTTCGCACC
ACCATCACGG TGGGTCCCGC CTCCTGA
 
Protein sequence
MRKLITMLAA GLVGTALGAS ALPGLASADP GPKLPAAKPG DGRHSDATSP FAIEEQSLRT 
RAMKQALSAP QARALPSGKV KVGDRYVELA LERKDKIFTV LAEFGDKIDN TTLHGGTIRY
GGVPGPLHNR IPKPQRSYDN HTLWQPDFNR AYYQDIYFNG SKGANSLRNF YRLQSAGRYD
FDGYVSDWVK VPYNESRYGT PRCDSGLDCD LPLFDFVRDS ANAWYDAERA KGRSVEDITA
ELKSYDVWDR YDHDFDGDFD EPDGYLDRFQ VIHAGVDETW GGGAQGADAL WAVNHDAYWN
TRGSSGPAGN LRGGTQIGDT GVWVGRFLTA GENSGVGLIA HEYGHDLGLP DLYDGGGSNS
VQFWSLMSSA SYLSRKNGQS GEYPGDLDAW SKLRLGWLAY DRAKAATAST HTLGVSSYNT
ADPQAVIVDL PPRRVDTELV QPVQGSHQWW SGRGDYLNET LTRQIDLTGV TSAALNARVW
YQIEQDFDYL YAEVSEDGKV WTPIGGTVGG QPIPSVNGVP GITGLSGWTD LSLPLDAYAG
KKIQFRFRYF TDTNTTENGF IVDAITGAVT DDAENGDNGW TAAGFSRVGK IATKEHPRSY
IAENRRYTGY GAYLRTGPYS AGFANNPARR EIYEHYPYRE GVLVWLWDTY YTDNATRNHP
GEGMILPIDA HPIPLLWKDN SMVNDRIQGF DAPFGLSPTG RFALHRDSAK TVFPSLPATP
AFNDRSGVYW YSTNPLRGVR PPDSNTEIKV TREASGGLRT TITVGPAS
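
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the record. The following is a minimal sketch of that translation in plain Python. It assumes that table 11 uses the standard codon assignments and that an alternative start codon such as GTG in the first position is read as methionine; only three common start codons are handled here, and the helper name `translate_cds` is hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((x + y + z for x in BASES for y in BASES for z in BASES), AMINO_ACIDS)
)

# Assumption: a subset of the bacterial start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an initiating codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGTAAGC TCATCACCAT GCTCGCGGCC"))  # MRKLITMLAA
```

The first ten residues produced this way match the start of the listed protein sequence. The same function applied to the final 27 bases (`ACCATCACGGTGGGTCCCGCCTCCTGA`) gives `TITVGPAS`, where the internal GTG is read as valine rather than methionine.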