Gene Sros_8756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8756 
Symbol 
ID8672094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9660987 
End bp9664073 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content61% 
IMG OID 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_003344134 
Protein GI271969938 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.999301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGATG CACCCGTTCT GGGCCTGTAC GACTCTCTGC TCACCAGCCA GCTCGCTCGA 
CGCGTGGCTT TTTGGCGTAC GTCGGGTCAT CTGGTCGAAG TCTCCCAGGT GGACGGGGAA
GAGGTCGCTC ACCTGCTAGG CCGGTTCATC GGGGAGTCGG CAGCCCGGGC CATGGCTGCG
CTCAAGAACA CCGACGAGCA GGTAGAGCTG GCCAACCAGC TCTTGGCGCG GCTGGCCAAT
CCCGAACATA TCGAAGACGG CCCGACCAGG CTGTTGTCAG TCATCAAGGA CGCGCCTGGC
ACGCTTCCTC CCAAGCGGCC ATTGACCTCA CTATCAGAGG CAACGCTCCT TACTAATGCC
CGCGAGGATC CGAACCTGGC TCATGAGCTG GCAACTGAGT TGGCCAGCGC CGACCGGGTA
GACCTGTTGT GCGCGTTCGT CAAATGGTCC GGGCTCCGTG TCCTAGAGAA GCAGCTCGAT
GAGTTGCGCG ATCGCGGAGT CTCACTGCGC GTCATCACCA CCACCTACAT GGGCGCAACC
GAGCGGCGAG CGCTGGACCG GCTCGTGAAC GATTTCGGCG CCGAGGTCCG GATCAGCTAC
GAACAGAACT CAACTCGTCT GCACGCCAAA GCCTGGCTGT TACGCCGCCG TACCGGCTTC
GATACCGCAT ACATCGGTAG TTCAAATCTG TCGCGTGCCG CTCTGCTGGA CGGTTTGGAG
TGGAACGTCC GTTTGTCCTC AGTGACCACT CCTCGACTAC TGGACAAGTT CGAGGGCACC
TTCGACTCCT ATTGGAACAG GCAGCAGTTC GAGGCGTATG ACCCGGCGAC GGACAGCGAG
CGGCTCGATG AGGCTTTGTC TCGTTCGACA TCAGGTGAAC GGATCTTCGA CATCCCCGCG
CTGGTCCCCC ACCCCTTCCC ACACCAGCGG GAGATGCTGG GAGACCTCGA CGTTGAACGC
ACCGTACACG ACCGGCATCG CAACCTACTG GTGGCTGCCA CCGGAACTGG CAAGACGGTC
GTAGCCGCCT TTGACTACCG TAACCTGCAA GAGCGACTAG GCCGACAACC TTCTCTGCTG
TTTGTCGCGC ATCGCAAAGA GATCCTGCAG CAGGCCCTAC GCACCTACCG GCAGGTCCTC
GCGGCCCCCG ACTTCGGTGA ACTGCATGTC GGGGACGACC AGTCCCGTCA CTGGCGGCAT
GTGTTTGCCT CCGTCCAGTC CCTCAATTCA CGCGGCATCG ATATTTTTGC CGCGGATCAA
TTTAATGTCG TCGTTATAGA CGAGTTCCAC CATGCATCTG CCGTCACCTA TCGCCGGATC
ATCGACTACC TCAAGCCGAA AGAGCTTCTC GGTCTCACCG CCACCCCTGA ACGGGCCGAT
GGCACCTGGG TCCAAGACGA ATTTTTCGAC CGGCACATCA CCTCCGAGCT GCGCCTGTGG
GACGCGCTAG ACGCCGACCT GCTCTGCCCG TTCCATTATT TCGGGATCAA TGACGAGACC
GATCTCAGTC ACGTAGCCTG GTCACGCGGC GCTTACCTCG GTCGCGAACT CGATGAGGCC
CTTGCTGGCG ACAGCGACCG GGCACGCCTT GTCTTCAACG CACTGCTGGA CAAGGTCAGC
GACTTGCAAG CAGTCCGTGG CCTAGGTTTC TGCGTCTCGG TACGGCATGC GCATTTCATG
GCCGAGTTCT TCACCAAGGC CGGCCTGAAG TCGCTGGCCG TGGATGGATC GACCGACCCT
GCCGAACGCA GGGCGGCTCT GCTAGCCCTG CGCGACGGGA AGGTCACGTT CCTCTTCGCG
GTGGACCTGT TCAACGAGGG TCTGGATATC CCCGACGTGA ATACTCTGCT CCTGCTGCGT
CCGACTGAGA GCGCCACGGT GTTTCTCCAG CAGCTTGGAC GTGGGCTGCG TCGTACGCCT
AACAAGGATG TGCTGACCGT CCTCGATTTC GTTGGGCAGC ATCGCAAGGA GTACAGGTTC
GGCAACCGGT TCCATGCGCT AACAGGGTTC ACCCGAGGCC GCCTGAAGCA GGAAGTGGAT
AAGGACTTCC CGCTGCTACC ACCGGGGTGT CAGATTGTCC TGGACCGGGT TACCAAGGAT
CGCCTGATCG CCGAACTCCA AGTCCAGCTC GGCGCCACCG TCAGCACGCT CACCCAGGAG
ATCCGCTCCT GCGCGGAGAC GTCTCTTATC GACTACCTCG AAGCCTCCGG GCGCGATATC
CACGATGTCT ACCGCAACCG ACGCTACTGG ACCTCCCTGC TGCGCCGTGC TGGGATCATC
AAAAACGATG CTTCCCCCAT GGAGGAGATG CTCGGTCGTC GCGTGCGAGC CCTGCTCCAC
GTAGACGATC AGCAGCGAGC AGAGGCGTAC GTACGGCTGC TGCGGCCCGA CGGCCCGCTC
TACGCCCAGT GTTCCCCTCG CGACCAAGCC TTCGTTCGTA TGCTGTTCTT CTCCTTCTGG
CGTGACGGTG GCGGCTTCGC TACCTACGAC GAGGCTCTCG CCCAGCTACG TGCCGAGTCT
GCGCTACGCA AAGAGATTCG ACAGGTGATC ACTTACGGCG CAGAACGTCC CCGACACGTC
GCCAAATCTC TGCCTGAGCC ACTAAGCCAG GTGCCCCTTG CGGTCAACGC CCGGTATTCG
GCAGACGAGA TCCTCGCAGC TCTAGGCTGG GCCGCGCTGG GGGGCGCCAT GACATCGACT
ATGCGCGAAG GCGTCGCGTG GATCCCAGCC ACGCAGTGCG ATGCACTCTT CGTCACCTTG
CAAAAGAACG AGAAGGAGTT CTCGCCACAG ACCATGTACC GAGACTTCGC GCTCACCCCT
AACCTGTTCC ACTGGGAATC ACAACATCGC ACGAGCGCGC AGTCAACCAC TGGTCGCCGC
TACCAGTACC ACGAACGAGA CGGCAGCCAT GTGCTGCTGT TTACTCGAGA ACGTAAAGAA
GACGAGAACC GCCATCCTGA ACCATTCGTC TTCCACGGCA CGGCTAGGTA CGTGGAACAC
CGGGGAGAGA AACCCATGGC CGTCACGTGG CGGCTCGACG AAGAGATGCC CGCCGACCTG
TTCCGTCGCG CTGCTATCGC GGGGTGA
 
Protein sequence
MEDAPVLGLY DSLLTSQLAR RVAFWRTSGH LVEVSQVDGE EVAHLLGRFI GESAARAMAA 
LKNTDEQVEL ANQLLARLAN PEHIEDGPTR LLSVIKDAPG TLPPKRPLTS LSEATLLTNA
REDPNLAHEL ATELASADRV DLLCAFVKWS GLRVLEKQLD ELRDRGVSLR VITTTYMGAT
ERRALDRLVN DFGAEVRISY EQNSTRLHAK AWLLRRRTGF DTAYIGSSNL SRAALLDGLE
WNVRLSSVTT PRLLDKFEGT FDSYWNRQQF EAYDPATDSE RLDEALSRST SGERIFDIPA
LVPHPFPHQR EMLGDLDVER TVHDRHRNLL VAATGTGKTV VAAFDYRNLQ ERLGRQPSLL
FVAHRKEILQ QALRTYRQVL AAPDFGELHV GDDQSRHWRH VFASVQSLNS RGIDIFAADQ
FNVVVIDEFH HASAVTYRRI IDYLKPKELL GLTATPERAD GTWVQDEFFD RHITSELRLW
DALDADLLCP FHYFGINDET DLSHVAWSRG AYLGRELDEA LAGDSDRARL VFNALLDKVS
DLQAVRGLGF CVSVRHAHFM AEFFTKAGLK SLAVDGSTDP AERRAALLAL RDGKVTFLFA
VDLFNEGLDI PDVNTLLLLR PTESATVFLQ QLGRGLRRTP NKDVLTVLDF VGQHRKEYRF
GNRFHALTGF TRGRLKQEVD KDFPLLPPGC QIVLDRVTKD RLIAELQVQL GATVSTLTQE
IRSCAETSLI DYLEASGRDI HDVYRNRRYW TSLLRRAGII KNDASPMEEM LGRRVRALLH
VDDQQRAEAY VRLLRPDGPL YAQCSPRDQA FVRMLFFSFW RDGGGFATYD EALAQLRAES
ALRKEIRQVI TYGAERPRHV AKSLPEPLSQ VPLAVNARYS ADEILAALGW AALGGAMTST
MREGVAWIPA TQCDALFVTL QKNEKEFSPQ TMYRDFALTP NLFHWESQHR TSAQSTTGRR
YQYHERDGSH VLLFTRERKE DENRHPEPFV FHGTARYVEH RGEKPMAVTW RLDEEMPADL
FRRAAIAG