Gene Information |
Locus tag | Sros_8756 |
Symbol | |
ID | 8672094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 9660987 |
End bp | 9664073 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_003344134 |
Protein GI | 271969938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
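The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is only a sketch: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the record for Sros_8756.
length_bp = gene_length(9660987, 9664073)
print(length_bp)                  # 3087
print(protein_length(length_bp))  # 1028
```

Both results match the Gene Length (3087 bp) and Protein Length (1028 aa) fields listed in the record.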
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.999301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGATG CACCCGTTCT GGGCCTGTAC GACTCTCTGC TCACCAGCCA GCTCGCTCGA CGCGTGGCTT TTTGGCGTAC GTCGGGTCAT CTGGTCGAAG TCTCCCAGGT GGACGGGGAA GAGGTCGCTC ACCTGCTAGG CCGGTTCATC GGGGAGTCGG CAGCCCGGGC CATGGCTGCG CTCAAGAACA CCGACGAGCA GGTAGAGCTG GCCAACCAGC TCTTGGCGCG GCTGGCCAAT CCCGAACATA TCGAAGACGG CCCGACCAGG CTGTTGTCAG TCATCAAGGA CGCGCCTGGC ACGCTTCCTC CCAAGCGGCC ATTGACCTCA CTATCAGAGG CAACGCTCCT TACTAATGCC CGCGAGGATC CGAACCTGGC TCATGAGCTG GCAACTGAGT TGGCCAGCGC CGACCGGGTA GACCTGTTGT GCGCGTTCGT CAAATGGTCC GGGCTCCGTG TCCTAGAGAA GCAGCTCGAT GAGTTGCGCG ATCGCGGAGT CTCACTGCGC GTCATCACCA CCACCTACAT GGGCGCAACC GAGCGGCGAG CGCTGGACCG GCTCGTGAAC GATTTCGGCG CCGAGGTCCG GATCAGCTAC GAACAGAACT CAACTCGTCT GCACGCCAAA GCCTGGCTGT TACGCCGCCG TACCGGCTTC GATACCGCAT ACATCGGTAG TTCAAATCTG TCGCGTGCCG CTCTGCTGGA CGGTTTGGAG TGGAACGTCC GTTTGTCCTC AGTGACCACT CCTCGACTAC TGGACAAGTT CGAGGGCACC TTCGACTCCT ATTGGAACAG GCAGCAGTTC GAGGCGTATG ACCCGGCGAC GGACAGCGAG CGGCTCGATG AGGCTTTGTC TCGTTCGACA TCAGGTGAAC GGATCTTCGA CATCCCCGCG CTGGTCCCCC ACCCCTTCCC ACACCAGCGG GAGATGCTGG GAGACCTCGA CGTTGAACGC ACCGTACACG ACCGGCATCG CAACCTACTG GTGGCTGCCA CCGGAACTGG CAAGACGGTC GTAGCCGCCT TTGACTACCG TAACCTGCAA GAGCGACTAG GCCGACAACC TTCTCTGCTG TTTGTCGCGC ATCGCAAAGA GATCCTGCAG CAGGCCCTAC GCACCTACCG GCAGGTCCTC GCGGCCCCCG ACTTCGGTGA ACTGCATGTC GGGGACGACC AGTCCCGTCA CTGGCGGCAT GTGTTTGCCT CCGTCCAGTC CCTCAATTCA CGCGGCATCG ATATTTTTGC CGCGGATCAA TTTAATGTCG TCGTTATAGA CGAGTTCCAC CATGCATCTG CCGTCACCTA TCGCCGGATC ATCGACTACC TCAAGCCGAA AGAGCTTCTC GGTCTCACCG CCACCCCTGA ACGGGCCGAT GGCACCTGGG TCCAAGACGA ATTTTTCGAC CGGCACATCA CCTCCGAGCT GCGCCTGTGG GACGCGCTAG ACGCCGACCT GCTCTGCCCG TTCCATTATT TCGGGATCAA TGACGAGACC GATCTCAGTC ACGTAGCCTG GTCACGCGGC GCTTACCTCG GTCGCGAACT CGATGAGGCC CTTGCTGGCG ACAGCGACCG GGCACGCCTT GTCTTCAACG CACTGCTGGA CAAGGTCAGC GACTTGCAAG CAGTCCGTGG CCTAGGTTTC TGCGTCTCGG TACGGCATGC GCATTTCATG GCCGAGTTCT TCACCAAGGC CGGCCTGAAG TCGCTGGCCG TGGATGGATC GACCGACCCT GCCGAACGCA GGGCGGCTCT GCTAGCCCTG CGCGACGGGA AGGTCACGTT CCTCTTCGCG 
GTGGACCTGT TCAACGAGGG TCTGGATATC CCCGACGTGA ATACTCTGCT CCTGCTGCGT CCGACTGAGA GCGCCACGGT GTTTCTCCAG CAGCTTGGAC GTGGGCTGCG TCGTACGCCT AACAAGGATG TGCTGACCGT CCTCGATTTC GTTGGGCAGC ATCGCAAGGA GTACAGGTTC GGCAACCGGT TCCATGCGCT AACAGGGTTC ACCCGAGGCC GCCTGAAGCA GGAAGTGGAT AAGGACTTCC CGCTGCTACC ACCGGGGTGT CAGATTGTCC TGGACCGGGT TACCAAGGAT CGCCTGATCG CCGAACTCCA AGTCCAGCTC GGCGCCACCG TCAGCACGCT CACCCAGGAG ATCCGCTCCT GCGCGGAGAC GTCTCTTATC GACTACCTCG AAGCCTCCGG GCGCGATATC CACGATGTCT ACCGCAACCG ACGCTACTGG ACCTCCCTGC TGCGCCGTGC TGGGATCATC AAAAACGATG CTTCCCCCAT GGAGGAGATG CTCGGTCGTC GCGTGCGAGC CCTGCTCCAC GTAGACGATC AGCAGCGAGC AGAGGCGTAC GTACGGCTGC TGCGGCCCGA CGGCCCGCTC TACGCCCAGT GTTCCCCTCG CGACCAAGCC TTCGTTCGTA TGCTGTTCTT CTCCTTCTGG CGTGACGGTG GCGGCTTCGC TACCTACGAC GAGGCTCTCG CCCAGCTACG TGCCGAGTCT GCGCTACGCA AAGAGATTCG ACAGGTGATC ACTTACGGCG CAGAACGTCC CCGACACGTC GCCAAATCTC TGCCTGAGCC ACTAAGCCAG GTGCCCCTTG CGGTCAACGC CCGGTATTCG GCAGACGAGA TCCTCGCAGC TCTAGGCTGG GCCGCGCTGG GGGGCGCCAT GACATCGACT ATGCGCGAAG GCGTCGCGTG GATCCCAGCC ACGCAGTGCG ATGCACTCTT CGTCACCTTG CAAAAGAACG AGAAGGAGTT CTCGCCACAG ACCATGTACC GAGACTTCGC GCTCACCCCT AACCTGTTCC ACTGGGAATC ACAACATCGC ACGAGCGCGC AGTCAACCAC TGGTCGCCGC TACCAGTACC ACGAACGAGA CGGCAGCCAT GTGCTGCTGT TTACTCGAGA ACGTAAAGAA GACGAGAACC GCCATCCTGA ACCATTCGTC TTCCACGGCA CGGCTAGGTA CGTGGAACAC CGGGGAGAGA AACCCATGGC CGTCACGTGG CGGCTCGACG AAGAGATGCC CGCCGACCTG TTCCGTCGCG CTGCTATCGC GGGGTGA
|
Protein sequence | MEDAPVLGLY DSLLTSQLAR RVAFWRTSGH LVEVSQVDGE EVAHLLGRFI GESAARAMAA LKNTDEQVEL ANQLLARLAN PEHIEDGPTR LLSVIKDAPG TLPPKRPLTS LSEATLLTNA REDPNLAHEL ATELASADRV DLLCAFVKWS GLRVLEKQLD ELRDRGVSLR VITTTYMGAT ERRALDRLVN DFGAEVRISY EQNSTRLHAK AWLLRRRTGF DTAYIGSSNL SRAALLDGLE WNVRLSSVTT PRLLDKFEGT FDSYWNRQQF EAYDPATDSE RLDEALSRST SGERIFDIPA LVPHPFPHQR EMLGDLDVER TVHDRHRNLL VAATGTGKTV VAAFDYRNLQ ERLGRQPSLL FVAHRKEILQ QALRTYRQVL AAPDFGELHV GDDQSRHWRH VFASVQSLNS RGIDIFAADQ FNVVVIDEFH HASAVTYRRI IDYLKPKELL GLTATPERAD GTWVQDEFFD RHITSELRLW DALDADLLCP FHYFGINDET DLSHVAWSRG AYLGRELDEA LAGDSDRARL VFNALLDKVS DLQAVRGLGF CVSVRHAHFM AEFFTKAGLK SLAVDGSTDP AERRAALLAL RDGKVTFLFA VDLFNEGLDI PDVNTLLLLR PTESATVFLQ QLGRGLRRTP NKDVLTVLDF VGQHRKEYRF GNRFHALTGF TRGRLKQEVD KDFPLLPPGC QIVLDRVTKD RLIAELQVQL GATVSTLTQE IRSCAETSLI DYLEASGRDI HDVYRNRRYW TSLLRRAGII KNDASPMEEM LGRRVRALLH VDDQQRAEAY VRLLRPDGPL YAQCSPRDQA FVRMLFFSFW RDGGGFATYD EALAQLRAES ALRKEIRQVI TYGAERPRHV AKSLPEPLSQ VPLAVNARYS ADEILAALGW AALGGAMTST MREGVAWIPA TQCDALFVTL QKNEKEFSPQ TMYRDFALTP NLFHWESQHR TSAQSTTGRR YQYHERDGSH VLLFTRERKE DENRHPEPFV FHGTARYVEH RGEKPMAVTW RLDEEMPADL FRRAAIAG
|
| |
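The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied as a plain string in the space-separated 10-base layout used above, and that GC content is simply the fraction of G and C bases. The function name is hypothetical, and the database may compute or round its 61% figure differently.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) used for layout is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

To apply it to this record, pass the full Gene sequence string and compare the rounded percentage with the GC content field.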