Gene Information |
Locus tag | Sros_7831 |
Symbol | |
ID | 8671154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 8626853 |
End bp | 8630929 |
Gene Length | 4077 bp |
Protein Length | 1358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | putative type II DNA modification enzyme |
Protein accession | YP_003343237 |
Protein GI | 271969041 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGGCA CCTCCTACCT GGCGGTCGAG GTCGCGGGCG GCCTGATCCC GCAGGACGTG CTGTCCCGGA TCGGCGCCGC CGACCGCGAG CTGCCCGGCA TCCGTCCCGA GGACTATCAC CTTGCCGCCT CCGAACGGCT GGGCGACGCG GCGAGCCGTA GATGGGACTA CCTGCTCGGC GCCTACCGCG CGTTCCGCGA GCGCGTGGAC GGGCTGCCGG ACGGCGACCC CGCCACGACG CCGACCCGAG AACGCTGGCT GCAGGTGCTC CTGAGCGAGC TGGGCTTCGG GCACGTGCCG TACATCCGTG GCGGATTGAA AGCGGGGGAC AAGGAGTTCC CCGTCTCCCA CCTGTGGCAC AACGTGCCCA TGCACCTGCT CGGCTGGAAC ACCAGGCTGG ACAGGGCCAC ACCCGGCCGC GCCGAGACCA AGCGGGCACC GCAGTCGATG CTGCAGGAGT TTCTCAACCT CTCCGACGAC CACCTGTGGG GCGTGCTCTC CAACGGCCGG CAGCTCCGCA TCCTGCGCGA CTCCACCGCC CTGGTCGGCT CGGCGTATGT CGAGTTCGAC CTCGAAGCCA TCTTCGACGG CGAGCTCTAC TCGGAGTTCG TGCTGCTCTA CACGCTGCTG CACGCCTCCC GCTTCGAGCT CATCTCCGGT GACGACTCGA CGCCCACGTG CGCCGACTGC TGGCTGGAGA AATGGCGCGC CTTCGCCGCC GAGACCGGCC TGCGGGCCCG CGACCAGCTC CGCGACGGCG TGCAGAAGGC CCTCGAAGCG CTCGGCACCG GCTTCCTCGT CGCGAACCCG GGGCTGCGCG ACCAGCTCGT CTCCGGCGCC CTCAAGCGGG AGGACTTCCA TCACGAGCTG CTCCGCCTGG CCTACCAGCT GATCTTCCTC TTCGTCGCCG AGGACCGCGG CGCCCTGCTC ACCCCGGACG CGGCGCAGGA CGCCAGGGAC CGCTACGCCA CCTACTTCTC CACCCGGCGC CTGCGTAGCC TTGCCGCCCG CCGCGCCGGA GACCGCCACA CCGACCTGTG GAAGACCACC GCCAGGGTCA TCAGGGCCCT CGGAGACGAC GACGGCCTGC CCATGCTCGG CCTGCCCGGC CTCGGCGGTC TGTTCTTCCG TACGGCGGGC ACCCTGGCGG AGCTGACCGA CACGCCCGTG CCCGACCAGT TCCTCGTCTG CGACCTGCCC AACGAGGCGC TGCTCACCGC CGTCCGCCAC ATGTCCACCG TGCGCGGCAA GGACGGCCGG CCGCGCGACG TGGACTTCCA GCACCTGGGA GCCGAGGAGC TGGGCAGCGT CTACGAGTCG CTGCTGGAGC TGGTCCCGCA CCCGGATCTC GCGGTGCCGA CGTTCGAGCT GAAGACCGTG GCGGGCAACG ACCGCAAGAC CACCGGGTCC TACTACACGC CCTCGTCACT GATCGAGTCG CTGCTCGACA CCGCGCTCGA CCCGGTCATC GACGAGCACG CGAAGTCCGG TGTCGCCGAT GACCTGTTGA AGATCACCGT CTGCGACCCG GCCTGCGGCT CCGGTCACTT CCTCGTCGCC GCCGCCCGGC GCATCGCCAA GCGGTATGCC GCGATGGTCA CCGGCGAGGC CGAGCCGGTG CCGTCGGCGG TCCAGAAGGC GATGCACAAG GTCGTCGGGA CGTGCATCTA CGGCGTGGAC ATCCAGCCGC TCGCCGCCGA GCTGGCCAAG TTCTCCCTCT GGATGGAGTC CCTGGAGCCG GGCAAGCCGC TCGCCTTCCT CGACGCCCAC ATCAAGGTGG GCAACTCTCT GCTGGGCACC 
ACGCCCAGGC TGCTGGACGA CGGCATCCCG GATGAGGCGT TCAAGGCCAT CGAGGGAGAC GACAGGAAGA TCGTCGCGTC GTTGAAGAAG GACAACGCCA GGCAGCGCCG CAACCAGGGC ATGCTCTTCA CGGAGACGGG CACTCGTCTC GGCAACAAGG AACTCGCCGG AGAGGTTCAG GCCCTGGCCG CCGTCCCGGT GTCGACCGTG GCCGACGTCC GGGAGCAGGA ACGGAGGTTC CGCGAGTTCG AGCGGTCCGA CGCGCTGACC CACGCCCGCC TGGTCGCCGA CGCCTGGTGC GCGGCATTCG TCTGGCGCAA GCACGCCGAC GCCCCGCCCG CCATCACGAC CGACACCGTA CGGCGGCTGC AGGAGGGAGG CGGCCTTCCC TCGGCTGTGG CCCTGGAGCT CGACCGGCTC GTGGAGCGAT ACCGCTTCTT CCACTGGCAT CTGGAGTTTC CCGACGTCTT CGACGCCGAA GCCGGGGCCG GCTCCGACGC CAACGCGGCG ACCGGCTGGA GGGGTGGTTT CACCTGCGTG CTCGGCAACC CACCGTGGGA GCGGGTCAAG CTTCAGGAGA AGGAGTTCTT CGCCGCCAGA CACGAGGGCA TCGCGAACGC CAAGAACGCC GCCGCCCGGA AGAAGGCCAT CGCGGCCCTG GCCACCAGCG CGGAGGAGAC CGACCGGTGG CTGTTCGCCG AGTTCGCGGG GGAACTTCGC AGCGCCGACG GCTGGACTCA CCTGCTCCGC GAGTCGGGAC GGTATCCGCT GACCGGGCGG GGTGACATCA ACACTTACGC CGTCTTCGCC GAGACGGGGC GGACGCTCCT CGCGCCGAGC GGCCGGGTCG GCATGGTCCT GCCCACCGGT ATCGCGACCG ACGCGACCAC GCAGTTCTTC TTCAAGGACA TGGTGACGAC GAAGACGCTC GCCTCGCTCT ACGACTTCGA GAACGAGGAC AAGCTCTTCG CGCACGTCCA TCATTCGTTC CGATTCTGCC TGTGGACCGG GTCGGGCCCT CAAGCGCCTC GGCGCCGGAT CGACCTGGCG TTCCGGCTCC GCCAGGTCGC GCATCTTGAC GAGCGCTCTT TCACCCTCAC ACCCGAGGAC ATCACATTGC TCAATCCGAA CACCGGAACT TGCCCGGTAT TCGACTTCAA GCGCAATGCT GAGATAACTC TGGGAATCTA TCGGCGAGTC CCGGTGCTCT GGCGAGAGAA TCCCAAGAGC AATCCCTGGG AGCTGTCGTT TATGGCCATG TTGCATATGG CGAACGACTC CGGCCTCTTC CGTCCCAGCG CTGAAGAACG GGAGACGTTG GAAGGGATGC TGGCGGCTGG GTGGCAGCTC GACGGAAACC ACCTCGTCAA GGGTGACGAA CGCCTCCTTC CTCTATTCGA GGCGAAGATG CTCCACCACT TCGATGACCG CTTCGGAACT TACGAGGGCC AGACCCAGGC GCAGGCCAAC GTGGGCACCC TGCCCCGACC TGCTCCTGTG CAGAAGGGCG ACCCGGCCTA CGTGGTGCAG CCCCGTTACT GGGTGGCCGA GAAGGAGGTT CAGGAGAGAC TCTGTCCAGG GAACGTCGTC GAGGGTGATC GTCGTTTCCG TAAGTGGGAC AAAGGGTGGC TACTCGGGTG GCGTGACATC TGCCGCAGTT CCGACGAACG TACTTTGATT GATTCTGTTT TTCCTCGTAC CGCTACCCCT GACACCACTC TGTTAATGCT TCCCAATCGA GGGTCTGCCG CTTGTCTTTC CGGCAATCTC TCCTCATTTA TTCTCGACTA TGTAGTTCGA CAGAAAAGCA 
GTGGTACGCA CCTGAAGTAT TTTACGGTGC GACAGTTGCC TATTCTTAGT CCAGAGGCAT ATGGCGAGGA TTGTCTCTGG GGTCCAGGTG AGCGGCTGCA CTTATGGGTT AGGGCGCGGG TGCTGGAGCT TTCCTATACC TCCTATGGCT TGGAGGCGTT CGCCCGCGAC AACGGGGACA AGGGTGCCCC TTACAGGTGG GACGAGGAGC GGCGGTTCTG GCTGCGGGCC GAGTTGGATG CGGCCTACTT CCACCTCTAT GGGGTGGTCC GCGAGGACGT GGACTACATC ATGGACACCT TCCGTGCCTT CCGGAACAAG TCTCCTGACC TGTTCGAGCG CACCAAGAAG GCGATCCTGG AGATCTACGA CGCCATGCAG GACGCCATCG ACGGGCGGAA GCCGTACCAG ACCCCGCTTG ACCCGCCCCC CGGCCATGGC CCTCGTCACC CGGAGCGGCG CGCGTGA
|
Protein sequence | MRGTSYLAVE VAGGLIPQDV LSRIGAADRE LPGIRPEDYH LAASERLGDA ASRRWDYLLG AYRAFRERVD GLPDGDPATT PTRERWLQVL LSELGFGHVP YIRGGLKAGD KEFPVSHLWH NVPMHLLGWN TRLDRATPGR AETKRAPQSM LQEFLNLSDD HLWGVLSNGR QLRILRDSTA LVGSAYVEFD LEAIFDGELY SEFVLLYTLL HASRFELISG DDSTPTCADC WLEKWRAFAA ETGLRARDQL RDGVQKALEA LGTGFLVANP GLRDQLVSGA LKREDFHHEL LRLAYQLIFL FVAEDRGALL TPDAAQDARD RYATYFSTRR LRSLAARRAG DRHTDLWKTT ARVIRALGDD DGLPMLGLPG LGGLFFRTAG TLAELTDTPV PDQFLVCDLP NEALLTAVRH MSTVRGKDGR PRDVDFQHLG AEELGSVYES LLELVPHPDL AVPTFELKTV AGNDRKTTGS YYTPSSLIES LLDTALDPVI DEHAKSGVAD DLLKITVCDP ACGSGHFLVA AARRIAKRYA AMVTGEAEPV PSAVQKAMHK VVGTCIYGVD IQPLAAELAK FSLWMESLEP GKPLAFLDAH IKVGNSLLGT TPRLLDDGIP DEAFKAIEGD DRKIVASLKK DNARQRRNQG MLFTETGTRL GNKELAGEVQ ALAAVPVSTV ADVREQERRF REFERSDALT HARLVADAWC AAFVWRKHAD APPAITTDTV RRLQEGGGLP SAVALELDRL VERYRFFHWH LEFPDVFDAE AGAGSDANAA TGWRGGFTCV LGNPPWERVK LQEKEFFAAR HEGIANAKNA AARKKAIAAL ATSAEETDRW LFAEFAGELR SADGWTHLLR ESGRYPLTGR GDINTYAVFA ETGRTLLAPS GRVGMVLPTG IATDATTQFF FKDMVTTKTL ASLYDFENED KLFAHVHHSF RFCLWTGSGP QAPRRRIDLA FRLRQVAHLD ERSFTLTPED ITLLNPNTGT CPVFDFKRNA EITLGIYRRV PVLWRENPKS NPWELSFMAM LHMANDSGLF RPSAEERETL EGMLAAGWQL DGNHLVKGDE RLLPLFEAKM LHHFDDRFGT YEGQTQAQAN VGTLPRPAPV QKGDPAYVVQ PRYWVAEKEV QERLCPGNVV EGDRRFRKWD KGWLLGWRDI CRSSDERTLI DSVFPRTATP DTTLLMLPNR GSAACLSGNL SSFILDYVVR QKSSGTHLKY FTVRQLPILS PEAYGEDCLW GPGERLHLWV RARVLELSYT SYGLEAFARD NGDKGAPYRW DEERRFWLRA ELDAAYFHLY GVVREDVDYI MDTFRAFRNK SPDLFERTKK AILEIYDAMQ DAIDGRKPYQ TPLDPPPGHG PRHPERRA
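Note: the coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 8626853
end_bp = 8630929
gene_length_bp = 4077
protein_length_aa = 1358


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for Sros_7831: the span covers 4077 bp, which is 1359 codons, and dropping the stop codon leaves 1358 amino acids.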