Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_7554 |
Symbol | |
ID | 8670875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 8348071 |
End bp | 8351289 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_003342976 |
Protein GI | 271968780 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCG ACAAGCACGA TGGCAAGATG ACCGAGTCCG TCTGGGAGCG GCTCGCCCTG GAGGAGCTCG CCGAGCTTGC CTGGGAGCCC AAGGCCGGAA AGGACGTCGC CCCCGGCTCC GGGAGCCGTA GGGCATGGGA CGACCTGATC CTCTACGACG AGCTGCGTGC GGCGATTGGG CGGTTGAACC CCGCCCTGCC CCCCACCGCC GTGGACGAGG CTCTCAGCAT CGCCACCACT CCCAAGTCCC TCGACGCCCT CCCCGAGAAC CGGCTCGCCC ACGACTATCT GACCTCCGGC ATCCGGGCCG TCACCTACAC CGACGACTTC GGCGCTGAGC ACACCCCCAC GATCCGGCTC GTCGACCTGC GCAACCCGGA CGCGAATACC TACCACGTCG TCAACCAGGT CACCGTCATC GACAACGACC GCAAGCGCCG CTTCGACGCC GTCCTCTACG TCAACGGCCT GCCGCTCGCC GTCATCGAGC TGAAGAGCGC CGCCGACGAG CACGCCACGC TCAAGGACGC GCACGCCCAG CTGAGCACCT ACCTCGACGA GTTTCCCCTC GCGTTCCGCT ACAACGTGCT CTGCCTGATC TCCGACGGGA TCACCGCCAA GTACGGCACG CCGTTCACCG CCTATGAGCA CTTCGCGCCC TGGAACGTGG ACGAAGACGG TGACCCGGTG GACACCAACG CCTCCGACCA CGAAGGACCG GAGGCGCTGT TCCTCGCCCT GCACGGCCTG TTCAACCAGC CGCGATTCCT AACCTTCACT CGCGACTTCG TCAACTTCAC CCCCCAGGGC AAGCGCATCG CGAAGCCGCA CCAGTTTCAT GCCGTTCAGA AGGCCGTCGA GGCGATCGTC GAGGCGTCCC GCAGCAACGG GCAGGCCGGG GTGATCTGGC ACACGCAGGG CTCCGGCAAG TCCGAGGAGA TGGTGTGCAC CAGCGCGCTG GCCTCCAGAC ACCCCGCGCT CAACAACCCC ACCATCGTCG TCATCACCGA CCGCACCGAT CTCGACGACC AGCTTTTCGG CACCTTCCAG GACAGTCAGA CCCTTCTCGG TCAGACCCCG ATCCAGGTGC AGACGCGTGA GGAACTCCGC GCGGAGCTGA CCAACCGCCG GACTGGTGGC ATCGTCTTCA CCACGCTGCA GAAGTTCGGC CGCACGAAGG AGGAGAAGGA CCTCGGCGTG GACCATCCGC TTCTGTCCGA CCGGCGCAAC ATCCTGGTCA TCGTCGACGA GGCGCACCGC AGCCACTACG ACAACCTCAA CGGCTACGCC CGGCGCCTGC GTGAGGCTCT GCCGTGCGCC ACGCTCCTCG CCTTCACCGG CACACCCATT TCCAAAGCCG AGGCCAACAC CCGCGAGGTC TTCGGCGGGG GCAAGGACTA CATCGACGTC TATGACCTGA AGCGGGCTGT AGATGACGGC GCGACGGTCC GGGTCTACCA CGAGCCCCGG CTCATTCCGG TCTCGCTGCC CCCGGATGTG GATCCGGAGA CCATCGACCA GCAGGCCGAC GACCTGACCG CAGGCATGGA CGACGCCGAG CGCCGCCGGG CTCTGCACTA CGCCACGCAG ATGACCAACG TGTACGGTGC GCCCGACCGG ATAAAGACCC TCGCTGAGGA TTTGGTGGCG CATTGGGAGA AGCGTTCGGA GTTGATGCGC CCGCAGATCG GCGGGCCGGG CAAGGCGATG ATCGTCTGTG CTACTCGGGA CGTCTGCGTC AGGGTCTACG ACGCCCTCGA AGAGCTGCAA CCCGAATGGG CGGACGACGA CCCGACCAAG GGCAAAATGA AGATCGTCTT TCATAGCGTT CCCAGCGACG AGAAGCACCT GAAAGCCCAC GCCCTGCGCC CCTCCCAGCA CAGGATCGTC CAGGCCCGGG CGAAGGATCC CGACGACGAG CTGGAGCTGC TCATCGTCCA CTCCATGCTG CTTACCGGCT ATGACGCCCC GCCGATCCAC ACCATCTACA TGGACCGCCC CATGCAGGGC GCGAACCTGA TGCAGGCCCT GGCCCGTGTC AACCGCCGTT TCCGCGGCAA GCAGGACGGC TTGCTCGTCG GTTACGCGCC ACTCACCGAG AGTCTCAAGA AGGCCCTCGC CGAATACACC CCGAGTGACC GGCAGGACCA GACATTGGGC CGGGACGTCG AACGGGCCAT CACCGAGGTT CGTAACGAGT ATTCGACCAT CTGCGGCCTG CTCGCCGGCA TCGACTGGCG GGCGCTACTC GTCGACACCT CCACGTCTCA GCCGCGGACG CGCGCGTGCC GCTTGACGGC CAATCATCTG CGTGCTCCGA GCACCCCCGG CAACCAGAGC GAGCCGGGAG CCAAGACGCT CGCCGTACGC TTCCGGGAGA GCGCCACGCG GCTGGAGCGC TTCTACGGGC TCTGTGCGAT GAGCAGGGAG ATCTCAGAGC GCTTCGAAGA CCTTAAAGCA TGGCGCCGGG ATATCGCCTT CTTCAGCGAG GTGCGGGCCT GGATGGTGAA GCTGGACGCC GCGGACCGCG AGGCCAGCGG CAAGCCGGTG GCGGCCGAGG TCAGCCTCTA CCTGTCCCAG CTGGCGGCCT CCGTCGTCGA CGCCGATGAG ATCACCGACC TGTACGCCGA GGCCGGCATC GGGCAGCTCG ACATCACCCA GCTCAGCGAC CAGCACCTGC GCAAGATCCA GGAGTCCGAG ACACCTCACC TGGTCGCCGA GGCGCTGCGC CGGTTGATCG AGCAGAAGAT GCGTGAGGTG ACCCGGCACA ACATCGTCCG TCAGGAGAGC TTCACCGAGC GCCTCGAAGA CCTGATGACC CGCTACATGC GGCAGCAGCT CACCAGCGCC GAAATGATCG CCGAGCTGGT CGCCATGGCA AAGGAGGTCT CCGCGGACGC CCGGCGCGGT GAGCGATTCG ACCCTCCGCT CAACCATGCC GAACTCGCCT TCTACGATGC CGTGGCCAAC CACGGCCTCG CGAAAGCTCT CATGGGGGAC GACACCCTCG CGGAGATCGC CCGCGCACTG GTCACCGACA TCCGCAAGAA CCTCAGCGTC GACTGGCTCT CCCGCGAGCC GGTGCGTGCC AAGCTGCGCA GCCGCGTCCG GCGCCTGCTG GCGAAGTTCG ACTACCCGCC CGAGGAGGAA CGCGAGGCCG TGGACCTCGT GATCAAGCAG ATGGAAGCCT TTGCCAACGA GTGGTCTCCG AAGGCGTAG
|
Protein sequence | MTTDKHDGKM TESVWERLAL EELAELAWEP KAGKDVAPGS GSRRAWDDLI LYDELRAAIG RLNPALPPTA VDEALSIATT PKSLDALPEN RLAHDYLTSG IRAVTYTDDF GAEHTPTIRL VDLRNPDANT YHVVNQVTVI DNDRKRRFDA VLYVNGLPLA VIELKSAADE HATLKDAHAQ LSTYLDEFPL AFRYNVLCLI SDGITAKYGT PFTAYEHFAP WNVDEDGDPV DTNASDHEGP EALFLALHGL FNQPRFLTFT RDFVNFTPQG KRIAKPHQFH AVQKAVEAIV EASRSNGQAG VIWHTQGSGK SEEMVCTSAL ASRHPALNNP TIVVITDRTD LDDQLFGTFQ DSQTLLGQTP IQVQTREELR AELTNRRTGG IVFTTLQKFG RTKEEKDLGV DHPLLSDRRN ILVIVDEAHR SHYDNLNGYA RRLREALPCA TLLAFTGTPI SKAEANTREV FGGGKDYIDV YDLKRAVDDG ATVRVYHEPR LIPVSLPPDV DPETIDQQAD DLTAGMDDAE RRRALHYATQ MTNVYGAPDR IKTLAEDLVA HWEKRSELMR PQIGGPGKAM IVCATRDVCV RVYDALEELQ PEWADDDPTK GKMKIVFHSV PSDEKHLKAH ALRPSQHRIV QARAKDPDDE LELLIVHSML LTGYDAPPIH TIYMDRPMQG ANLMQALARV NRRFRGKQDG LLVGYAPLTE SLKKALAEYT PSDRQDQTLG RDVERAITEV RNEYSTICGL LAGIDWRALL VDTSTSQPRT RACRLTANHL RAPSTPGNQS EPGAKTLAVR FRESATRLER FYGLCAMSRE ISERFEDLKA WRRDIAFFSE VRAWMVKLDA ADREASGKPV AAEVSLYLSQ LAASVVDADE ITDLYAEAGI GQLDITQLSD QHLRKIQESE TPHLVAEALR RLIEQKMREV TRHNIVRQES FTERLEDLMT RYMRQQLTSA EMIAELVAMA KEVSADARRG ERFDPPLNHA ELAFYDAVAN HGLAKALMGD DTLAEIARAL VTDIRKNLSV DWLSREPVRA KLRSRVRRLL AKFDYPPEEE REAVDLVIKQ MEAFANEWSP KA
|
| |