Gene Sros_7554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7554 
Symbol 
ID8670875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8348071 
End bp8351289 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content66% 
IMG OID 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_003342976 
Protein GI271968780 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG ACAAGCACGA TGGCAAGATG ACCGAGTCCG TCTGGGAGCG GCTCGCCCTG 
GAGGAGCTCG CCGAGCTTGC CTGGGAGCCC AAGGCCGGAA AGGACGTCGC CCCCGGCTCC
GGGAGCCGTA GGGCATGGGA CGACCTGATC CTCTACGACG AGCTGCGTGC GGCGATTGGG
CGGTTGAACC CCGCCCTGCC CCCCACCGCC GTGGACGAGG CTCTCAGCAT CGCCACCACT
CCCAAGTCCC TCGACGCCCT CCCCGAGAAC CGGCTCGCCC ACGACTATCT GACCTCCGGC
ATCCGGGCCG TCACCTACAC CGACGACTTC GGCGCTGAGC ACACCCCCAC GATCCGGCTC
GTCGACCTGC GCAACCCGGA CGCGAATACC TACCACGTCG TCAACCAGGT CACCGTCATC
GACAACGACC GCAAGCGCCG CTTCGACGCC GTCCTCTACG TCAACGGCCT GCCGCTCGCC
GTCATCGAGC TGAAGAGCGC CGCCGACGAG CACGCCACGC TCAAGGACGC GCACGCCCAG
CTGAGCACCT ACCTCGACGA GTTTCCCCTC GCGTTCCGCT ACAACGTGCT CTGCCTGATC
TCCGACGGGA TCACCGCCAA GTACGGCACG CCGTTCACCG CCTATGAGCA CTTCGCGCCC
TGGAACGTGG ACGAAGACGG TGACCCGGTG GACACCAACG CCTCCGACCA CGAAGGACCG
GAGGCGCTGT TCCTCGCCCT GCACGGCCTG TTCAACCAGC CGCGATTCCT AACCTTCACT
CGCGACTTCG TCAACTTCAC CCCCCAGGGC AAGCGCATCG CGAAGCCGCA CCAGTTTCAT
GCCGTTCAGA AGGCCGTCGA GGCGATCGTC GAGGCGTCCC GCAGCAACGG GCAGGCCGGG
GTGATCTGGC ACACGCAGGG CTCCGGCAAG TCCGAGGAGA TGGTGTGCAC CAGCGCGCTG
GCCTCCAGAC ACCCCGCGCT CAACAACCCC ACCATCGTCG TCATCACCGA CCGCACCGAT
CTCGACGACC AGCTTTTCGG CACCTTCCAG GACAGTCAGA CCCTTCTCGG TCAGACCCCG
ATCCAGGTGC AGACGCGTGA GGAACTCCGC GCGGAGCTGA CCAACCGCCG GACTGGTGGC
ATCGTCTTCA CCACGCTGCA GAAGTTCGGC CGCACGAAGG AGGAGAAGGA CCTCGGCGTG
GACCATCCGC TTCTGTCCGA CCGGCGCAAC ATCCTGGTCA TCGTCGACGA GGCGCACCGC
AGCCACTACG ACAACCTCAA CGGCTACGCC CGGCGCCTGC GTGAGGCTCT GCCGTGCGCC
ACGCTCCTCG CCTTCACCGG CACACCCATT TCCAAAGCCG AGGCCAACAC CCGCGAGGTC
TTCGGCGGGG GCAAGGACTA CATCGACGTC TATGACCTGA AGCGGGCTGT AGATGACGGC
GCGACGGTCC GGGTCTACCA CGAGCCCCGG CTCATTCCGG TCTCGCTGCC CCCGGATGTG
GATCCGGAGA CCATCGACCA GCAGGCCGAC GACCTGACCG CAGGCATGGA CGACGCCGAG
CGCCGCCGGG CTCTGCACTA CGCCACGCAG ATGACCAACG TGTACGGTGC GCCCGACCGG
ATAAAGACCC TCGCTGAGGA TTTGGTGGCG CATTGGGAGA AGCGTTCGGA GTTGATGCGC
CCGCAGATCG GCGGGCCGGG CAAGGCGATG ATCGTCTGTG CTACTCGGGA CGTCTGCGTC
AGGGTCTACG ACGCCCTCGA AGAGCTGCAA CCCGAATGGG CGGACGACGA CCCGACCAAG
GGCAAAATGA AGATCGTCTT TCATAGCGTT CCCAGCGACG AGAAGCACCT GAAAGCCCAC
GCCCTGCGCC CCTCCCAGCA CAGGATCGTC CAGGCCCGGG CGAAGGATCC CGACGACGAG
CTGGAGCTGC TCATCGTCCA CTCCATGCTG CTTACCGGCT ATGACGCCCC GCCGATCCAC
ACCATCTACA TGGACCGCCC CATGCAGGGC GCGAACCTGA TGCAGGCCCT GGCCCGTGTC
AACCGCCGTT TCCGCGGCAA GCAGGACGGC TTGCTCGTCG GTTACGCGCC ACTCACCGAG
AGTCTCAAGA AGGCCCTCGC CGAATACACC CCGAGTGACC GGCAGGACCA GACATTGGGC
CGGGACGTCG AACGGGCCAT CACCGAGGTT CGTAACGAGT ATTCGACCAT CTGCGGCCTG
CTCGCCGGCA TCGACTGGCG GGCGCTACTC GTCGACACCT CCACGTCTCA GCCGCGGACG
CGCGCGTGCC GCTTGACGGC CAATCATCTG CGTGCTCCGA GCACCCCCGG CAACCAGAGC
GAGCCGGGAG CCAAGACGCT CGCCGTACGC TTCCGGGAGA GCGCCACGCG GCTGGAGCGC
TTCTACGGGC TCTGTGCGAT GAGCAGGGAG ATCTCAGAGC GCTTCGAAGA CCTTAAAGCA
TGGCGCCGGG ATATCGCCTT CTTCAGCGAG GTGCGGGCCT GGATGGTGAA GCTGGACGCC
GCGGACCGCG AGGCCAGCGG CAAGCCGGTG GCGGCCGAGG TCAGCCTCTA CCTGTCCCAG
CTGGCGGCCT CCGTCGTCGA CGCCGATGAG ATCACCGACC TGTACGCCGA GGCCGGCATC
GGGCAGCTCG ACATCACCCA GCTCAGCGAC CAGCACCTGC GCAAGATCCA GGAGTCCGAG
ACACCTCACC TGGTCGCCGA GGCGCTGCGC CGGTTGATCG AGCAGAAGAT GCGTGAGGTG
ACCCGGCACA ACATCGTCCG TCAGGAGAGC TTCACCGAGC GCCTCGAAGA CCTGATGACC
CGCTACATGC GGCAGCAGCT CACCAGCGCC GAAATGATCG CCGAGCTGGT CGCCATGGCA
AAGGAGGTCT CCGCGGACGC CCGGCGCGGT GAGCGATTCG ACCCTCCGCT CAACCATGCC
GAACTCGCCT TCTACGATGC CGTGGCCAAC CACGGCCTCG CGAAAGCTCT CATGGGGGAC
GACACCCTCG CGGAGATCGC CCGCGCACTG GTCACCGACA TCCGCAAGAA CCTCAGCGTC
GACTGGCTCT CCCGCGAGCC GGTGCGTGCC AAGCTGCGCA GCCGCGTCCG GCGCCTGCTG
GCGAAGTTCG ACTACCCGCC CGAGGAGGAA CGCGAGGCCG TGGACCTCGT GATCAAGCAG
ATGGAAGCCT TTGCCAACGA GTGGTCTCCG AAGGCGTAG
 
Protein sequence
MTTDKHDGKM TESVWERLAL EELAELAWEP KAGKDVAPGS GSRRAWDDLI LYDELRAAIG 
RLNPALPPTA VDEALSIATT PKSLDALPEN RLAHDYLTSG IRAVTYTDDF GAEHTPTIRL
VDLRNPDANT YHVVNQVTVI DNDRKRRFDA VLYVNGLPLA VIELKSAADE HATLKDAHAQ
LSTYLDEFPL AFRYNVLCLI SDGITAKYGT PFTAYEHFAP WNVDEDGDPV DTNASDHEGP
EALFLALHGL FNQPRFLTFT RDFVNFTPQG KRIAKPHQFH AVQKAVEAIV EASRSNGQAG
VIWHTQGSGK SEEMVCTSAL ASRHPALNNP TIVVITDRTD LDDQLFGTFQ DSQTLLGQTP
IQVQTREELR AELTNRRTGG IVFTTLQKFG RTKEEKDLGV DHPLLSDRRN ILVIVDEAHR
SHYDNLNGYA RRLREALPCA TLLAFTGTPI SKAEANTREV FGGGKDYIDV YDLKRAVDDG
ATVRVYHEPR LIPVSLPPDV DPETIDQQAD DLTAGMDDAE RRRALHYATQ MTNVYGAPDR
IKTLAEDLVA HWEKRSELMR PQIGGPGKAM IVCATRDVCV RVYDALEELQ PEWADDDPTK
GKMKIVFHSV PSDEKHLKAH ALRPSQHRIV QARAKDPDDE LELLIVHSML LTGYDAPPIH
TIYMDRPMQG ANLMQALARV NRRFRGKQDG LLVGYAPLTE SLKKALAEYT PSDRQDQTLG
RDVERAITEV RNEYSTICGL LAGIDWRALL VDTSTSQPRT RACRLTANHL RAPSTPGNQS
EPGAKTLAVR FRESATRLER FYGLCAMSRE ISERFEDLKA WRRDIAFFSE VRAWMVKLDA
ADREASGKPV AAEVSLYLSQ LAASVVDADE ITDLYAEAGI GQLDITQLSD QHLRKIQESE
TPHLVAEALR RLIEQKMREV TRHNIVRQES FTERLEDLMT RYMRQQLTSA EMIAELVAMA
KEVSADARRG ERFDPPLNHA ELAFYDAVAN HGLAKALMGD DTLAEIARAL VTDIRKNLSV
DWLSREPVRA KLRSRVRRLL AKFDYPPEEE REAVDLVIKQ MEAFANEWSP KA