Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0905 |
Symbol | |
ID | 5207851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1117623 |
End bp | 1120691 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640594519 |
Product | SMC domain-containing protein |
Protein accession | YP_001275264 |
Protein GI | 148655059 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.995306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000299578 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCCCGC GCAAACTGGC ACTGCAAAAC TTTATGTGTT ACCGCGAAGG ACTGCCGCCG CTCGAGTTCG ATGGCATGTC GATTGCGTGC CTGGCAGGCG AAAATGGCGC CGGCAAATCG GCGCTGCTCG ACGCCATCAC ATGGGCGCTG TGGGGCGAGG CGCGCCTGAA GAGCGATGAT GACCTGGTGG CGCTTGGCGC TACTGAGATG ATGGTGGAAC TGGAGTTCAC CCTCGATGGG CAGGATTACC GGGTCATCCG GCGTCGCATT CGCGGGAAGC GCGGCGGTCA GAGCCAGCTC GATTTCCAGG TGCGCGATGG GAACGAATGG CGCTCGCTCA CTCCCGGCGG CATTCGTGAA ACGCAGCAGT TGATCATCCG CACGTTGCGG ATGGATTACG AGATGTTCGC CAACTCCGCC TACCTCCGTC AGGGGCGCGC CGATGAGTTT ACCCGCAAAG AGCCGGCAAA GCGCAAGCAG GTGCTGGCGG ACATTCTCGG TTTGAGCATC TACGAGGAAC TGGAAGGTCG GGCGAAAGAG CGTGCGCGCA GCACTGAAGG ACAGATTCGC GGTCTCGAAG GACAGATCGG TGAACTGCGC CGGCAGGCGG AACGCTATGA CCTGCTGGTG GAAGAGGTGC ACACTGCTGA AGAGCGTGTC GCCGATGCGA TGAAGCGTGT CGAAGCGGCG CGACTGGCGC TCGAAGAGGC GACTGCCAGC GTGCAGAAAC TCGAACAGGT CCGTACCATC CGTGACAATC TCCACACCCA GACTCAGCGA CGCCGCAGGG AACGCGAGGA TCAGGCGCAA TGGCTGGATC GGCAGATGGA AACCCAAAAG CGCGCCGAGT CCTGGATTGC GCGTCGCGCC GAGATCGAGG AGGGCATTCG CCTGCTACGC GCCGCCAAAG CGGAGTGTGA TCGCCTCGCC GCCCTGCGCG ACGAGTATGA CCATCTGCAG AGCCGTCGCG CGGCGCTTGC GCAGGCGCTC GCCAATGCAG AACACGCCAT CCGCTCCGAT CTGCGCGTTG CCGAAACGCA GGTTCAGACC CTCCGCGAAC GCGCCGCCCG TCGCCCGAAA CTGGCAGCGG AACTTCAGCG CCTCGCTGCC CAACTTGCAG AACATCCCCC GATTGCAGAA GCGCTCTCCG CTGCCCGCAC CCGTCGCGCC ACGCTGACCG AACGTCTGCG GCGCGTCAAC GAACTGTTGC GGCGCCGCAC CGAACTGGAG AGCGACATCA AACTGAAACA CGACTCGCTG GTCGCCACGC GCGAAGAGCA GAAGCGCACA CTCCGCACGC TGGCGGAGCA ACTGAAGCAC GAAGCGCGCT GGCGCACCGA ACTCGCCGAA GCGACTGCGG AACGGGCACG CCTCGAACAG GAAGCCGCCC ACCTCGATAC TCTGCGCGAC GACGAGCGCG CCCTTGCCGA AAAGATCGGC GCCATCCGCG CCGAATGCGA AACGGTCAAA AAGCAGGGCG ATCAGATCAA CGAGAAACTG CGCCTGCTCG GTCCCGATCT CAAAGTTTGC CCGCTCTGCA AGAGCGAACT GGGACACGAT GGCATTGCGC ACATCCAGGC GGAGTACGAA CGCGAACGTC AGGCGCTGCG CCAGCAGTAT GCGGCTGCAA AACGCGAAGC CGATCACCTG GAAGCGCACC TCAAACGTCT GCGCAACGAC ATTCGCACCA TCGAGGCGCG TGTCAACACC CTCCCCGACG TGCAGGGACG CATTGCGCGC CTTGAAAGCG AACTGGCAAA ATGTGACGTC TGCCGCCAGC AGCAGATCGA GGCGCAACGC CTGCACGACG ATGTGGCGCT GCGCCTGTTG AAGAACGATT ACGAACCGGC GGCGCGCGAG GAACTGAAGC GCATCGACGC CGAGATCAAC GCACTGGGAT CGGTCGATGC GCTCGAACGC GAGATTGCCG CAGTCGAACG CCAGGTTGCC GCGCTCGAAG ACCGCAGTCG AGAACAGGCA AGCGTGCAGG CGAAGATTGA CGGGTTGCAG CGCGAGATTC AGCAGATCGA CAGCGAAGAC CCTGCGCTGC ACGAACAGGA GCAGATCGTC GCGGCGTTGA GCGCCCAACT CGCACAGGGC GATTTCGCGC ACACCGAGCG CGCCGCGCTT GCGGACATCG ACCGGCAGAT TGCAGCGCTG GGATACAGCC GCGAGCGGTA TGACCAGGCG CAGGCTGATG TCCAGGCTCT GGTTCACTGG GAAGAAGACC TCATACGCCT GCAACGCGCC GAAGAGTGGC TTGCCGAAAA CCGCGACGAA CTCGCGCGCG CGGCTGAGCG CCTCCGGCAA CTCGACGCCC AGATCGCCGC CGACGAGGAA GAGGTGCGTT CCCTCGACGA ACGCCTGTGC GACCTGGCGC CAGCCGCGCG CGCACGCGCC GAGGCTGCCG CCCGGCTCGA CGAACTCAAC CATGAGGTGA TGGCGCGCCA GAAAGACCTG GGCGAACGCC AGGCGGATCT GCGCCGCGCA CAGGAAGCGA CCAGCGCACT CGCCGAAGCC GAAGCGCACC GCCAGGCGCT GCTCGAACGC AAGGGGTTGT TCGATGAACT GACGCAGGCG TTCGGCAAGA AAGGCATCCA GGCGATGCTG ATCGAAACCG CGCTTCCTGA ACTGGAACGC GAAGCCAACC GCCTGCTCAG CCGTATGACC GACAATCAAC TCCACCTGAC CTTTGAAACA CAACGCGACA CGAAGAAAGG GGAGGTCGCC GAGACGCTCG ACATCAAAAT CGCGGATGCG CTCGGCACCC GCGTGTACGA CGCCTACAGC GGCGGCGAGG CGTTTCGCCT CGACTTCGCC ATCCGGATCG CGCTCTCGAA GTTGCTGGCA CGGCGCGCTG GCGCGCGCCT GGAAACCCTG ATCATCGACG AGGGGTTCGG ATCGCAGGAC GCTCGCGGGC GTGAGCGCCT GGTCGAGGCG ATCATCTCGG TGCAGCACGA TTTCCGCCGC GTTCTGGTGA TCACCCACAT CCAGGAATTG AAGGATATGT TTCCGGTGCA GATCGAAATC GTCAAAACAC CAAACGGCAG CGTCTGGAAC ATCGTGTGA
|
Protein sequence | MIPRKLALQN FMCYREGLPP LEFDGMSIAC LAGENGAGKS ALLDAITWAL WGEARLKSDD DLVALGATEM MVELEFTLDG QDYRVIRRRI RGKRGGQSQL DFQVRDGNEW RSLTPGGIRE TQQLIIRTLR MDYEMFANSA YLRQGRADEF TRKEPAKRKQ VLADILGLSI YEELEGRAKE RARSTEGQIR GLEGQIGELR RQAERYDLLV EEVHTAEERV ADAMKRVEAA RLALEEATAS VQKLEQVRTI RDNLHTQTQR RRREREDQAQ WLDRQMETQK RAESWIARRA EIEEGIRLLR AAKAECDRLA ALRDEYDHLQ SRRAALAQAL ANAEHAIRSD LRVAETQVQT LRERAARRPK LAAELQRLAA QLAEHPPIAE ALSAARTRRA TLTERLRRVN ELLRRRTELE SDIKLKHDSL VATREEQKRT LRTLAEQLKH EARWRTELAE ATAERARLEQ EAAHLDTLRD DERALAEKIG AIRAECETVK KQGDQINEKL RLLGPDLKVC PLCKSELGHD GIAHIQAEYE RERQALRQQY AAAKREADHL EAHLKRLRND IRTIEARVNT LPDVQGRIAR LESELAKCDV CRQQQIEAQR LHDDVALRLL KNDYEPAARE ELKRIDAEIN ALGSVDALER EIAAVERQVA ALEDRSREQA SVQAKIDGLQ REIQQIDSED PALHEQEQIV AALSAQLAQG DFAHTERAAL ADIDRQIAAL GYSRERYDQA QADVQALVHW EEDLIRLQRA EEWLAENRDE LARAAERLRQ LDAQIAADEE EVRSLDERLC DLAPAARARA EAAARLDELN HEVMARQKDL GERQADLRRA QEATSALAEA EAHRQALLER KGLFDELTQA FGKKGIQAML IETALPELER EANRLLSRMT DNQLHLTFET QRDTKKGEVA ETLDIKIADA LGTRVYDAYS GGEAFRLDFA IRIALSKLLA RRAGARLETL IIDEGFGSQD ARGRERLVEA IISVQHDFRR VLVITHIQEL KDMFPVQIEI VKTPNGSVWN IV
|
| |