Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4245 |
Symbol | |
ID | 5211230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5326315 |
End bp | 5327421 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640597834 |
Product | restriction endonuclease |
Protein accession | YP_001278538 |
Protein GI | 148658333 |
COG category | [V] Defense mechanisms |
COG ID | [COG1715] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00114082 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.143836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCGTC GTCGTTCACG TCACTCGCCC GATGCCGGGT CTGCCATCGG CGCTCTTTTC CTGCTCGGAT TGATCGGATT TCGTCCTTTC TGGCAAACCG TAACGAACCT TGCCCTCCCC TGGCAGATCG CTGCTGCGGT GCTGATCCTC TCGGTTGCGT TTGTGATCCT GTGGTTTGTG CGTCTTTTGA TACGTCATGC GCGCCAGCGG AGCCTGGTGC GTAAGGAACT GTACGCGCTC ACGCCGACCG AATTTGAAGA ACGGGTGCTG CTCTTGTTGA AAGACCTGGG CTGGAGTCAT CTCAGATTGC GCGGCGGCAG TGGAGATCGC GGCGTTGATC TGGAAGGCGA GTTTCAGGGT ACACGGTATG TCGTCCAGTG TAAGCGCTAC CATCACAATA AGTCGGTTTC TCCCTCAGCG GTGCGCGATC TCGTCGGAGC GTTGCACATT CAGAAAGCCG ACCGCGCATT GCTGGTGACG ACAAGTTCGT TTACGCCGCA GGGGTATGCC GAGGCGCGCG ATCAGGCAGT GGAACTGTGG GATGGCGCTA TTCTGGAGCA GAAGATAAGC GAAGCTGCCA GGTTGCGTGA AGACCCGACA CGTAGACAGG CGGTGCAACG GCGACGCCTT GCAACATTCA TTACACTGGT GGTGATCAAC GGGTTAAGCG TTCTGTCAGC ATTTGCGATT GCAGGTCCGC CATCGTCTGC GCCACCAACT ATTCGCACAG CGCCGACGCC CTCTCCTGAA AGTATTGCCG GATCACCTCT GGGAAGAACC GCATCTTCTC TTCCCTTGCC TACCCAGACT CCTTCTTCGG AAGAACCTCA ACCGACAGCG CTGCCGACAC CGACGGTGGC GCCGACAGAA CCCCCCGTCC CGACCACAAC CGTTTTCAAT GGCGGGAATG TGCGCGCTGC GCCGAATATG CGGGGCACGG TGCTCGATCA GGTGCACGCT GGCGAGATTG TCGAACTGCT CGGTCGTTCG GCGGACGGAA ACTGGCTCTA TATCCGCAAT CCGCGCGGTC AGGTTGGCTG GACGCACCGC ACCCTGCTGA CTCTCGAAGC AGACATCAGT GAGCGTCTGG AGGTGGTAGC GCCGTGA
|
Protein sequence | MSRRRSRHSP DAGSAIGALF LLGLIGFRPF WQTVTNLALP WQIAAAVLIL SVAFVILWFV RLLIRHARQR SLVRKELYAL TPTEFEERVL LLLKDLGWSH LRLRGGSGDR GVDLEGEFQG TRYVVQCKRY HHNKSVSPSA VRDLVGALHI QKADRALLVT TSSFTPQGYA EARDQAVELW DGAILEQKIS EAARLREDPT RRQAVQRRRL ATFITLVVIN GLSVLSAFAI AGPPSSAPPT IRTAPTPSPE SIAGSPLGRT ASSLPLPTQT PSSEEPQPTA LPTPTVAPTE PPVPTTTVFN GGNVRAAPNM RGTVLDQVHA GEIVELLGRS ADGNWLYIRN PRGQVGWTHR TLLTLEADIS ERLEVVAP
|
| |