Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0411 |
Symbol | |
ID | 5207347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 523858 |
End bp | 524982 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640594037 |
Product | restriction endonuclease |
Protein accession | YP_001274792 |
Protein GI | 148654587 |
COG category | [V] Defense mechanisms |
COG ID | [COG1715] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTACC TCGACGCTGC CTATACTATC CTCAAAGCTG CCGGTCAACC GTTGCACTAC GAGGCAATCA CCCAGGAGGC GCTAAAGCAG GGGCTGATCA AGCCACAGGG AGCAACCCCG GCTGCGACGA TGGGCTCCCG GCTCTACACC GACACGCTGG AGGAAGGCTC ACGATTCGTG CGCGCTGGCA GGGGAACGTT CGGTCTGGCT GAATGGCGAC CCAGGGGCAT CGACGCCCAC GTGTCCGACA TCAACGCAGA CACGCGAAAA CAATTGCGCG AAATGGTCCT GAACATGCCG CCAGACCGGT TCGAGGCGCT GATCCGCGAG TTGCTGATCC GCATGGGCTT TGATGAGAGT ACAGTCCAGA TCACCCCGTA TCGCAGCGAT GGCGGCGTTG ACGTCATTGG CATCTACCGC GCCGCCGGAC TGACCGAGGT GAGCGCTGCG GTGCAGGTGA AACGCTGGAA GGGCAACGTA GGCGCGTCGA TCGTCACCCA GTTGCGCGGT TCGCTGCAGG TGCATCAGCA GGGTATTATC ATCACTACCA GCGACTTCAC GAAAGATGCG CGTCGAGAGG CTGTCGAAGC GAACAAAACG CGGATTGGTT TGATCAATGG TGATGAACTG ATCGATCTGC TGGTGAAGCA TCAGGTAGGC GTGGTCAAGC GCACCCTGGA GGTAACTGTC CTGGACGACG AGTATTGGAG CGAGTTGATC GGACAGAGCA GTGCGCCTTC TGTCCCGACG GTTTCGCTTC CGGATCCAGT CGCACCCAGA CCAGCCCCGA CTGTCAAAAC AAGGTTGATG CCAGGGAAAC CCAAAGGCTT CATTCTGTTT GGCGAATTCT ACGCTGCGAA CACCTGGCGC GGGGTGTTGT TGGGTGTCTG CCAGGCGCTG GCGCAGCGTT GCGACAATTT CGCAACGGTT GCGACCATTA TCAAGGGGCG CTCGCGGCAG CACATTGCTG ACAGCCCGAC TGGTATGATC TCCCCGGCGC CCATCCCGGG CGCTGCGTTG TGGATCGAGA CCAATCAGAG CGCACGATCA GTGCTATGGA TTATCGCGCA GTTGCTGGAA GCGCTGGGGC GCTCGCCGAA CGACTTTGAG ATTGTGGTCA GTTGA
|
Protein sequence | MTYLDAAYTI LKAAGQPLHY EAITQEALKQ GLIKPQGATP AATMGSRLYT DTLEEGSRFV RAGRGTFGLA EWRPRGIDAH VSDINADTRK QLREMVLNMP PDRFEALIRE LLIRMGFDES TVQITPYRSD GGVDVIGIYR AAGLTEVSAA VQVKRWKGNV GASIVTQLRG SLQVHQQGII ITTSDFTKDA RREAVEANKT RIGLINGDEL IDLLVKHQVG VVKRTLEVTV LDDEYWSELI GQSSAPSVPT VSLPDPVAPR PAPTVKTRLM PGKPKGFILF GEFYAANTWR GVLLGVCQAL AQRCDNFATV ATIIKGRSRQ HIADSPTGMI SPAPIPGAAL WIETNQSARS VLWIIAQLLE ALGRSPNDFE IVVS
|
| |