Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0086 |
Symbol | |
ID | 4026008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 106660 |
End bp | 108276 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637965237 |
Product | restriction modification system DNA specificity protein |
Protein accession | YP_572149 |
Protein GI | 92112221 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTC CAATTGATGA GGCATTACCA GTGCACTCTA TGGAAAAAAA ATTAGCTAAT ATAAAAACAC CTCACTGGTT ATGGATAGAG CATAACCAGA TAGCGGAGAT CAACCCAAAG AAGCCTAAAC TCGATGAAGA GCTTTCAGTT TCCTTCATCC CGATGGGCGC TGTAGCAGAA GAGTCTGGCC GCTACACCAC CGACGACAGC AAGAAATTCG AAGACGTAAA AAAAGGATAT ACTTACTTTT CAGATGGCGA CATCCTTTTT GCTAAAATCA CTCCTTGCAT GGAAAACGGA AAAGTTGCAC TTTTGAGCAA CCTAACCAAT GGCGTCGGCT TTGGGTCCAC AGAATTCCAC GTATCACGCC TAACCGAAGC CGTTGAAAAA AAATTTTATT TTTATTTTTT TGTTTCCAAG AGTTTCAGAA AACAAGCTCA AGCCAACATG GCTGGCAGTG CCGGCCAGCT TCGTGTCACC ACTGACTACT TTAGCAATGT CAGCGTCCCA CTTTGCCCTA CCAGAGAACA ACAGCGAATT GTCACCAAGA TAGAGGAGCT TTTCTCCGAA ATCGATAGCG GTGTGGAAAG CCTGAAAACC GCCCAGGCCA AGCTCAAGAC CGCCCGCCAG TCACTGCTCA AGGCCGCCTT CGAAGGCAAG CTGACCGAGC AGTGGCGAAA AGACAATGCC GATCGACAGG AAAGCCCGGA AGCCTTGCTG GAGCGGATTC AGGCCGAGCG CGAGGCGCAC TACCAACAGC AGCTGACCGA CTGGCAACAT CAGCTCAAGG ACTGGGAAGC CGCCGGCAAG GAAGGCAAGA AACCCCGCAA GCCCAAGGTG CCCAAGGCCC TGCCACCATT GACACAGCAA GAGCTGGCCG AGTTACCAGA ATTGCCGGAG GGGTGGAAAT GGATAAACCT GGGTAACATT TCGGAGATAT CAGGCGGCAT CACCAAGAAC CAAAAACGTC AATCATTGCC ACAAAAAAAC CCTTTCCTTC GGGTGGCCAA TGTATACGCG AACAAGCTGG AACTGGATGA CATCCACTTC ATCGGGACTA CTCCTGATGA AGCAAAAAGA GCAAAACTAA AAAAAGACGA CCTGCTTATC GTCGAGGGAA ATGGAAGCCC TGACCAAATA GGAAGAGTCG CAAAATGGGA TGGATCGATA GAGCACTGCA CACACCAAAA TCACTTGATA CGTTCAAGAT TGGCAAGCCC AATCAGCGCT GATTTTGTCC TGCATTTTCT TCTCTCGGCA ACAGGAAGAA AAGCAATTAA AAAAGTGGCT AGCTCTACAT CTGGTCTTTA CACACTCAGC CTTGCAAAAG TTGAAAAGCT TTGCATCCCT GTTTGCTCAA AAAACGAGCA GATGATGATT GTCGATCAAC TTGAGTCACG CCTCTCCCAA CTCGACCAAT TGGAGCGGAC CCTGACCGCT TCCATGAAAC AGGCCGAAGC GCTCAAGCAG TCCATCCTCA AGCGCGCCTT CGCCGGTCGA CTGGTGCCTC AGGATCCCGA CGACGAGCCG GCCAGCGAGC TGTTGGCGCG CATCCGCGCC GAGCGGGAAA GCCAGCCAAG GGCCCCTCGC AAGCTGCACA GGGAACCGAC GCCATGA
|
Protein sequence | MNSPIDEALP VHSMEKKLAN IKTPHWLWIE HNQIAEINPK KPKLDEELSV SFIPMGAVAE ESGRYTTDDS KKFEDVKKGY TYFSDGDILF AKITPCMENG KVALLSNLTN GVGFGSTEFH VSRLTEAVEK KFYFYFFVSK SFRKQAQANM AGSAGQLRVT TDYFSNVSVP LCPTREQQRI VTKIEELFSE IDSGVESLKT AQAKLKTARQ SLLKAAFEGK LTEQWRKDNA DRQESPEALL ERIQAEREAH YQQQLTDWQH QLKDWEAAGK EGKKPRKPKV PKALPPLTQQ ELAELPELPE GWKWINLGNI SEISGGITKN QKRQSLPQKN PFLRVANVYA NKLELDDIHF IGTTPDEAKR AKLKKDDLLI VEGNGSPDQI GRVAKWDGSI EHCTHQNHLI RSRLASPISA DFVLHFLLSA TGRKAIKKVA SSTSGLYTLS LAKVEKLCIP VCSKNEQMMI VDQLESRLSQ LDQLERTLTA SMKQAEALKQ SILKRAFAGR LVPQDPDDEP ASELLARIRA ERESQPRAPR KLHREPTP
|
| |