Gene Information |
Locus tag | Csal_0087 |
Symbol | |
ID | 4026009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 108276 |
End bp | 111053 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637965238 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_572150 |
Protein GI | 92112222 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.517588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACTC TCACGCCAGA GCAGCAGGCA CGCCGGCGCA TAGATCGACA GCTTGAAGAG GCTGGCTGGA TTGTGCAGTC GATGAGCGAA CTCAACCCCG CAGCCGGACG AGGCGTCGCC GTCCGCGAAT ACCCGACCGA CAGCGGACCC ATGGATTATC TGTTGATGGT GGATCACCAG GCCGTGGGCG TCATCGAGGC CAAGCGTGAT GACGAAGGTC ATCGACTGAA CGCGGTGGAG CTCCAGTCGG CGCGCTACGC AACCTCCAGC CTCAAGCACC TGGGCCAGGT AGCACTGCGC TTCGTCCATG AAGCGACCGG GGAAATCACG CGCTATACGG ATAGCCGCGA TCCCAAGCCT CGAGCGCGAG AGGTATTTCA ATTCCCGCGC CCCGATACCC TGGCGCAACG ACTCAAAGAA CCGGAAACCC TGCGCGCTCG CTTGACGCAT GTCCCGACTC TGCAACCTCA TGGCCTGCGC GACTGCCAAC TCCACGCCAT TACCAACCTG GAGCAGTCGT TGGGTGAGAA CCGGCCACGC GCGCTGATCC AGATGGCCAC CGGTGCCGGC AAGACCTTTA CCGCCATTAC CTCTATCTAT CGGCTACTCA AGTTCTCCCA GGCCAAACGG GTGTTGTTCC TGGTGGATAC CAAGAACCTC GGCGAGCAGG CCGAAGGGGA GTTTCACCAG TACATCCCTC AGGATGACAA CCGCAAGTTC ACCGAGCTCT ATACCGTCCG GCGCCTGACC AGCAGCCATA TCCCCGATGA TGCCCAGGTC ATCATCTGCA CTATTCAGCG CCTGTACTCG ATCCTCAAGG GCGAACCGCT CGACGAGGCG ACGGAGGAAA CGGTGCCGGA TGGCGCCGCC ATCCTGCGCA AGCCGCCACT GCCGGTAATC TATAATGCGC GCATCCCGCC GGAATTCTTC GAATTCATGG TCATCGACGA GTGCCATCGC TCCATCTACA ACCTGTGGCG TCAGGTACTG GAATACTTCG ATAGCTTCCT GATCGGCCTC ACCGCGACCC CGGACAACCG GACCTACGGC TTCTTCCAGC AGAACGTAGT CAGCGAATAT ACCCTCGAGC AATCCGTGGC GGATGGCGTC AATGTCGATC ACCGTGTCTG GCGCATCGAT ACCGAGATCA CCCAAGCCGG CGGCCGCCTC GAGGCAGAGG AGTTGGTAGA AAAACGCGAA CGCCTAAGCC GCAAGAAGCG CTGGGAACAG CTCGACCAAG AGATCGATTA CCAAGGCACC CAGCTCGATC GCTCCGTAGT CAACCCTTCC CAGATCCGTA CCGTGATTCA GGCCTTTCGC GATGGCCTGC CGCAGATGTT CCCGGACCGG GTCGACGACG ACGGCGCCTT TCAGGTGCCC AAGACCCTGA TCTTCGCCAA GACCGACAGC CATGCCGACG ACATCATCCG CATCGTTCGC GAGGAGTTCG GCGAGGGCAA CGAATTCTGC AAGAAGATCA CCTACCGCAT CGACGAAGAC CCCAAGAGTG TGCTGGCCAG CTTCCGCAAT GACTACCATC CGCGCATCGC CGTGACCGTG GACATGATCG CCACCGGCAC CGACGTCAAG CCGCTGGAAT GTCTGCTGTT TATGCGCGAT GTGAAGAGTC GCAACTACTA CATGCAGATG GTCGGGCGCG GCACCCGCAG CCTCGACGCG GACGGGCTGG CAGTAGTCAG CAACGCCGCG CGTGGCCCCA AGCAGGAATT CATCCTGGTC GATGCGATCG GCGTCAAAGA TTCCCAGAAG AAGGAAACCG GCAGCCTCGA AAAAACGCCC 
AGCTATTCCC AGAAAGATCT AATGAACGCA GTGACGATGG GTGTGCGTGA CGAGGAGGTC CTGAGCTCCC TGTCCGGGCG TTTCGCCCGA TTCGCCAAGC GCCTCGACGC CGAGCAGCAG CGCCAGGTCA AGGACCTCAC CGGCGGCGCC AGCCTCAACG ACCTGGCCGG CCAGCTACTG AAGGTGGACG ACCCCGATGC CATCGACGCC CAGACCCGGA AACACTTCAA CCTGACGCCC AACGAAACGA TCACCAGCAC CCAATGGGAT GAAGTACGCA AGCAGCGTGC GCAACAGGCC TGCGCACCGA TCACCGGCCC GTTGAACACC CTGCTCGAAG ACATTCGCGA GCGTCAGGAG CAGACGCTGG ATACCGTGAA TATCGATACC GTCACCCACA GCGGCTGGGA CCGCGATCTC GACACCCAGC GCGCCCACCT GCGGCAGGAA TTCGAGCAGT GGCTGCAAAG CCATCGCGAT GACATCACCG CCCTGACCAT TTACTATGCC CAGCCTCAGC GTCGGGCCGA GATCACTCAA CAAACCATCG CGGCACTCTT GGAGTCGCTT AAGCAGGAGA GACCCAGGCT GGCCCCGGTC AGGATATGGG ACGCCTACGC CGCTCTTGAC GACGTACAGG TAAAGCGCCC GGAACAAGAG CTCGCACAGA TCGTCGCCCT GGTGCGCCGC GCCTGCGGCT GGGACGACAA ACTCACGCCC TATGCCGACA CCGTGCGCGC AAATTTCAAG AAGTGGATCT TCGCCAGGCA TGCCGGCAAC CAGACCAAGT TCAGCCCCGA GCAGCAGGCC TGGCTGGAAA TGATCCGCGA CCATATCGCC GTATGCCTGC ACCTCGAAGT CGATGACCTC GACTACATTC CCTTCAATGA AGAAGGCGGC GCGGGGAAGA TGTATCAGCT ATTTGGTGAC GAGATGGACC GGGTGATCGA TGAAATCAAT GATGCATTGG CAGCATAA
|
Protein sequence | MTTLTPEQQA RRRIDRQLEE AGWIVQSMSE LNPAAGRGVA VREYPTDSGP MDYLLMVDHQ AVGVIEAKRD DEGHRLNAVE LQSARYATSS LKHLGQVALR FVHEATGEIT RYTDSRDPKP RAREVFQFPR PDTLAQRLKE PETLRARLTH VPTLQPHGLR DCQLHAITNL EQSLGENRPR ALIQMATGAG KTFTAITSIY RLLKFSQAKR VLFLVDTKNL GEQAEGEFHQ YIPQDDNRKF TELYTVRRLT SSHIPDDAQV IICTIQRLYS ILKGEPLDEA TEETVPDGAA ILRKPPLPVI YNARIPPEFF EFMVIDECHR SIYNLWRQVL EYFDSFLIGL TATPDNRTYG FFQQNVVSEY TLEQSVADGV NVDHRVWRID TEITQAGGRL EAEELVEKRE RLSRKKRWEQ LDQEIDYQGT QLDRSVVNPS QIRTVIQAFR DGLPQMFPDR VDDDGAFQVP KTLIFAKTDS HADDIIRIVR EEFGEGNEFC KKITYRIDED PKSVLASFRN DYHPRIAVTV DMIATGTDVK PLECLLFMRD VKSRNYYMQM VGRGTRSLDA DGLAVVSNAA RGPKQEFILV DAIGVKDSQK KETGSLEKTP SYSQKDLMNA VTMGVRDEEV LSSLSGRFAR FAKRLDAEQQ RQVKDLTGGA SLNDLAGQLL KVDDPDAIDA QTRKHFNLTP NETITSTQWD EVRKQRAQQA CAPITGPLNT LLEDIRERQE QTLDTVNIDT VTHSGWDRDL DTQRAHLRQE FEQWLQSHRD DITALTIYYA QPQRRAEITQ QTIAALLESL KQERPRLAPV RIWDAYAALD DVQVKRPEQE LAQIVALVRR ACGWDDKLTP YADTVRANFK KWIFARHAGN QTKFSPEQQA WLEMIRDHIA VCLHLEVDDL DYIPFNEEGG AGKMYQLFGD EMDRVIDEIN DALAA
|
| |
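The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. Both assumptions are consistent with the values listed above, but neither is stated in the record itself.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Csal_0087
length = gene_length(108276, 111053)
print(length)                  # 2778
print(protein_length(length))  # 925
```

Under these assumptions the computed values match the reported Gene Length (2778 bp) and Protein Length (925 aa). The listed gene sequence ends in TAA, which is a stop codon in translation table 11.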