Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0844 |
Symbol | |
ID | 4027407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 943973 |
End bp | 945814 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637966010 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_572900 |
Protein GI | 92112972 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGCA TGAGCGCGCC CTCGCTTCGC CCCTACCAGC ATCAGGCCGT CGATAACGTC ATCGCGCATT TCCGGCGCGG CGACGATCCC GCCGTGGTGG TGCTGCCCAC CGGCAGCGGC AAGTCGCTGG TCATCGCCGA ACTGGCACGG CTGGCACGCG GCCGGGTCCT GGTACTGGCG CACGTTCGGG AGCTGGTCGA GCAGAACCAT GCCAAGTACC GCGCCTATGG ACTGGAGGCG GACCTGTTCA GTGCCGGTCT GGGACGCAAG GAAAGCGAGC GTCAGGTGGT CTTCGGTTCG GTGCAGTCCG TGGTCAGGAG CCTGGAGCGT TTCGAGGCCC ACGAAGCGTG GGGCGCGTTC ACCCTGCTGG TCATCGACGA GTGCCACCGC ATTCCGCCGG GGGCGTCGGC GGGCAAGGTC CGGGATGACA AGGGCGCGTC GAGCTATCAT CAGGTGATCG CGCACCTGCG GCGCCACAAC CCGCGCCTCA AGGTGCTGGG CCTGACGGCG ACGCCGTATC GTCTGGGGCA AGGCTTCATC TACCACCGTC ATCACCACGG CATGGTGCGC GGCGACGCGG ATTGCTTCTT TCAGGATTGC GTGTTCGAGC AGCCACTGCG ACTGATGGTC AAGCAGGGCT ATCTGGCGCC GCCCCGACGC GTGGATGCCG CACTGTATGC TGAGGATGGC GAGCAGACCC GGTATGACTT CTCGCGGCTG TCGCCCGGTC GCGGCGGCGG CTTCGAGGAG GCGGAGCTGA ACCGCGTCGT GCAGGGGCAC CGGGCGACGC CGGGGATCAT CGCCGAGGTC GTCGAGCAGG CCCGCGAGCG GCGCGGCGTG ATGATCTTCG CGGCCACGGT GGCGCACGCG CGGGAAATCA TGGGATATCT GCCGGCGCAG CAGGCGGCGC TGATCGTGGG CGCGACGCCG GGGCGGGAGC GCGAAGCACT GATCGAGGAC TTCAAGGCGC AGCGGTTGAA GTATCTCGTC AACGTGGCGG TCCTCACGAC CGGTTTCGAT GCCCCGCATG TGGATCTGAT CGCGATCCTG AGGCCCACGG AGTCCGTCAG TCTGTATCAG CAGATCGTCG GGCGCGGCTT GCGGCTGGCG CCGGGCAAGC AGGACTGCCT GATTCTCGAT TATGCGGGCA ACCCCTGGGA CCTGTATGCG CCGGAGGTCG GCGAGCCCAG GCCCGATTCC GATGCCGAAC CGGTACAGGT CGAATGCCCG GCATGCGGGC ATGCCAATCT GTTCTGGGGC AAGCGCGATG GCGAATTGGT CATCGAACAT TACGGCCGTC GCTGCCAAGG CCTGCTGGAA GACACGGACG GGCGGCGCCG GCAGTGCGAC TTTCGTTTTC GCTTCAAGGT CTGCGATCAG TGCGGCGCGG AAAACGATAT CGCCGCGCGT GCCTGCCATC AATGCGACGA GCGGCTGGTC GACCCCGACG ACAAGCTCAA GGCTGCGCTG CGTCTCAAGG AAGCCAAGGT GCTGCGAGTC TCGGGGATGC AGTTACAGGC GGTAACCAAT GGTCGCGGGC TGCCGCGCCT GAAAGTGACC TACCATGATG AAGACGGCGC CACGCTCGAC GAGTGGTTCG CGCTGGAAAC CGCCGCACAG CGTCGCGCCT TCACGCTGGC CTTTCTGCGC CATCACCTGC GGGCGCCGGG CAGCGACTGG CGGCCGGATT CGCCCGAGGC GGTGATCGCC GGCGAGCGGC GTCTCAAGGC CCCGGATTTC GTGATCGGAC GCAAGGTGGG TCGCCACTGG CAGATCCGGG ACAAGCTCTT CGACTACGCG GGGCGCTATC GCAAGGCCGA TGATGCAGCG GACAATGGCT AG
|
Protein sequence | MSRMSAPSLR PYQHQAVDNV IAHFRRGDDP AVVVLPTGSG KSLVIAELAR LARGRVLVLA HVRELVEQNH AKYRAYGLEA DLFSAGLGRK ESERQVVFGS VQSVVRSLER FEAHEAWGAF TLLVIDECHR IPPGASAGKV RDDKGASSYH QVIAHLRRHN PRLKVLGLTA TPYRLGQGFI YHRHHHGMVR GDADCFFQDC VFEQPLRLMV KQGYLAPPRR VDAALYAEDG EQTRYDFSRL SPGRGGGFEE AELNRVVQGH RATPGIIAEV VEQARERRGV MIFAATVAHA REIMGYLPAQ QAALIVGATP GREREALIED FKAQRLKYLV NVAVLTTGFD APHVDLIAIL RPTESVSLYQ QIVGRGLRLA PGKQDCLILD YAGNPWDLYA PEVGEPRPDS DAEPVQVECP ACGHANLFWG KRDGELVIEH YGRRCQGLLE DTDGRRRQCD FRFRFKVCDQ CGAENDIAAR ACHQCDERLV DPDDKLKAAL RLKEAKVLRV SGMQLQAVTN GRGLPRLKVT YHDEDGATLD EWFALETAAQ RRAFTLAFLR HHLRAPGSDW RPDSPEAVIA GERRLKAPDF VIGRKVGRHW QIRDKLFDYA GRYRKADDAA DNG
|
| |