Gene Csal_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0844 
Symbol 
ID4027407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp943973 
End bp945814 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content67% 
IMG OID637966010 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_572900 
Protein GI92112972 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGCA TGAGCGCGCC CTCGCTTCGC CCCTACCAGC ATCAGGCCGT CGATAACGTC 
ATCGCGCATT TCCGGCGCGG CGACGATCCC GCCGTGGTGG TGCTGCCCAC CGGCAGCGGC
AAGTCGCTGG TCATCGCCGA ACTGGCACGG CTGGCACGCG GCCGGGTCCT GGTACTGGCG
CACGTTCGGG AGCTGGTCGA GCAGAACCAT GCCAAGTACC GCGCCTATGG ACTGGAGGCG
GACCTGTTCA GTGCCGGTCT GGGACGCAAG GAAAGCGAGC GTCAGGTGGT CTTCGGTTCG
GTGCAGTCCG TGGTCAGGAG CCTGGAGCGT TTCGAGGCCC ACGAAGCGTG GGGCGCGTTC
ACCCTGCTGG TCATCGACGA GTGCCACCGC ATTCCGCCGG GGGCGTCGGC GGGCAAGGTC
CGGGATGACA AGGGCGCGTC GAGCTATCAT CAGGTGATCG CGCACCTGCG GCGCCACAAC
CCGCGCCTCA AGGTGCTGGG CCTGACGGCG ACGCCGTATC GTCTGGGGCA AGGCTTCATC
TACCACCGTC ATCACCACGG CATGGTGCGC GGCGACGCGG ATTGCTTCTT TCAGGATTGC
GTGTTCGAGC AGCCACTGCG ACTGATGGTC AAGCAGGGCT ATCTGGCGCC GCCCCGACGC
GTGGATGCCG CACTGTATGC TGAGGATGGC GAGCAGACCC GGTATGACTT CTCGCGGCTG
TCGCCCGGTC GCGGCGGCGG CTTCGAGGAG GCGGAGCTGA ACCGCGTCGT GCAGGGGCAC
CGGGCGACGC CGGGGATCAT CGCCGAGGTC GTCGAGCAGG CCCGCGAGCG GCGCGGCGTG
ATGATCTTCG CGGCCACGGT GGCGCACGCG CGGGAAATCA TGGGATATCT GCCGGCGCAG
CAGGCGGCGC TGATCGTGGG CGCGACGCCG GGGCGGGAGC GCGAAGCACT GATCGAGGAC
TTCAAGGCGC AGCGGTTGAA GTATCTCGTC AACGTGGCGG TCCTCACGAC CGGTTTCGAT
GCCCCGCATG TGGATCTGAT CGCGATCCTG AGGCCCACGG AGTCCGTCAG TCTGTATCAG
CAGATCGTCG GGCGCGGCTT GCGGCTGGCG CCGGGCAAGC AGGACTGCCT GATTCTCGAT
TATGCGGGCA ACCCCTGGGA CCTGTATGCG CCGGAGGTCG GCGAGCCCAG GCCCGATTCC
GATGCCGAAC CGGTACAGGT CGAATGCCCG GCATGCGGGC ATGCCAATCT GTTCTGGGGC
AAGCGCGATG GCGAATTGGT CATCGAACAT TACGGCCGTC GCTGCCAAGG CCTGCTGGAA
GACACGGACG GGCGGCGCCG GCAGTGCGAC TTTCGTTTTC GCTTCAAGGT CTGCGATCAG
TGCGGCGCGG AAAACGATAT CGCCGCGCGT GCCTGCCATC AATGCGACGA GCGGCTGGTC
GACCCCGACG ACAAGCTCAA GGCTGCGCTG CGTCTCAAGG AAGCCAAGGT GCTGCGAGTC
TCGGGGATGC AGTTACAGGC GGTAACCAAT GGTCGCGGGC TGCCGCGCCT GAAAGTGACC
TACCATGATG AAGACGGCGC CACGCTCGAC GAGTGGTTCG CGCTGGAAAC CGCCGCACAG
CGTCGCGCCT TCACGCTGGC CTTTCTGCGC CATCACCTGC GGGCGCCGGG CAGCGACTGG
CGGCCGGATT CGCCCGAGGC GGTGATCGCC GGCGAGCGGC GTCTCAAGGC CCCGGATTTC
GTGATCGGAC GCAAGGTGGG TCGCCACTGG CAGATCCGGG ACAAGCTCTT CGACTACGCG
GGGCGCTATC GCAAGGCCGA TGATGCAGCG GACAATGGCT AG
 
Protein sequence
MSRMSAPSLR PYQHQAVDNV IAHFRRGDDP AVVVLPTGSG KSLVIAELAR LARGRVLVLA 
HVRELVEQNH AKYRAYGLEA DLFSAGLGRK ESERQVVFGS VQSVVRSLER FEAHEAWGAF
TLLVIDECHR IPPGASAGKV RDDKGASSYH QVIAHLRRHN PRLKVLGLTA TPYRLGQGFI
YHRHHHGMVR GDADCFFQDC VFEQPLRLMV KQGYLAPPRR VDAALYAEDG EQTRYDFSRL
SPGRGGGFEE AELNRVVQGH RATPGIIAEV VEQARERRGV MIFAATVAHA REIMGYLPAQ
QAALIVGATP GREREALIED FKAQRLKYLV NVAVLTTGFD APHVDLIAIL RPTESVSLYQ
QIVGRGLRLA PGKQDCLILD YAGNPWDLYA PEVGEPRPDS DAEPVQVECP ACGHANLFWG
KRDGELVIEH YGRRCQGLLE DTDGRRRQCD FRFRFKVCDQ CGAENDIAAR ACHQCDERLV
DPDDKLKAAL RLKEAKVLRV SGMQLQAVTN GRGLPRLKVT YHDEDGATLD EWFALETAAQ
RRAFTLAFLR HHLRAPGSDW RPDSPEAVIA GERRLKAPDF VIGRKVGRHW QIRDKLFDYA
GRYRKADDAA DNG