Gene Csal_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1833 
Symbol 
ID4028059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2087818 
End bp2089848 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content61% 
IMG OID637967027 
Productexcinuclease ABC subunit B 
Protein accessionYP_573884 
Protein GI92113956 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAC CCTTTCGTAT TCAGTCGCAG TATCGGCCCG CGGGTGACCA GCCAGCGGCT 
ATCGACGGCC TGATAAAAGG CTTCAATGCC GGGCTGGCGC ATCAGACCCT GCTGGGGGTG
ACCGGCTCGG GAAAGACGTT CACCATGGCC AACGTGGTCG AGCGCCTGCA GCGCCCGACC
ATCGTGATGG CGCCCAACAA GACGCTGGCC GCCCAGCTTT ATGGCGAGTT CAAGAGTTTC
TTTCCCGACA ATGCGGTGGA GTACTTCGTT TCCTATTACG ATTACTACCA GCCGGAAGCC
TATGTTCCCT CATCGGATAC CTTCATCGAG AAGGACGCCT CGATCAATGA CCATATCGAG
CAGATGCGTC TTTCGGCCAC CAAGGCACTG CTGGAGCGCC GCGATGCCTT GATCGTGGTC
TCGGTCTCGG CGATCTACGG CCTGGGCGAT CCCGATCAAT ATCTGAAGAT GCGTCTGCAC
TTCAATCGCG GCGAGTTGAT CGACCAGCGC ACCTTCCTGC GACGTCTCGC CGAGTTGCAG
TACACGCGCA ACGACATGGA CTTTCGGCGA GGGACGTACC GGGTCCGGGG CGATGTCATC
GACATCTTTC CCGCCGATGC GGAAGACGAG GCCGTCAGGG TCGAGCTGTT CGACGACGAG
ATCGAGACGA TCAGCCTGTT CGATCCTCTT ACCGGGGAGG TGCGCGACAA GGTTCCGCGC
ATGACGATCT ATCCCAAGAG TCACTACGTG ACGCCTCGCG AGACGATTCT CGCCGCCGTC
GAGGACATCA AGGTCGAGCT GGCCGATCGC CTGGCCTGGA TGCGCCAGCA TGACAAGCTC
GTCGAGGCTC AGCGGCTCGA GCAGCGCACC CTGTATGACA TCGAGATGAT GCTCGAACTC
GGGTACTGCA ACGGCATCGA GAACTACTCG CGCTATCTGT CGGGACGGGA GCCCGGCGAA
GCGCCGCCCA CGTTCTTCGA TTATCTGCCT GACGATACTC TGCTGTTCAT CGACGAGTCC
CACGTCAGCG TCCCGCAGGT GGGGGGCATG TACAAGGGCG ACCGTTCGCG CAAGGAAACT
CTGGTCGAGT ACGGCTTCCG CTTGCCTTCG GCACTGGACA ACCGTCCGAT GAAGTTCGAG
GAATGGGAGC GCATTTGCCC GCAAACCGCG TTTGTGTCAG CGACACCGGG CCCTTACGAA
GCCGAGCATG CAGGGCAGGT GGTCGAGCAG GTCGTGCGGC CGACGGGGCT GGTCGATCCC
GAAATCGAAG TCCGGCCGGC ACAAACCCAG GTCGATGACC TGCTTTCCGA GATCAAGCTG
CGAACCGACG TGGGCGAACG TGTCCTGGTC ACGACATTGA CCAAGCGCAT GGCGGAGGAT
CTCACCGAGT ATCTCGACGA GCACGACATT CGCGTGCGCT ATCTGCACTC GGACATCGAC
ACCGTCGAGC GGATCGAAAT CATACGCGAT CTGCGCCTGG GCAAGTTCGA TGTGCTCGTG
GGGATCAACC TGCTGCGCGA GGGGCTCGAT ATTCCCGAGG TCTCGTTGGT CGCGATTCTC
GATGCCGACA AGGAAGGGTT CCTGCGTGCC GAGCGGTCGT TGATCCAGAC CATAGGGCGT
GCGGCACGCA ACGCGCATGG CAAGGCCATT CTCTACGGAG ACCGTGTCAC CAACTCGATG
CAGCGTGCCA TCGACGAAAC GGAACGGCGT CGCAACAAGC AGATCGCCTT CAACGAAGAG
CACGGTATCA CGCCCACGAC GGTCACCAAG TCGGTTGCCG ATATCATGGA AGGGGCCCAG
ACGCCCGGCA AGAAAGTCGG CCGCAAGCGT CCCGACAAGC GGGTTGCCGA AGCGCCTGGC
GACTACAGCA GCGAACAGCT CAACCAGATG GATGCAGCGG GCCTCACGCG GGAAATCGGC
AAGCTGGAGG ATGCCATGCA CGAGGCGGCG CAGAACCTGG AGTTCGAAGA GGCGGCACGC
TTGCGCGATC AGTTGCAGTC GTTGAAGGCC AGGCTGATCG AACTGGGGTG A
 
Protein sequence
MSKPFRIQSQ YRPAGDQPAA IDGLIKGFNA GLAHQTLLGV TGSGKTFTMA NVVERLQRPT 
IVMAPNKTLA AQLYGEFKSF FPDNAVEYFV SYYDYYQPEA YVPSSDTFIE KDASINDHIE
QMRLSATKAL LERRDALIVV SVSAIYGLGD PDQYLKMRLH FNRGELIDQR TFLRRLAELQ
YTRNDMDFRR GTYRVRGDVI DIFPADAEDE AVRVELFDDE IETISLFDPL TGEVRDKVPR
MTIYPKSHYV TPRETILAAV EDIKVELADR LAWMRQHDKL VEAQRLEQRT LYDIEMMLEL
GYCNGIENYS RYLSGREPGE APPTFFDYLP DDTLLFIDES HVSVPQVGGM YKGDRSRKET
LVEYGFRLPS ALDNRPMKFE EWERICPQTA FVSATPGPYE AEHAGQVVEQ VVRPTGLVDP
EIEVRPAQTQ VDDLLSEIKL RTDVGERVLV TTLTKRMAED LTEYLDEHDI RVRYLHSDID
TVERIEIIRD LRLGKFDVLV GINLLREGLD IPEVSLVAIL DADKEGFLRA ERSLIQTIGR
AARNAHGKAI LYGDRVTNSM QRAIDETERR RNKQIAFNEE HGITPTTVTK SVADIMEGAQ
TPGKKVGRKR PDKRVAEAPG DYSSEQLNQM DAAGLTREIG KLEDAMHEAA QNLEFEEAAR
LRDQLQSLKA RLIELG