Gene Csal_2133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2133 
SymboluvrC 
ID4026470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2400940 
End bp2402754 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content64% 
IMG OID637967335 
Productexcinuclease ABC subunit C 
Protein accessionYP_574183 
Protein GI92114255 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00985622 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTCG ATGCCAAGCA CTTCCTCAAG ACGGTTTCCG AGTCACCCGG CGTGTATCGC 
ATGCTGGATG CCGAAGGCAA CGTGCTGTAC GTCGGCAAGG CCAAACGGCT GAAGGCCCGC
CTCGCCAGCT ACTTCCGTGG TGCGCTGAAT GCCAAGACCC AGGCCCTGGT CAGCCGGATC
GAGGACATCC AGGTCACCAT CACGCGGACC GAGACAGAGG CCCTGCTGCT CGAGCAGACG
CTCATCAAGG AACAGCGTCC GCCGTACAAC ATCCTCCTGC GTGACGATAA ATCGTACCCG
TTCATCTTCG TCTCGGATCG GCACCCCTAC CCCGCCCTCG AATTCAAGCG CGCGCGGCAG
AAACGTGGCG ACGGCCGTTA CCTGGGGCCG TTTCCGAGTA CCACCGCGGT GCGTGAAAGC
CTGTCGTTGA TGCAGAAGAT CTTTCGCATC CGCAACTGCG AGGACAGTGT CTTCGCGCAC
CGCACCCGGC CCTGCCTGCA ATATCAAATT CAGCGCTGCA GCGCTCCCTG TGTCGACTAC
ATCGGTCGCG AGGAATACCA GCGCGATATC GACCACGCCA TCATGTGCCT GGAAGGGCGT
AGCGAGCAGG TCACCGCGCA ACTGACCCGC GACATGGAAA CCGCCAGTCA GGCGCTGGAT
TTCGAGGAAG CCGCGCGCCT GCGCGACCAG ATCCAGCAAC TTCGCCGCCT GCAGGAACGC
CAAATCGTCG ATACTGGGGA CGGCGATGCC GACATCTTCG CTCTGGCCGA GCGCCCCGGC
GGCCTCTGCA TCAGCGCACT GGCCGTGCGT GGCGGCCGCA TGCTCGGCGC TCGCCACCAT
ATGCCGCAGA ACGGCCTGGA CCTCACGGCG GAGGCGCTTC TCGGCGAGTT CATCAGCCAT
TACTATCTGG GACATGAGCG CGAGATTCCC GCGGAAGTCA TCACGGCTCT GCCCCTGGCG
GACAGCGACG TCATTCAAGC CGCCCTCTCC GAGCGTGCCG GCAAGCGGAT TCGCCTGGCC
CATCAGGTTC GCGGCCACCG AGCCCAATGG CTACGCCTGG CGGAAACCAA TGCCGAGCAA
CACCTCACCA CGCAACTGGC CAATCGTTCG CAGCTGGCCA AGCGTTTCGC CTCCCTGCGT
GACGCCATGC AGCTGCAGGA GACACCGACA CGCCTGGAAT GCTTCGACAT CAGCCATAGC
CATGGCGAAG CGACGGTCGC CTCCTGCGTG GTGTTCGACC AGGACGGGCC GGTCAAATCC
GATTACCGAC GTTTCAACAT CGAGGGCGTG GCAGCGGGGG ACGACTACGC GGCCATGCGG
CAAGCCCTGA CTCGACGCTA CAAACGCTTG ACTCAGGACG ACAGCAAGCT TCCCGGCATC
CTGATCGTCG ATGGCGGCAA GGGACAGCTC AACATGGCAC GCGAGGTGCT CGAGGACGTC
GGCATCACCG GAACGCATCT GCTCGGCGTG GCCAAGGGCA CGACACGCAA GCCCGGACTC
GAGACGCTGT TTCTGGAAAC CGTCGACAAC AGTCTGGCAC TGGACAGCGC ATCGCCGGGG
CTGCACCTCA TTCAGCATAT TCGCGACGAA GCCCACCGTT TCGCGATCAC CGGTCATCGT
CAGCAACGCG ACAAGCAACG CCGTACTTCG ACCCTGCAGG ACATTCCCGG GATCGGGCCC
AAGCGTCGCC GGGAGTTGCT GCGCTTCTTC GGTGGGCTCC AGGGCGTGCG CCAGGCCAGT
CGCGACGAGC TGGCGCGTGT GCCCGGCATC AGCGCCCAAA TGGCCACGAC CATCCATCAA
GCCTTGCATG GATGA
 
Protein sequence
MSFDAKHFLK TVSESPGVYR MLDAEGNVLY VGKAKRLKAR LASYFRGALN AKTQALVSRI 
EDIQVTITRT ETEALLLEQT LIKEQRPPYN ILLRDDKSYP FIFVSDRHPY PALEFKRARQ
KRGDGRYLGP FPSTTAVRES LSLMQKIFRI RNCEDSVFAH RTRPCLQYQI QRCSAPCVDY
IGREEYQRDI DHAIMCLEGR SEQVTAQLTR DMETASQALD FEEAARLRDQ IQQLRRLQER
QIVDTGDGDA DIFALAERPG GLCISALAVR GGRMLGARHH MPQNGLDLTA EALLGEFISH
YYLGHEREIP AEVITALPLA DSDVIQAALS ERAGKRIRLA HQVRGHRAQW LRLAETNAEQ
HLTTQLANRS QLAKRFASLR DAMQLQETPT RLECFDISHS HGEATVASCV VFDQDGPVKS
DYRRFNIEGV AAGDDYAAMR QALTRRYKRL TQDDSKLPGI LIVDGGKGQL NMAREVLEDV
GITGTHLLGV AKGTTRKPGL ETLFLETVDN SLALDSASPG LHLIQHIRDE AHRFAITGHR
QQRDKQRRTS TLQDIPGIGP KRRRELLRFF GGLQGVRQAS RDELARVPGI SAQMATTIHQ
ALHG