Gene Csal_0085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0085 
Symbol 
ID4026007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp104738 
End bp106663 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content64% 
IMG OID637965236 
Productputative transcriptional regulator 
Protein accessionYP_572148 
Protein GI92112220 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACGG CCGCCGCCCT GCTGGAGCAG TTGCGTTTGC TGGATGAGTC CGAGCGCGTT 
GAAGCCAAGC GGGCTCAGAA GTTCGGCAAG TCGATGCTGG AAACCATCTG CGCTTTCGCT
AACGAGCCGG GGCTGGACGG TGGTCATCTA TTGCTGGGCG TGGTCGAAGA TCAGGGCCAT
TACCGGGTCG AAGGCGTGCC CGACCCCGAT ACCCTGCTGA ATGACCTGCA TTCTGCCTGT
GCCTCGACGT TCAATGTGCC GCTACGGGTC CAGGCGGCTA CCGAGCTGGT GGAAGGCGAG
CGGCTGGTGG TCATCCAGGT GCCCGAGGCC CGGCCGGCCG ACAAGCCGGT GTATTTCGTC
AAGTCGGCCC TACCTCGGGC CGCCTGGCGG CGCGGCCCCA ACGGCGACTA CCGCTGCAAC
GAGCATGATC TCGAGGTGCT TTACCAACAG CGTTCCCAGG TTGAGTTCGA TCGCTCGGTG
CCACACGGCG CCACCCGCGA CGATATCGAC CCGGATGCCC TCGACGACTA TCGCCGCGAC
CGCCGGGCCA TGAACGCTCA GGCCGAGGAA CTCGGCTATA GCGATGACGA GCTGCTCGAA
GCTTTGGGGG CCGCCATCTG GCAGCACGGC GAGCTGAAGC CGACCCTGGC CGGCATCCTG
CTCTTCGGGC GGCGCATGGC CATTCGTCGG CTGGTGCCTG CCCATCGGGT CGACTACATC
CGCGTGACCG GCAAGGAGTG GATCGAGGAC CCCGACGAGC GCTTCACCAC CATCGACATG
CGCGATACCC TGCCGCGGCT GATCAACCGC GCTGTGGCGG CGGTGTTGGA TGACCTGCCC
ATGGCTTTTC ATCTGCCACA GGGCAGCCAG CAACGCGCGG ACCGACCGCT GATCCCCGCC
AAGGTCGTGC GCGAGGCCAT CGTCAACGCC CTGATGCACC GCAACTATCG TGCCCACCAA
CCGTTGCAGA TCATCCGCTT CAGCAACCGC ATCGAGGTGC GCAATCCCGG CTATTCGCTC
AAGCCCGAGG AACAGCTAGG CCTGCCTGGC AGCGCCTGGC GCAACCCGAC CCTGGCGACG
GTGCTGCACG AAGTCGGCTA TGCCGAAACC AAGGGCAGCG GCATCCGCGT CATGCGCCGC
CAGATGGAAC AGGCGGGGCT GACGCCGCCG GTCTTCGAGT CGGTGCGTCA TGAGGATCGC
TTCATGGCGA CCCTGCTGTT CGTGCATTTC CTCGATGATG AGGCGGTGGA ATGGCTCAAG
CATTTCCGCC ACTGGCAACT CTCCGATGAG GAGTGCCGCG CTCTGCTGTT CGTGCGCGAG
ACGGGGCGCA TCACCAATGC CGACTATCGC GACCAGAATC GCGTCGACAC CCTGGCCGCC
AGTCAGCAGT TGAGCCGACT GCGCGACCTC GGGCTACTCC ACCAGGTTCC CAAGGGCGCC
GAGACCTATT ATCTGCCCGG TGAACATTTT CCCATGCAGG GCAGCCCGGC CATGGAACTG
CTCTCTGTAT TGGACCGCGA CCTGGCTGCC GAACAGGAGA ACTTACCTCA GGAGTCAGGC
GGCTTACCTC AGGAGTCAGG CGGCTTACCT CAGGAGTCAG GCGGCTTACC TCAGGAGCCG
GAAGGCCAGT CACGGGAGTC GCTTATCGCC GAGCTTCCCG GCTGGCTGGC GCTACGGCTC
GAGGCCATCG GCCAGCGTTC CCGAGACAAG CGTCGGGTAC GAGAGCTGCT GCGGGCGCTG
TGTGCCGAGC GCCCCTATCG GGCGGCCGAA CTCTCCCGGC TGTTGAAGCG CAACCAGGAA
TACCTTCAAA AAGAATATAT TACGCCGATG CGCCAGGCAG GAGAACTGGC GTATCAGTAT
CAGGACGACC CCAACCGTCC TGATCAAGCC TACGTATCGC CTACCGCCAA TAGAGAAGCC
GAATGA
 
Protein sequence
MTTAAALLEQ LRLLDESERV EAKRAQKFGK SMLETICAFA NEPGLDGGHL LLGVVEDQGH 
YRVEGVPDPD TLLNDLHSAC ASTFNVPLRV QAATELVEGE RLVVIQVPEA RPADKPVYFV
KSALPRAAWR RGPNGDYRCN EHDLEVLYQQ RSQVEFDRSV PHGATRDDID PDALDDYRRD
RRAMNAQAEE LGYSDDELLE ALGAAIWQHG ELKPTLAGIL LFGRRMAIRR LVPAHRVDYI
RVTGKEWIED PDERFTTIDM RDTLPRLINR AVAAVLDDLP MAFHLPQGSQ QRADRPLIPA
KVVREAIVNA LMHRNYRAHQ PLQIIRFSNR IEVRNPGYSL KPEEQLGLPG SAWRNPTLAT
VLHEVGYAET KGSGIRVMRR QMEQAGLTPP VFESVRHEDR FMATLLFVHF LDDEAVEWLK
HFRHWQLSDE ECRALLFVRE TGRITNADYR DQNRVDTLAA SQQLSRLRDL GLLHQVPKGA
ETYYLPGEHF PMQGSPAMEL LSVLDRDLAA EQENLPQESG GLPQESGGLP QESGGLPQEP
EGQSRESLIA ELPGWLALRL EAIGQRSRDK RRVRELLRAL CAERPYRAAE LSRLLKRNQE
YLQKEYITPM RQAGELAYQY QDDPNRPDQA YVSPTANREA E