Gene Csal_0157 details


Gene Information

Locus tag: Csal_0157
Symbol:
ID: 4027298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 178356
End bp: 179816
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 63%
IMG OID: 637965308
Product: GntR family transcriptional regulator
Protein accession: YP_572220
Protein GI: 92112292
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
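
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an inference from the values listed, not something the record states) and that the final codon is a stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length = gene_length(178356, 179816)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

With the values from this record, the two results agree with the listed Gene Length (1461 bp) and Protein Length (486 aa).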


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGACC ATTACTTTCA TCTCGATTTC GATGCTCAGC GAGGATTGCA GGAGCAACTG 
CGCGAAGCGC TGCTCGATGC CATTCACACC GGCACCATCC CGGCGGATGA GGCATTGCCG
TCCTGTCGCC GCTTGTCCCT GCAACTGGGG ATCTCGCGCA ATACCGTGGC GCTGGTGTAC
GAGGGGCTGG TCGAGGATGG TTATCTGGTC AGTCGTCCAC GCAGCGGGTA TTACCTGCAT
GACGCCTACC ACGACAGCGA TGCCCTGGAA CTGGCGCAGA CCACGCGCCA GCCGGTGCGC
GACGCACCAT CATGGAGCAA GCGCTTCACG CATCGCCCCA GTCACTTGCA GGGCGTGTTG
CGTCCCAGCA ACTGGATGGA TTACGAATTT CCCTTCGTAT ATGGGCAACT GGATACGCGG
TTGTTTCCCC TCGAACAATG GCGCGAGACC TCGCGACGCC TGCTGGGCGG GAATCGCGAG
CGTCACTGGC TCAGCGATCG CTACGATCAG GACGACCCTA TGCTGATCGA GCAATTGCGC
ACGCGGGTGC TGCCCAAGCG CGGCATTCAT GCGCAGAGCG ATGAGATTTT GGTGACCCTG
GGATCGCAGA ATGCGCTGTT TCTGATCGCG CAGCTGCTGT TCGACCGGAA GACGCGAGTC
GCCGTCGAGA ACCCCGGGTA CCGCGAGGCG GTCAATGTCT TCCTGCGCCA GGGCGCCCAG
CTTCACTACC AGCACGTGGA CGGCGAGGGC ATGCAGCTCG ATGCCGAGAC TCCACGCTGT
GACTATCTCT ACGTGACACC TAGCCATCAG GGACCGACCG GCGCCACCAT GAGCCGCGAG
CGTCGCGAGC AGTTGATTAA ACAGATCCAG CGCTTCGATC AAATCGTGCT CGAGGACGAC
TATGACGCAG AGGTGAACTT CGACCGTCAT GCGCAACCGG CGCTGAAAGC CAGCCGGGCC
GGCGGGCGGG TGATCTACAT GAGTAGCCTG TCCAAGCCGC TCTCACCGGG GCTACGCTTG
GGCTATCTCG TGGCCGACGC CGAGTTGATC GACGAGCTGC GCGCGCTGCG GCGGTTGATG
TATCGCCACC CGCCGTCGAG CTTGCAACAA CAGCTGGCGC AGTTTCTCGC CCAGGGACAT
TACGACCGCT ATCTGCGGCA GTTCGCCGAC GAGATGCGCC ATCGCTGGGA GTGCATGAAC
GATGCCATTG CCAGGCACCT GCCCGGTTGC CGGCGCGTCG GCGGCGATCA TGCCAGCGTG
TTTTGGCTGG AAGCCCCGGA GAATGTGGAC ACCCAGCGCT TGGCCTGGCA GGCCGCGCAG
AACGGGGTAC TTATCGAGCC GGGCTTTCAG CATTTCTTCG AGCGCGTGCC GCCGCGCAAC
TTCATGCGCC TGGGCTTCGG CGCCATCGAG GGAGAGCGCA TCGCACCGGG CATCGAGCGC
CTGGCACAAA CGCTGGGGTA G
 
Protein sequence
MLDHYFHLDF DAQRGLQEQL REALLDAIHT GTIPADEALP SCRRLSLQLG ISRNTVALVY 
EGLVEDGYLV SRPRSGYYLH DAYHDSDALE LAQTTRQPVR DAPSWSKRFT HRPSHLQGVL
RPSNWMDYEF PFVYGQLDTR LFPLEQWRET SRRLLGGNRE RHWLSDRYDQ DDPMLIEQLR
TRVLPKRGIH AQSDEILVTL GSQNALFLIA QLLFDRKTRV AVENPGYREA VNVFLRQGAQ
LHYQHVDGEG MQLDAETPRC DYLYVTPSHQ GPTGATMSRE RREQLIKQIQ RFDQIVLEDD
YDAEVNFDRH AQPALKASRA GGRVIYMSSL SKPLSPGLRL GYLVADAELI DELRALRRLM
YRHPPSSLQQ QLAQFLAQGH YDRYLRQFAD EMRHRWECMN DAIARHLPGC RRVGGDHASV
FWLEAPENVD TQRLAWQAAQ NGVLIEPGFQ HFFERVPPRN FMRLGFGAIE GERIAPGIER
LAQTLG
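
The protein sequence above follows from the gene sequence under translation table 11. The following is a minimal sketch of that translation in plain Python, not the database's own pipeline. It uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code; the alternative start codons that table 11 allows are not handled, since this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record
    can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGCTCGACC ATTACTTTCA TCTCGATTTC"))
print(translate("CTGGCACAAA CGCTGGGGTA G"))
```

The two fragments translate to the first ten and last six residues of the protein sequence listed in this record.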