Gene Csal_3237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3237 
Symbol 
ID4028571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3606708 
End bp3607676 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content63% 
IMG OID637968452 
ProductAraC family transcriptional regulator 
Protein accessionYP_575280 
Protein GI92115352 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTGTCG TGCAGCCTGT TTTCGAGACA CCGGATGATG AAATCGTACA ATTGGCGGAT 
ACTCTTTCTC GCCGGATTCA GAAATGGGCG CCGGAAGAAG GGCTGACACC GACGGCGGTG
CCGGGGTTGG AACTGGTGCG CGCCAATTCC TCGTTGACCA ACTGTATCGG CTCGACGGTC
TACGACCCTT CGCTGTGTCT GATCGCGCAG GGCAGCAAGC GCATATGGCT GGGAGACCGG
GAAATCGATT ATGGGCCGCT GAGCTGCATG GTGTCGGCGG TGCACCTGCC GGTACTGGGC
AAGATCACCG AGGCTTCGGC GGAGCGGCCC TACCTGGGGT TGAAGCTCGC CGTCGATGCC
CAGGAAGTCA CCGATCTGGT ACTGGAGCTG GGTGAGGGGC TGAGCGAGAT GGAGGAGCGG
GGATGCGCCG AGACCGCTTG CGGCCTCGGG CGCGTGCAGG CCGAGAAGGG GCTGGTGGAA
GCCATGCTGC GATTGGTGAG CCTGCTCGAT TCCCCCCAGG ACATCCGCAT TCTCGCACCG
CTGGTGCGGC GCGAGATCTT CTATCGTGCG CTGGTCGGCG AGATCGGCCT GCACATGCGC
AAGTTCGCGG TGGCCGATAC CCAGACCCAT CGCATCTCGA AGGTGATCGC GGTACTCAAG
GATCGCTTCA CCGAGCCGCT GCGCGTTCGC GAGCTGGCGG ACATGGTGAA CATGAGCGAG
TCGTCACTGT TTCACAGCTT CAAGCAGGTC ACCCGGATGT CGCCGGTGCA GTTCCAGAAA
AAGCTGCGAC TGCATGAAGC GCGCAGGCTG ATGCTGGCTG AAGGGATGGA GGCGGCCACC
GCCAGTTTCC GCGTGGGGTA CGGAAGTCCA TCCCATTTCA GCCGCGAGTA CAGCCGCCTT
TTCGGCGTGC CGCCCCGCAC GGACGTGAGC AAGCTACGCG GCGAGCTGCC GCAGACCGCC
CGCGCCTGA
 
Protein sequence
MSVVQPVFET PDDEIVQLAD TLSRRIQKWA PEEGLTPTAV PGLELVRANS SLTNCIGSTV 
YDPSLCLIAQ GSKRIWLGDR EIDYGPLSCM VSAVHLPVLG KITEASAERP YLGLKLAVDA
QEVTDLVLEL GEGLSEMEER GCAETACGLG RVQAEKGLVE AMLRLVSLLD SPQDIRILAP
LVRREIFYRA LVGEIGLHMR KFAVADTQTH RISKVIAVLK DRFTEPLRVR ELADMVNMSE
SSLFHSFKQV TRMSPVQFQK KLRLHEARRL MLAEGMEAAT ASFRVGYGSP SHFSREYSRL
FGVPPRTDVS KLRGELPQTA RA