Gene Csal_0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0541 
Symbol 
ID4027680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp599676 
End bp600701 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content69% 
IMG OID637965709 
Producttranscriptional regulator 
Protein accessionYP_572602 
Protein GI92112674 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.514345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTCG CCATGAGTGA GCTCTCGACC ACCGACTGGC CCTATCCACG CCCCGAGCAA 
CCACCGGTCC CTCGGCCCGG AGAAGCGGCG CATACCGTGG ATGTCTGGGT CGTGCCCAAT
TTCTCGATGC TGACATTGTT CTGCCTGCTC GAACCGCTGC GCGTGGCCAA TCGGTTCGGG
CGCACGCTCT TCGCATGGCG GTTGTTGTCG GCGGACGGCG AGGCGGTGGT GGCCAGCAAC
GGGGTGCGCA TCGAGGTGGA CGGCCCCCTG GATGCGCATT GCCACGGCGA GCTGCTGATG
GTGATCTCGT CCTATCAGCC GGAAGTCAGC GTGACCAGCG CCGAGCGCGC GGTGCTGCGC
CACGTCGCGG CCCATGGCGG GCGTGTCGGC GGCCTCGATA CGGCGCCTTT CATCCTGGCG
CGGGCAGGGC TGCTCGATCA TCATCGCGTG GCACTGCACT GGGAGAGCGT GCCGGCGTTC
CGGGCGGAGT TTCCCCACAT CGCCGTCAGC GAGGCGCGCT TCGAGTTCGC CGGACAGCGC
CTGACCGGCT CGGGCGGGGC GGCAGGCATC GACATGATGC TGCAATGGAT CGAGCACGAT
TACGGGCCGG CGCTGGCCAA TGCCGTGGGA CGTCAGCTGG TGCACCAGCG CGTCTCCGAA
GAAGAGGCCT GGAAGGCCGG CGCCGCGCGT ACCCTGGAGG ACTTGCCGCG TAGCGTGGTG
CGGGCGCTGG CGGTGATGGA GGCCCACCTC GACCAGGTAC TGCCGATGCC CGAGATCTGC
CGCCTGGCGG GACAGTCGCA GCGTCAGCTG ACGCGCCTCT TCGTGGCACA TTTCGGAGAA
ACCCCCAAGC AGTGCTACCT GGGCATGCGC CTGGACTATG CGCGGCGCCT GCTCGCCGAT
AGCCGCTGCC GTGTCACCGA GGCCGCGCTG AGCAGCGGCT TCGTGCATCT GGCCCACTTC
TCGCGCGCCT ACCAGGCCCG CTTCGGCGAG CGCCCGAGCG TGACGGCCGA GCGCGGTACC
GTGTGA
 
Protein sequence
MIFAMSELST TDWPYPRPEQ PPVPRPGEAA HTVDVWVVPN FSMLTLFCLL EPLRVANRFG 
RTLFAWRLLS ADGEAVVASN GVRIEVDGPL DAHCHGELLM VISSYQPEVS VTSAERAVLR
HVAAHGGRVG GLDTAPFILA RAGLLDHHRV ALHWESVPAF RAEFPHIAVS EARFEFAGQR
LTGSGGAAGI DMMLQWIEHD YGPALANAVG RQLVHQRVSE EEAWKAGAAR TLEDLPRSVV
RALAVMEAHL DQVLPMPEIC RLAGQSQRQL TRLFVAHFGE TPKQCYLGMR LDYARRLLAD
SRCRVTEAAL SSGFVHLAHF SRAYQARFGE RPSVTAERGT V