Gene Csal_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2540 
Symbol 
ID4026121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2847628 
End bp2848683 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content63% 
IMG OID637967747 
ProductAraC family transcriptional regulator 
Protein accessionYP_574586 
Protein GI92114658 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCGG TAGTGGCGCA TCGCCTCATG CCTGAGTCGG GCAAGCACGA GATCGAAGCC 
AAGGAATTGC GACGGCTATC GGCATCGCTG GTCACGCGAC AGGCCCTCGA TGCACCGAAC
TGGGACCGAC GCTCGGCCCT GCTCGACGGA CGCATGCAAC TCGCCGAGCT CCAGCCAGGC
ATGCAGCTGC GCTTGGCCGA TGTCAGTGAT CGTTACGATC TGGTCACCCG TGCCTTACTA
CCGGCGGGGG TCAAGATCGC CCTGGTCGTC GCCGGCGAGG CTCGCGTGAG CTATGGCGAT
CAGGCAGTGT CGCTAGGGCT GTCGGCGCCG AGCACCGGAC TGCTGGCCAG GCTTCCCGAA
GCCACTCGCT TCGCTCGACG GGGACGTATC GGCGGCCACG AGCGGACCCT GACCCTAACC
CTCACACCGG ACTGGCTGTT ACGACACGGC TACTCGATTA CCTCGAGTCA CACAGCGCAG
CTGGTCCGCT GGTCGCCATC GCCGGGACTG CTGAGACTCG CCGAACGGTT GTTCGACGAG
CGCTTCCTGT ACAGCCGGGA TGATGCCCAT CGCCTGCAAC TGAGCGGCTG TGCTATGGCG
ATGGCCGGTG AAGCATTGGC CGCTCTCGGA CACGATCGGG AGGGGCAGAA ACATGACGAG
GAAAGAGAAC ACTACCCGAC AGACCGTCGA CTGCAACGCT TGATGACGCT AGTCGAGAGC
GGCGAAGCCC ATCGTCTGGG GCAGGAAGAA CTGGCCCGGC GTCTGGGTAT GAGCCTGAGC
AGTCTGCAAC GACGATTCCT CGCCTGTTAC GGCAAACCAC TGGGACGCTT CCTTCGACGT
CGTCGCCTCG AAACAGCCTT GGCGGCACTG CGCAATGAAG CCATCAGTGT CGAAGCTGCA
GCCATTCTGG CCGGCTATAC CAACGCCGCC AACTTCGCCA CGGCCTTCAA GCGCGAATTT
GGAGCACGGC CCGGCGACCT GCGTCGAGGA CCTCAGACAA GCCAAGAAGA AACTCGCCTG
CTCAAGGTGG CATCAGGCGG TAGCGGGCGG AGTTGA
 
Protein sequence
MTAVVAHRLM PESGKHEIEA KELRRLSASL VTRQALDAPN WDRRSALLDG RMQLAELQPG 
MQLRLADVSD RYDLVTRALL PAGVKIALVV AGEARVSYGD QAVSLGLSAP STGLLARLPE
ATRFARRGRI GGHERTLTLT LTPDWLLRHG YSITSSHTAQ LVRWSPSPGL LRLAERLFDE
RFLYSRDDAH RLQLSGCAMA MAGEALAALG HDREGQKHDE EREHYPTDRR LQRLMTLVES
GEAHRLGQEE LARRLGMSLS SLQRRFLACY GKPLGRFLRR RRLETALAAL RNEAISVEAA
AILAGYTNAA NFATAFKREF GARPGDLRRG PQTSQEETRL LKVASGGSGR S