Gene Csal_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0804 
Symbol 
ID4026177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp901060 
End bp902106 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content65% 
IMG OID637965970 
Productaldo/keto reductase 
Protein accessionYP_572860 
Protein GI92112932 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.825748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTATC AACCATTCGG CAATACCGGT TTGTTCGTCT CCGAACTGTG CCTCGGCACC 
ATGACGTTCG GTGGCCAGGG CGAACTCTGG AGCCAGATCG GCGACTTGCA GCAAGGCGAT
GCCGAGCGCT TGATCGGCCG TGCGCTGGAT GCCGGCATCA ACTTCATCGA TACCGCCGAT
GTGTATTCCG AGGGCCAGTC CGAAATCATC ACCGGGCAGG CGCTCAAGAA CCTGAACGTG
CCACGCGAGG AGATCGTCGT CGCGTCCAAG GTGTTCGGCG AAACCGGGCG CGGCGGCGTG
AACGAGCGTG GCATGACGCG CTACCACATC ATGGAGGGCA TCAAGGCCAG CTTGAAGCGC
CTGCAGCTCG ATCACCTCGA CGTCTACCAG ATTCATGGCT TCGACCCCGC CACGCCCATC
GAGGAGGCGG TCCGCGCGCT CGATACGCTC GTGCAACACG GCCACGTACG TTACGTCGGG
GTGTCCAACT GGGCCGCCTG GCAGATCATG AAGGCACTCG GCATCGCCGA ACGCCTGGGC
CTGGCACGCT TCGAGTCGCT GCAGGCGTAT TACACGCTTG CCGGGCGCGA TCTCGAACGC
GAGCTGATCC CCATGCTCCA GAGCGAGAAC CTGGGGCTGA TGGTCTGGAG CCCACTCGCG
GGCGGGCTGC TCAGCGGCAA GTACACGCGC GACAATCAAG GCGAGGCCGG CAGTCGCCGA
CTCACCTTCG ATTTCCCGCC GGTCGACAAG GCCCGCGCCT TCGACTGCGT CGATGTCATG
CGCCGCATCG CCACCCGGCA CGATGTTTCC GTGGCGCAGA TCGCGCTGGC CTGGCTGCTG
CACCAGCCGG CGGTGACGAG CGTGATCGTG GGCGCCAAGC GCCTCGACCA GCTCGACGAC
AACATCGCCG CCACGCACGT TGCACTGAGC GCCGAGGATC TCGCCGAACT CGACGCGGTA
AGCGCCTTGC CCGCCGAGTA CCCTGGCTGG ATGTTCGAAC GCCAGGGCGC TTACCGTCGC
CAGCAACTGG CCGATGCCAA GCGCTGA
 
Protein sequence
MRYQPFGNTG LFVSELCLGT MTFGGQGELW SQIGDLQQGD AERLIGRALD AGINFIDTAD 
VYSEGQSEII TGQALKNLNV PREEIVVASK VFGETGRGGV NERGMTRYHI MEGIKASLKR
LQLDHLDVYQ IHGFDPATPI EEAVRALDTL VQHGHVRYVG VSNWAAWQIM KALGIAERLG
LARFESLQAY YTLAGRDLER ELIPMLQSEN LGLMVWSPLA GGLLSGKYTR DNQGEAGSRR
LTFDFPPVDK ARAFDCVDVM RRIATRHDVS VAQIALAWLL HQPAVTSVIV GAKRLDQLDD
NIAATHVALS AEDLAELDAV SALPAEYPGW MFERQGAYRR QQLADAKR