Gene Csal_0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0995 
Symbol 
ID4026218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1118221 
End bp1119414 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content64% 
IMG OID637966172 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_573051 
Protein GI92113123 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR02819] formaldehyde dehydrogenase, glutathione-independent 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.171586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAG CCAACCGTGG CGTCATCTAC ATGGGCGCCG GCCATGTCGA GGTTCAGCCG 
ATTGCCTTTC CCGAGCTGGC GCTGGGGGAT CGCAAGTGCC AGCACGGCGT GATACTCAAG
GTCGTCACCA CCAACATATG CGGCAGTGAC CAGCACATGG TGCGCGGGCG TACCACCGCG
CCCGCCGGCA TGGTGCTGGG ACACGAGATC ACCGGCGAGG TCATCGAGTG CGGTCGCGAC
GTCGAGTTCG TCAGTGTCGG CGATATCGTT TCGGTTCCTT TCAACATCGC CTGTGGGCGT
TGCCGCAACT GCAAGGAAGG GCAGACGGGC ATCTGCCTCA ACGTCAATCC GGCTCGCCCG
GGTGCCGCTT ACGGATATGT CGACATGGGC GGCTGGGTCG GCGGCCAGTC CGAATACGTC
ATGGTGCCCT ATGCGGACTT CAACCTCTTG AAGTTCCCCG ATCGCGACAT CGCCATGGAC
AAGATCCGCG ACTTGACGTT GCTCTCCGAT ATCTTCCCGA CCGGTTTTCA TGGCTGCGCC
ACCGCCGGTG TCGGCCCCGG CAGCACCGTT TACATCGCCG GTGCCGGACC GGTCGGGCTG
GCTGCGGCGG TCTCCGCCCA GCTGCTGGGA GCCGCCTGCG TCATCGTCGG CGACATGAAC
CCGCAGCGTC TCGATCAGGC CAAGAGCTTC GGTTGCGAGA CCCTGGATCT GCGCCAGGAC
ACGCCCATGC CCGACATGCT GGAGCCGATC CTCGGTGAAC GCGAAGTGGA CTCGGCGGTG
GATGCGGTGG GCTTCGAGGC GCGCTGCCAT GGCCACAATC ATGCCCATGA GCAGCCGGCC
ACGGTGCTCA ATGCCTGCAT GGACGTCACC CGGGCCGGTG GCCAGATCGG GATTCCGGGA
CTTTACGTCA CCGAAGATCC CGGCGCGGAC AGTGAGGAAG CGCGGCACGG CAACCTCTCG
ATGCGCCTTG GGCAGGGCTG GGCCAAGTCC CATTCCTTCC ACACCGGTCA GTGCCCGGTA
ATGAAATACC ACCGGCAACT GATGCAGGCG ATCCTGTTCG ACAAGGTGAA CCTCGCCGAT
GCGGTCAACG TCGAGATCAT CTCGTTGGAC GATGCGCCGC GTGGGTATGC GGATTTCGAT
GGCGGCGTGG CCAAGAAATT CGTCATCGAT CCGCACGGCA GCGTGGCCGC CTGA
 
Protein sequence
MSTANRGVIY MGAGHVEVQP IAFPELALGD RKCQHGVILK VVTTNICGSD QHMVRGRTTA 
PAGMVLGHEI TGEVIECGRD VEFVSVGDIV SVPFNIACGR CRNCKEGQTG ICLNVNPARP
GAAYGYVDMG GWVGGQSEYV MVPYADFNLL KFPDRDIAMD KIRDLTLLSD IFPTGFHGCA
TAGVGPGSTV YIAGAGPVGL AAAVSAQLLG AACVIVGDMN PQRLDQAKSF GCETLDLRQD
TPMPDMLEPI LGEREVDSAV DAVGFEARCH GHNHAHEQPA TVLNACMDVT RAGGQIGIPG
LYVTEDPGAD SEEARHGNLS MRLGQGWAKS HSFHTGQCPV MKYHRQLMQA ILFDKVNLAD
AVNVEIISLD DAPRGYADFD GGVAKKFVID PHGSVAA