Gene Csal_1515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1515 
Symbol 
ID4029211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1723499 
End bp1724983 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content64% 
IMG OID637966698 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_573567 
Protein GI92113639 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACTT TCGAGACACA GAAGCTCTAT ATCGGTGGAC GACTCGTCGA CGCGACGTCG 
GGCGAGACCT TCGACACGAT CAACCCGGTG GATGGCAGCG TGCTGGCCAG CGTCCAGCAG
GCCGCACAGG CCGATGTCGA CCGCGCCGTC ACCTCGGCCC GCGAGGGGCA GCGCGTCTGG
GCGGCCATGA GCGGCATGGA GCGTAGCCGT ATCCTGCATC GCGCCGTCGC CCTGCTCCGC
GAGCGCAACG ACGAACTGGC GCGCCTGGAA ACGCTGGATA CCGGCAAGCC GATCAGCGAA
ACGCAAGCCG TGGACATCGT CACCGGTACC GACTCGCTGG AATACTATGC CAATCTGGCG
CCCTCCATCG AAGGCACCCA GGTCCCGCTG CGCGAAGATT CGTTCTTCTA CACGCGCCGC
GAACCGCTGG GCGTCATCGG TGCCATCGGG GCCTGGAACT ATCCCATCCA GATCGCCTGC
TGGAAGTCCG CGCCGGCGCT GGCGGCGGGC AACGCCGTGG TGTTCAAGCC CAGCGAGGTC
ACCCCGCTGA CCACCATGAA GCTGGCAGAG ATCCTGACCG AAGCCGGCCT GCCGGATGGC
GTGTTCAATG TCGTGCAGGG CGACGGACGC GTCGGTCAGA TGCTCACCAA CCATGCCGAC
ATCGACAAGA TCACCTTCAC CGGTGAAGTC GGGACCGGCA AGAAGGTCAT GGCCGCCGCC
GCGGGATCGA CGCTCAAGGA AGTCACCATG GAGCTGGGCG GCAAGTCGCC GTTGATCGTC
TTCGAGGATG CCGACCTGGA ACGTGCCGCC GACGCCGCGA TGATGGCCAA CTTCTACTCC
AGCGGCCAGG TCTGCACCAA CGGCACTCGC GTCTTCGTGC AGCGCTCGGT GCAGGCGGAC
TTCGAGGCCA AGATCAAGGA GCGTGTCGAG CGCATCAAGG CCGGCGATCC GCTGGATCCG
GCGGTCAACT TCGGTCCGCT GGTCAGCTTC GAGCATCTCG AGAAGGTCCA GAGCTATATC
GACCTGGGCA GCCAGGAAGG CGCTCGACTG CTGGTGGGCG GCGGTCGCTG GAACCAGGGC
AATGCCGCGG GCATTGATTG GTCCAAGGGT GCGTGGGCCG CGCCGACGGT CTTTACCGAT
TGCCGCGACG ACATGCGCAT CGTGCGCGAG GAAATCTTCG GACCGGTGAT GTCGATCCTG
ACCTTCGACG ATGAAGAAGA AGTGATCCGG CGCTCCAACG ACACTTCCTA CGGCCTTGCC
GCCGGCCTGT TCAGCGAAAG CCTGAACCGC GCGCATCGCG TCATTCATCG TCTGCAGGCC
GGCATCTGCT GGATCAACAC CTGGGGCGAC TCGCCGGCGG AAATGCCGGT GGGCGGCTAC
AAGGAGTCGG GCATCGGCCG TGAAAACGGT CTCTCGTCGC TCGATCAGTA CACGCAGATC
AAATCGGTAC AGATCGAAAT GGGGCCCTTC CCCGCCGTGT TCTGA
 
Protein sequence
MATFETQKLY IGGRLVDATS GETFDTINPV DGSVLASVQQ AAQADVDRAV TSAREGQRVW 
AAMSGMERSR ILHRAVALLR ERNDELARLE TLDTGKPISE TQAVDIVTGT DSLEYYANLA
PSIEGTQVPL REDSFFYTRR EPLGVIGAIG AWNYPIQIAC WKSAPALAAG NAVVFKPSEV
TPLTTMKLAE ILTEAGLPDG VFNVVQGDGR VGQMLTNHAD IDKITFTGEV GTGKKVMAAA
AGSTLKEVTM ELGGKSPLIV FEDADLERAA DAAMMANFYS SGQVCTNGTR VFVQRSVQAD
FEAKIKERVE RIKAGDPLDP AVNFGPLVSF EHLEKVQSYI DLGSQEGARL LVGGGRWNQG
NAAGIDWSKG AWAAPTVFTD CRDDMRIVRE EIFGPVMSIL TFDDEEEVIR RSNDTSYGLA
AGLFSESLNR AHRVIHRLQA GICWINTWGD SPAEMPVGGY KESGIGRENG LSSLDQYTQI
KSVQIEMGPF PAVF