Gene Csal_1187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1187 
Symbol 
ID4026998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1361382 
End bp1362911 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content65% 
IMG OID637966364 
Productaldehyde dehydrogenase (acceptor) 
Protein accessionYP_573242 
Protein GI92113314 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCACT CCACCCCTTC AAACCCTGTC ACTCACGCCG ATTGGCAGGC CCTGGCCGAG 
CACCTGACTG TCGAAGCAGG CCTCGAGGCG CGTGCCTATA TCGATGACAC CTTCGTCGAT
GCCGCCGATG GCGCGACCTT CACCACGCTC AATCCGGCCA CCGGCGAGAC GCTCGCCGAA
GTGGCCAGTT GCGATGCGGC CGACGCCGAG ACGGCGGTGT CGGTGGCGCG GCGTGCCTTC
GAGAGCGGCG CGTGGTCGCG TTCGTCACCG GGCGAACGCA AGGCCGTGCT GCTGCGTCTG
GCCGACCTGA TGGAGGCGCA CAAGCACGAG CTGGCGCTGC TCGACAGCCT GGACATGGGG
AAGCCGGTAT CGAGTGCCAT GGGCGACATG GCCGGTGCCA TCGGTTGCAT TCGTCACCAT
GCCGAGTCCA TCGACAAGCT CTATGGTGAA ATCGCGCCCA CCGGCGAGGA AAGCCTGGGG
CTGGTTCTGC GCGAGCCGCT GGGTGTGGTG GCGTCGATCG TGCCCTGGAA CTTCCCGTTG
ATGATGACGG CCTGGAAGAT CGGCCCGGCG CTGGCGGCTG GCAACAGTGT CATCCTCAAG
CCGTCGGAAA AATCACCGCT TTCGGCGTTG CGCCTCGCCC AGCTGACGCG CGAAGCCGGC
TTGCCGCGCG GCGTTTTTCA GGTACTGCCC GGCTTCGGTC ATACCGTGGG CAAGGCCTTG
GCATTGTCCA TGGGGGTCGA CTGTCTGGCC TTTACCGGCT CCACCGGGGT CGGCAAGCAA
TTGATGCAGT ACGCCGGTCA GTCGAATCTC AAGAAGGTCT TTCTGGAGTG TGGCGGCAAG
AGTCCGAATC TCGTGTTCGC CGACTGCAAG GACCTGGATC GCGTGGCCGA ACACGCTGCT
GCCGCGATCT TCCACAATCA GGGCGAGGTG TGCATCGCCG GCTCGCGCCT GCTGGTCGAG
AACAGCATTC GCGAGCGTTT CGTCGGCAAG GTGCTGGCCG CCGCCGAACG CATGCAGCCC
GGCGACCCGT TGGATCCGGC GAGCTTCATG GGCGCGATGG TCGATCAGAC CCAGTATCAA
CGCGTACTCG ACTACATCCG CCAGGGTGTC GAAGAAGGCG CGACGCTACG TGCCGGCGGC
CAGGCGCTGG ATATCGAGGG GGCCAAGGGC CTGTTCATCG GGCCGACGGT ATTCGATGGC
GTCACCGACA CCATGGCCAT CGGTCGGGAG GAAATATTCG GCCCGGTATT GGCGGTGTTC
GGCTTCGACA CCGAGGAGGA GGCCGTACGT CTGGCCAACG ACAGCGACTA CGGCCTGGCG
GCGGGCCTGT GGAGTCAGGA CATCGATCGC ATCATGCGCG TCACCCGTCG GCTGCGCTCG
GGCCAGGTCT TCGTCAACAA CTGGGCCGAT ATGGATCAGA CGGTGCCCTT CGGCGGGGTC
AAGCAGTCCG GCAACGGTCG CGACAAGTCC CACCATTCGC TGGAGGAATA CTCCGATCTC
AAGACCGTCT GGATGACGCT CGCCACCTGA
 
Protein sequence
MTHSTPSNPV THADWQALAE HLTVEAGLEA RAYIDDTFVD AADGATFTTL NPATGETLAE 
VASCDAADAE TAVSVARRAF ESGAWSRSSP GERKAVLLRL ADLMEAHKHE LALLDSLDMG
KPVSSAMGDM AGAIGCIRHH AESIDKLYGE IAPTGEESLG LVLREPLGVV ASIVPWNFPL
MMTAWKIGPA LAAGNSVILK PSEKSPLSAL RLAQLTREAG LPRGVFQVLP GFGHTVGKAL
ALSMGVDCLA FTGSTGVGKQ LMQYAGQSNL KKVFLECGGK SPNLVFADCK DLDRVAEHAA
AAIFHNQGEV CIAGSRLLVE NSIRERFVGK VLAAAERMQP GDPLDPASFM GAMVDQTQYQ
RVLDYIRQGV EEGATLRAGG QALDIEGAKG LFIGPTVFDG VTDTMAIGRE EIFGPVLAVF
GFDTEEEAVR LANDSDYGLA AGLWSQDIDR IMRVTRRLRS GQVFVNNWAD MDQTVPFGGV
KQSGNGRDKS HHSLEEYSDL KTVWMTLAT