Gene Csal_1706 details


Gene Information

Locus tag: Csal_1706
Symbol:
ID: 4028544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1938048
End bp: 1939607
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 69%
IMG OID: 637966894
Product: betaine-aldehyde dehydrogenase
Protein accession: YP_573757
Protein GI: 92113829
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.615626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCCC TCAACACCAC GGCCGTCGAT CGCCTCACCA AGCTGCTGGC CACGCTGGGC 
ATGCAGGCCG GCATGAATGT TGATACCCCC CTGAGCAACT GGGTCGGGGG GGAGCTCGCA
CCCGGCCAGG GCGAACGCAT CGAACTGCTC GACCCCGTGA CCGGATTGGC GCTGATCGAG
TATCGCGATG CCGGCGCCGA GCGGGTGGCC AGCGCCGTGG AAGCGGCCAC TCTCGCCCAG
CAGGAATGGA TGTCGCTGAC GGCCAGCGAG CGTGGCCGCC GCATGACCAC GGCGGCCTGG
GCGCTACGCG GCCACGAAGA GACCCTCGCC CAGTTGGAAA GCGTGGTCGC GGGCAAGCCC
ATTCGCGACT GCCGGGGCGA GGTAAACAAG GTCCGCGAGA TGTTCGAATA CTATGCCGGC
TGGTGTGACA AGCAGCACGG CGACGTGATT CCGGTTCCCA CCTCACACCT CAACTATGTG
CGTCATGTGC CCTATGGCGT GGTGGGTCAG ATCACCCCGT GGAACGCCCC CATGTTCACC
TGCGCCTGGC AACTGGCCCC GGCGATCGCC GCCGGCAACG GGGTCGTGCT CAAGCCCTCG
GAAATGACGC CCTTCTCGTC GGTGGCCATC GCCATGCTGC TGGAGCGCAG CGGTTTGCCC
AAGGGGCTGA TCAACATCGT CAACGGCGTC GGCCCCACCA CGGGCGCGGC ACTCACCGGG
CATGACGGCA TCAGCAAGCT GGTGTTCGTC GGTTCCCCCG AAAGCGGCCG CCGCATCGCC
CAGGCCGGCG CCGAGCGTCT GGTGCCCAGC GTGCTGGAGC TGGGCGGCAA GTCGGCCAAC
ATCGTGTTCG ACGACGCCCG GCTCGACGAT GCCGTGGCCG GCGCGCAGGC CGCGATCTTC
GCCGCCGCCG GGCAGAGTTG TGTCGCGGGA TCGCGCCTGC TGGTGCAGCG CGAGGTGTTC
GAGGTGGTCT GCGAGCGGCT GGCACGCGCC GCGAGCGAGA TCCGCGTGGG GCTGCCCAGC
GACGAGGCGA CCCAGATGGG TCCGATACAG AACGCCAAGC AGTACCGGCA CATCACCGCG
ATGATCGACA CGGCCCGCCA GCACGGCGCG CGTCTCTTGT GTGGCGGCCA GCGCCCGGCG
GATTTGCCGG CGGACGCCGA GGGCTATTTC CTGGCGCCCA CGGTGCTGGC GGATGTCACC
GAAGAGATGG CGATCGCCCG GGAGGAAGTG TTCGGCCCGG TGGTGGTGGT CATGCCGTTC
GACAGCGAAG AAGACGCCGT GCGCCTGGCC AATGCCACGC GCTTCGGCCT GGCCGGCGCC
GTCTGGACCC AGGACCCGGC ACGCGCCCAT CGCGTCGCCG CGCGCTTGCG GGCGGGCACG
GTATGGATCA ACAGCTACAA GGCGATCAAT GTCATGTCGC CGTTCGGCGG CTTCGGCGAC
AGCGGCTTCG GGCGCTCCAG CGGCCTGGAA GGACTCAAGG AGTACACCGT TGCCCAGAGC
GTCTGGGTGG AAACCGCCCC CACCGCCAGC GTCGCCATGG GCTATGGCAG TGGGGCATAG
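
The gene length and GC content reported in the Gene Information table could be cross-checked against the gene sequence above. The following is a minimal sketch in Python, assuming the sequence is pasted as text in the 10-base blocks used on this page. The function names `clean_sequence` and `gc_percent` are hypothetical helpers and are not part of any database tool. Only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as displayed on this page (six 10-base blocks).
first_row = "ATGAATGCCC TCAACACCAC GGCCGTCGAT CGCCTCACCA AGCTGCTGGC CACGCTGGGC"

seq = clean_sequence(first_row)
print(len(seq))                   # number of bases in this row
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```

Passing the complete gene sequence instead of `first_row` would give values to compare with the Gene Length and GC content fields.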
 
Protein sequence
MNALNTTAVD RLTKLLATLG MQAGMNVDTP LSNWVGGELA PGQGERIELL DPVTGLALIE 
YRDAGAERVA SAVEAATLAQ QEWMSLTASE RGRRMTTAAW ALRGHEETLA QLESVVAGKP
IRDCRGEVNK VREMFEYYAG WCDKQHGDVI PVPTSHLNYV RHVPYGVVGQ ITPWNAPMFT
CAWQLAPAIA AGNGVVLKPS EMTPFSSVAI AMLLERSGLP KGLINIVNGV GPTTGAALTG
HDGISKLVFV GSPESGRRIA QAGAERLVPS VLELGGKSAN IVFDDARLDD AVAGAQAAIF
AAAGQSCVAG SRLLVQREVF EVVCERLARA ASEIRVGLPS DEATQMGPIQ NAKQYRHITA
MIDTARQHGA RLLCGGQRPA DLPADAEGYF LAPTVLADVT EEMAIAREEV FGPVVVVMPF
DSEEDAVRLA NATRFGLAGA VWTQDPARAH RVAARLRAGT VWINSYKAIN VMSPFGGFGD
SGFGRSSGLE GLKEYTVAQS VWVETAPTAS VAMGYGSGA
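
The protein and gene lengths are consistent with each other: 519 amino acids plus one stop codon give 520 codons, and 520 × 3 = 1560 bp. The protein sequence can likewise be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration in Python, not the method used by the database. It assumes the codon-to-amino-acid assignments of translation table 11, which match those of the standard code, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in T, C, A, G order; "*" marks a stop codon.
# Table 11 shares these assignments with the standard code. Its extra
# start codons (for example GTG, TTG) are not handled, because this
# gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown on this page.
print(translate("ATGAATGCCC TCAACACCAC GGCCGTCGAT"))  # MNALNTTAVD
```

The printed residues match the first block of the protein sequence above. Translating the full gene sequence in the same way would allow a comparison with the complete 519-residue protein.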