Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1706 |
Symbol | |
ID | 4028544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1938048 |
End bp | 1939607 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637966894 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_573757 |
Protein GI | 92113829 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.615626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCC TCAACACCAC GGCCGTCGAT CGCCTCACCA AGCTGCTGGC CACGCTGGGC ATGCAGGCCG GCATGAATGT TGATACCCCC CTGAGCAACT GGGTCGGGGG GGAGCTCGCA CCCGGCCAGG GCGAACGCAT CGAACTGCTC GACCCCGTGA CCGGATTGGC GCTGATCGAG TATCGCGATG CCGGCGCCGA GCGGGTGGCC AGCGCCGTGG AAGCGGCCAC TCTCGCCCAG CAGGAATGGA TGTCGCTGAC GGCCAGCGAG CGTGGCCGCC GCATGACCAC GGCGGCCTGG GCGCTACGCG GCCACGAAGA GACCCTCGCC CAGTTGGAAA GCGTGGTCGC GGGCAAGCCC ATTCGCGACT GCCGGGGCGA GGTAAACAAG GTCCGCGAGA TGTTCGAATA CTATGCCGGC TGGTGTGACA AGCAGCACGG CGACGTGATT CCGGTTCCCA CCTCACACCT CAACTATGTG CGTCATGTGC CCTATGGCGT GGTGGGTCAG ATCACCCCGT GGAACGCCCC CATGTTCACC TGCGCCTGGC AACTGGCCCC GGCGATCGCC GCCGGCAACG GGGTCGTGCT CAAGCCCTCG GAAATGACGC CCTTCTCGTC GGTGGCCATC GCCATGCTGC TGGAGCGCAG CGGTTTGCCC AAGGGGCTGA TCAACATCGT CAACGGCGTC GGCCCCACCA CGGGCGCGGC ACTCACCGGG CATGACGGCA TCAGCAAGCT GGTGTTCGTC GGTTCCCCCG AAAGCGGCCG CCGCATCGCC CAGGCCGGCG CCGAGCGTCT GGTGCCCAGC GTGCTGGAGC TGGGCGGCAA GTCGGCCAAC ATCGTGTTCG ACGACGCCCG GCTCGACGAT GCCGTGGCCG GCGCGCAGGC CGCGATCTTC GCCGCCGCCG GGCAGAGTTG TGTCGCGGGA TCGCGCCTGC TGGTGCAGCG CGAGGTGTTC GAGGTGGTCT GCGAGCGGCT GGCACGCGCC GCGAGCGAGA TCCGCGTGGG GCTGCCCAGC GACGAGGCGA CCCAGATGGG TCCGATACAG AACGCCAAGC AGTACCGGCA CATCACCGCG ATGATCGACA CGGCCCGCCA GCACGGCGCG CGTCTCTTGT GTGGCGGCCA GCGCCCGGCG GATTTGCCGG CGGACGCCGA GGGCTATTTC CTGGCGCCCA CGGTGCTGGC GGATGTCACC GAAGAGATGG CGATCGCCCG GGAGGAAGTG TTCGGCCCGG TGGTGGTGGT CATGCCGTTC GACAGCGAAG AAGACGCCGT GCGCCTGGCC AATGCCACGC GCTTCGGCCT GGCCGGCGCC GTCTGGACCC AGGACCCGGC ACGCGCCCAT CGCGTCGCCG CGCGCTTGCG GGCGGGCACG GTATGGATCA ACAGCTACAA GGCGATCAAT GTCATGTCGC CGTTCGGCGG CTTCGGCGAC AGCGGCTTCG GGCGCTCCAG CGGCCTGGAA GGACTCAAGG AGTACACCGT TGCCCAGAGC GTCTGGGTGG AAACCGCCCC CACCGCCAGC GTCGCCATGG GCTATGGCAG TGGGGCATAG
|
Protein sequence | MNALNTTAVD RLTKLLATLG MQAGMNVDTP LSNWVGGELA PGQGERIELL DPVTGLALIE YRDAGAERVA SAVEAATLAQ QEWMSLTASE RGRRMTTAAW ALRGHEETLA QLESVVAGKP IRDCRGEVNK VREMFEYYAG WCDKQHGDVI PVPTSHLNYV RHVPYGVVGQ ITPWNAPMFT CAWQLAPAIA AGNGVVLKPS EMTPFSSVAI AMLLERSGLP KGLINIVNGV GPTTGAALTG HDGISKLVFV GSPESGRRIA QAGAERLVPS VLELGGKSAN IVFDDARLDD AVAGAQAAIF AAAGQSCVAG SRLLVQREVF EVVCERLARA ASEIRVGLPS DEATQMGPIQ NAKQYRHITA MIDTARQHGA RLLCGGQRPA DLPADAEGYF LAPTVLADVT EEMAIAREEV FGPVVVVMPF DSEEDAVRLA NATRFGLAGA VWTQDPARAH RVAARLRAGT VWINSYKAIN VMSPFGGFGD SGFGRSSGLE GLKEYTVAQS VWVETAPTAS VAMGYGSGA
|
| |