Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2844 |
Symbol | |
ID | 4028646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3175369 |
End bp | 3176814 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637968052 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_574889 |
Protein GI | 92114961 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCAC TCGACCACCA ATTCATCGAT AACCGCTGGG TCGCGAGTCA CGGCACTCGA CGCCTGGCGG TGATGGACCC GTATCATGAG CGCCAGATCG CCGAGGTCAC GGCCGGCGAT GCCCGCGACG TCGAGGCGGC CGTCGAGGCG GCACGCCGTG CGCTGCCCGG ATGGCATGCC CTGGGCGGCG AACGTCGCGG CGCCTATCTG AATGCGCTGG CCGACGCCCT GACGGCCCGC CGCGAAGCCC TGATGGAGCT GTCCGCGACC AACAACGGAA AGGCGCTCGC CGAGGCCGGC ATCGATCTCG ACGACGCCAT CGCCTGCTAT CGCTACTACG CACGTCAGGC CGGCGACCTG GAAGCGCGTC AGGGACGCCG CATCACCCAC GACATCGAGG GCGTCGACGC CCACTGCTAC GAAGATCCCG CCGGGGTGAT CGGCCTGATC ACGCCGTGGA ACTTCCCGCT GGTGACCAGC GCCTGGAAGA TCGCGCCGGC CCTGGCGGCG GGCTGCACGG TGGTCTTCAA GCCCTCCGAG GTCACGCCGC TGCCGGAACA GGCCCTCGCC GAGATCGCCC TGGAGATCGC CTTGCCGCCG GGCGTGCTCA ACCTGCTGCA TGGCGATGGC GACGGCATCG GCATACCGCT GACGCATCAT CGCGGCATCG ACAAGCTGTC GTTCACCGGC AGCAACGCCG TGGGCGAGCG CGTCATGCAG GCCGCCGCCG AGGGGAGCCG CGGCGTGTCG CTGGAACTCG GCGGCAAGTC GCCGATCCTG GTACTCGAGG ATGCCGAGGT CGAACAGGCC GCCGACTGGG TCATGGCCGG CATCTATTTC AACAGCGGGC AGATTTGCTC GGCGACCTCG CGGCTGATCG TGCACGAGAC CCTGGCCGAG GCGCTGTACG AGGCACTGGC CACGCGCATC GACGCCATAC GCCTGGGCGA TCCGCTGGGC GAGAACACCG ACATGGGACC GATGACCAGC CAGCGTCAGC GCGATCGAGT CCGCGACTAC CTGGCCGTCG CCGAGCGCGA GGGGCTGGAT GCGGTGCGCG ACGCCCGCCA TCGCCAGTTG CCGTCGCAGG GCTATTTCAT CGCTCCGACG CTGTATCGCG ACGTGCCCAC CGACAGCCGC CTGTGGTGCG AGGAGATCTT CGGTCCGGTG CTGTGCGCAC GCAGTGTCGC CTCCGATGAC GATGCCATCG CCCTGGCCAA CGACAACGAG TTCGGCCTGG CCGCCACGGT CATCAGCGGC GATCCGCAAC GCGCCCAGCG GGTCGGCCGT GCATTGCGCG CCGGCAACAT CTGGTACAAC AGCGAGCAGA TCGTCATGCC CGAGGCCAGT TGGGGCGGCT TCGGGCGCAG CGGCATCGGC CGCGAGCTCG GTCCCTGGGG GCTGTCCGCC TATCTGGAAG TCAAGCACCT GATCGGGCCG GCATAA
|
Protein sequence | MQALDHQFID NRWVASHGTR RLAVMDPYHE RQIAEVTAGD ARDVEAAVEA ARRALPGWHA LGGERRGAYL NALADALTAR REALMELSAT NNGKALAEAG IDLDDAIACY RYYARQAGDL EARQGRRITH DIEGVDAHCY EDPAGVIGLI TPWNFPLVTS AWKIAPALAA GCTVVFKPSE VTPLPEQALA EIALEIALPP GVLNLLHGDG DGIGIPLTHH RGIDKLSFTG SNAVGERVMQ AAAEGSRGVS LELGGKSPIL VLEDAEVEQA ADWVMAGIYF NSGQICSATS RLIVHETLAE ALYEALATRI DAIRLGDPLG ENTDMGPMTS QRQRDRVRDY LAVAEREGLD AVRDARHRQL PSQGYFIAPT LYRDVPTDSR LWCEEIFGPV LCARSVASDD DAIALANDNE FGLAATVISG DPQRAQRVGR ALRAGNIWYN SEQIVMPEAS WGGFGRSGIG RELGPWGLSA YLEVKHLIGP A
|
| |