Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0680 |
Symbol | |
ID | 4027144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 763619 |
End bp | 764998 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637965850 |
Product | aldehyde dehydrogenase |
Protein accession | YP_572740 |
Protein GI | 92112812 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0429817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCATA TTCTGAGATG TATCTCGCCG ATCGATGGGT CGGTCTTCGC CGAGCGTGAG GCGCTGTCGC CCGAGGCTGC ACGGCAGGCA GCGGATCGGG CGCGTGCGGC CCAGGCCGAG TGGGCCGCAC GTCCTCTCCG GGAGCGCATC GATCTGGTGC GGGCCGGCAT CGCCGCCGTG GGAGCGATGA ATGACGAACT GGTGCCGGAG CTCGCGCAGA TGATGGGGCG CCCGGTTCGC TACGGTGGCG AGTTCGGTGG CTTCGAAGAA CGGGGTAACC ATATGGCGAC CATCGCCGAA GAGGCGCTGG CCGATATTGC GGTGGGCGAG GACGCCACCG TCAAACGCTA CATCAAGCGC ATTCCACATG GGGTGGTGCT GGTAGTGGCC CCCTGGAACT ACCCCTATAT GACGGCGATC AACACGGTGG CGCCGGCGCT GATCGCAGGC AACAGCGTAC TGTTGAAGCA TGCCACCCAG ACCCTGCTGG CAGGCGAGCG GATGGCGCGT GCCTTCCATT CGGCCGGCGT GCCCGAGGCA GTGTTCCAGA ATGTCTTTCT GGATCACGCC ACCACCTCGA CGCTGCTTGC CGATCGAGCG TTCGATTTCG TCAACTTCAC CGGCTCCGTG GACGGTGGAC GTGCCATGGA ACAGGCGGCG GCCGGTACTT TTACCGGGCT AGGGCTCGAG CTGGGCGGCA AGGATCCAGG CTATGTCATG GAGGATGCCG ACCTCGATGC CGCGGTGGCG GCCCTGATCG ACGGCGCGAT GTTCAATTCA GGCCAATGCT GCTGCGGGAT CGAACGCATC TATGTGCACG AGAGCCTCTA TGAAGCCTTC GTCGACAAGG CGGTAGCGAT CGTCGAGGGC TACAAGCTCG GCAATCCGCT GGCTGCCGAT ACGGATATCG GTCCGATGGC GAACATCCGT TTTGCCAGGG AGGTCCGTAG CCAGATCGAT GAGGCCTTGG CCGCCGGCGC GAGGGCCCAT GTCACGCAGA AACCCGAAGA TGACGGCGGC ACCTACCTGA GTCCGCAGAT CCTTACCGAG GTAACGCATG ACATGCGTGT GATGCGCGAG GAAACCTTCG GTCCGGTGGT CGGCATCATG AAGGTGAGTG GCGATGAAGA GGCCATTCGT CTCATGAATG ACAGCCGATT CGGGCTGACC GCGAGCCTGT GGACGGCCGA TATCGCGCGT GCCCAGCGAG TGGGTGATCG GGTCGAGACC GGTACCGTAT TCATGAATCG TGCGGATTAT CTGGATCCTG GCCTGTGCTG GACCGGCTGC AAGGAGTCGG GGCGCGGTGG TGGCTTGTCG GTGATTGGCT ATCACAACCT GACACGCCCC AAGTCCTACT ATCTAAAGAA AACGACATAA
|
Protein sequence | MGHILRCISP IDGSVFAERE ALSPEAARQA ADRARAAQAE WAARPLRERI DLVRAGIAAV GAMNDELVPE LAQMMGRPVR YGGEFGGFEE RGNHMATIAE EALADIAVGE DATVKRYIKR IPHGVVLVVA PWNYPYMTAI NTVAPALIAG NSVLLKHATQ TLLAGERMAR AFHSAGVPEA VFQNVFLDHA TTSTLLADRA FDFVNFTGSV DGGRAMEQAA AGTFTGLGLE LGGKDPGYVM EDADLDAAVA ALIDGAMFNS GQCCCGIERI YVHESLYEAF VDKAVAIVEG YKLGNPLAAD TDIGPMANIR FAREVRSQID EALAAGARAH VTQKPEDDGG TYLSPQILTE VTHDMRVMRE ETFGPVVGIM KVSGDEEAIR LMNDSRFGLT ASLWTADIAR AQRVGDRVET GTVFMNRADY LDPGLCWTGC KESGRGGGLS VIGYHNLTRP KSYYLKKTT
|
| |