Gene Information |
Locus tag | VC0395_A1414 |
Symbol | aldA-2 |
ID | 5135577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1512348 |
End bp | 1513868 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640532872 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001217357 |
Protein GI | 147674305 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
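The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can show. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1512348
end_bp = 1513868

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the span includes the stop codon, which encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the results match the reported Gene Length (1521 bp) and Protein Length (506 aa).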
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTATG CACAACCGGG TAGTGACAAC GCCGTTATCA CCTTCAAGAG CCATTACGAT AATTTCATTG GCGGCCAATG GGTTAAACCT GTCAGTGGCG AATACTTTGG TAATATTTCA CCCGTCAATG GACAGGTTTA TTGCCAAGTC GCTCGCTCAA CCCAAGCAGA TATTGATTTG GCGTTAGATG CCGCACATCA AGTGCGTGAA GCATGGGCAA AAACCAGTGT GACTGAGCGC TCTAACCTGC TACTGAAAAT TGCTGATCGG ATTGAAGCAA ACATAGAGCA GCTTGCGGTA GCGGAATGCT GGGAGAATGG TAAACCTGTG CGTGAAACTC TAGTGGCCGA CCTCCCTCTG GTGGTTGACC ATTTCCGGTA TTTTGCAGGC TGTATTCGTG CTCAAGAAGG CAGCGCCGCT GAGCTCGATA GCCACACGGC CAGTTACCAT TTTCCTGAGC CAATCGGAGT GGTGGGACAA ATCATTCCTT GGAATTTCCC AATGCTGATG GCAGCGTGGA AGTTAGCGCC AGCCCTTGCT GCAGGCTGCT GTGTGGTACT CAAACCTGCC GAGCAAACCC CTACTTCGAT CTTGGTGTTG ATCGAAAAAA TCGCCGATCT CATCCCAGCA GGCGTGCTCA ACGTGGTCAA CGGTTTTGGC AGTGAAGCAG GCCAAGCCTT GGCCACCAGT CAACGTATTG CCAAACTGGC GTTTACGGGC TCAACCCAAG TGGGTCAACA CATTCTCAAA TGTGCGGCGC AGAGTTTAAT TCCATCAACC GTAGAGCTCG GTGGGAAGTC TCCTAATATC TATTTCCCAG ATATTTTTGA CCACGAAGAC ACCTATTTAG AGAAATGTAT CGAAGGTACT TTGCTGGGCT TTTTCAACCA AGGCGAAGTG TGTACTTGCC CTTCTCGCGT TCTGGTTCAT GAGTCAATTT ACGATCGCTT TGTGGCCAAA GTCGCGGAGC GGGCGAAAGG CATCAAACAA GGCAATCCAC TGGATACCGC TACTCAAGTA GGGGCTCAAG CATCTCAAGA GCAGTTTGAC AAGATCTTGA GTTACTTAGA TATTGGCCGC CAGGAGGGCG CAAAAGTGCT GTTTGGTGGT GGCGTTGCCA AACAGGAAGG GGAACTCGGT CAAGGCTACT ATATTCAGCC GACCCTACTG CAAGGCCACA ACAAGATGCG AGTCTTCCAA GAAGAGATTT TTGGTCCTGT GATTGCCATT ACCTCTTTTA AAGACGAAGC CGAAGCGTTG GCATTGGCTA ACGATACCGA ATACGGCCTC GGCGCAGGTA TTTGGACTCG TGACCAAAAC CTCGCCTACC GTATGGGAAG AAATATTCAA GCAGGCAGGA TCTGGATCAA CTGTTACCAC GCTTACCCCG CGCATGCGGC CTTTGGTGGC TATAAAAAAT CGGGTATTGG GCGTGAAACC CACAAAATGA TGTTGAACCA CTACCAGAAC ACCAAAAACT TACTGATCAG CTACGATGTG AATCCTCTCG GTTTCTTCTA A
|
Protein sequence | MIYAQPGSDN AVITFKSHYD NFIGGQWVKP VSGEYFGNIS PVNGQVYCQV ARSTQADIDL ALDAAHQVRE AWAKTSVTER SNLLLKIADR IEANIEQLAV AECWENGKPV RETLVADLPL VVDHFRYFAG CIRAQEGSAA ELDSHTASYH FPEPIGVVGQ IIPWNFPMLM AAWKLAPALA AGCCVVLKPA EQTPTSILVL IEKIADLIPA GVLNVVNGFG SEAGQALATS QRIAKLAFTG STQVGQHILK CAAQSLIPST VELGGKSPNI YFPDIFDHED TYLEKCIEGT LLGFFNQGEV CTCPSRVLVH ESIYDRFVAK VAERAKGIKQ GNPLDTATQV GAQASQEQFD KILSYLDIGR QEGAKVLFGG GVAKQEGELG QGYYIQPTLL QGHNKMRVFQ EEIFGPVIAI TSFKDEAEAL ALANDTEYGL GAGIWTRDQN LAYRMGRNIQ AGRIWINCYH AYPAHAAFGG YKKSGIGRET HKMMLNHYQN TKNLLISYDV NPLGFF
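The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The following is a sketch only: the helper names `clean`, `gc_content`, and `translate` are hypothetical, the codon table is built from the standard NCBI amino-acid string (translation table 11 uses the same codon assignments as the standard code and differs in permitted start codons), and how the record rounds GC content is not stated here.

```python
# Build the codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First six codons of the gene sequence above, followed by its TAA stop codon.
prefix = "ATGATTTATG CACAACCG TAA"
print(translate(prefix))  # MIYAQP
```

Passing the full Gene sequence value to `translate` and comparing the result, after `clean`, with the Protein sequence value is one way to confirm that the two fields agree.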