Gene Information |
Locus tag | Veis_3601 |
Symbol | |
ID | 4693392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 3979076 |
End bp | 3980521 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639851355 |
Product | aldehyde dehydrogenase |
Protein accession | YP_998336 |
Protein GI | 121610529 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
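
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Veis_3601.
length = gene_length(3979076, 3980521)
print(length, protein_length(length))  # prints: 1446 481
```

Both results match the Gene Length (1446 bp) and Protein Length (481 aa) fields.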
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.256061 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCG CCGAACTCCA GGCTTGCTTC GATCTGCAAC ACCGGGCCAG CCGCGCGCAG CCCGATGTGC CGCTGCAACT GCGCCGCGAG CGCCTGCTGC GCCTGCGCCG GCTGCTCGAC GAGCATGGCC CGGTGCTGGC GGCGGCAGTG CAGGCCGACT TCGGCATCCG CTCGCCGCGC CTGACCGAGA TGGCCGATTT CCTGCTGCTG CGTGCCCTGC TGTCGCACAC CCTGCGCCAT CTGGCCCGGT GGATGAAGCC GCAAAAGCTG CGCACACCGC TGTATCTGCA GCCCGCCAGC GCCTGGGTGC AGCGCCAGCC CCTGGGGGTG GTGGGCGTGA TCGCGCCATG GAACTACCCG GTGCAACGGG CCTTGGCGCC CACCATCACG GCGCTGGCCG CAGGCAACCG CGTGCTGCTC AAGCCCAGCG AGCACACGCC GCACCTGGCG GCCCAGTTGA CGGCCCTGGT GGCGCAGCTT TTTGCGCCCG ACGAATGGTG TGTGCTGCCG GGCGATGCCG CACTGGCGGC CCGGTTTGCC GCGCTGCCGT TCGACCACCT GGTGTTCACC GGCTCCAGCG CCGTGGGGCG CCTGGTGGCG CAGGCAGCAG CGCAGAACCT GACCCCCACC ACGCTGGAGT TGGGCGGCAA GTCGCCCTGC ATCATCGACG CCGATTGCAA TCTGCAGGAC GCAGCGCTGA AGATTGCCCA TGGCAAGCTG CTCAACGCCG GCCAGACCTG CATTGCGCCC GACTACCTGT TGCTGCCCCG GGGGCGCGAG GGCGCGTTTG CGCAGGCTTA CCGGGCGGCG GTGGCGCGCC TGTTCCCCGC CGGCATCGAG GGCAATCCCG ACTACGCCGC CATCATCACG GCGCGCCACC ATGCACGCTT GCAGGCCCTG CTGCAGCAAG CGCAGGCCCA GGGGGCCGAT GTGCAGACCG TGGCGCCGGT GCCGCTGCCG AAAACCAAAC CCAAGGCCCA ACCGGCGCTC GGCGATGGCG CCAGCCGCCA GATGGCGCCA TCGCTGGTGT TTGGCGCCAC AGCGGCGATG GCGCTGATGC AGGAGGAGAT TTTCGGCCCC ATCCTGCCAG TGCTCCCCTA CGAGCACCTC GACGACGCGC TGGCCCATGT CAACGCCGGG CCGCGCCCGC TGGCGCTGTA CTGGTTCGGT CGGAACCCGG CAGCGCGCGC GGCGGTGCTG CGCGGCACCG TGAGCGGGGG CGTGACGCTG AACGACACGC TGCTGCACAT GGCGCACCCC GGCTTGCCGT TCGGTGGTGT GGGCCACAGT GGCTGGGGCG CCTGCCACGG CGAGCAGGGC TTTGCCCGCC TGTGCCAGCA AAAAGCGGTG CTGCAGCAAT CGCGCTGGTC CCTGGCGGCC TGGTGCTACC CGCCGTATGG GGCCCGCTTC GACCGGGTCA TGGCCTTGCT GCGCCGCTGG CTGTAG
|
Protein sequence | MTAAELQACF DLQHRASRAQ PDVPLQLRRE RLLRLRRLLD EHGPVLAAAV QADFGIRSPR LTEMADFLLL RALLSHTLRH LARWMKPQKL RTPLYLQPAS AWVQRQPLGV VGVIAPWNYP VQRALAPTIT ALAAGNRVLL KPSEHTPHLA AQLTALVAQL FAPDEWCVLP GDAALAARFA ALPFDHLVFT GSSAVGRLVA QAAAQNLTPT TLELGGKSPC IIDADCNLQD AALKIAHGKL LNAGQTCIAP DYLLLPRGRE GAFAQAYRAA VARLFPAGIE GNPDYAAIIT ARHHARLQAL LQQAQAQGAD VQTVAPVPLP KTKPKAQPAL GDGASRQMAP SLVFGATAAM ALMQEEIFGP ILPVLPYEHL DDALAHVNAG PRPLALYWFG RNPAARAAVL RGTVSGGVTL NDTLLHMAHP GLPFGGVGHS GWGACHGEQG FARLCQQKAV LQQSRWSLAA WCYPPYGARF DRVMALLRRW L
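
The gene and protein sequences are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and alternative start codons are not handled, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence from this record.
print(translate("ATGACCGCCG CCGAACTCCA GGCTTGCTTC"))  # prints: MTAAELQACF
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.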