Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0685 |
Symbol | |
ID | 4068775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 842617 |
End bp | 844074 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982691 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_589764 |
Protein GI | 94967716 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGTCGC TGCACAGCTA CATTGGCGGA CGCTTCGTGC CCGGCAAACG GGGGTTTGCG GACATTAATC CCGCAGATGG AAGTGTTGCT GCCACCGTGA CGGAAGCGTC AGCCGAGATG GTGGACGAAG CTGTTGCTGC GGCGCGCAAG GCGCTGCATG GGGAGTGGGG CAAACTCGGC GCGCGTGGTC GAGCGACTGT TCTCTACAAG ATTGCGGAAG GAATTGAGAA GCGCTTTGAC TGCTTTGTAC AGGCGGAGGT CGCAGACACG GGCAAGCCCT TATCGCTGGC CTCCCGACTG GATGTGCCAC GCGCCGCCGC GAACTTTCGA GTGTTTGCTG ATGTGATCAA AATCGCGGGG CTGGAAGCGT TCGAGACCGA ATTGCCGGAC GGTCGCCGCG CGCTGAATTA CACCGTGCGC AAGCCGATTG GTGTAGTTGG GATCGTGACT CCGTGGAACT TGCCGCTGCT GCTACTGACG TGGAAGGTGG CTCCGGCACT GGCGTGCGGA AACGCGGTGG TGGTGAAGCC GTCGGAAGAG ACGCCAGCGA CCGCGACACT GCTAGCGGAA GTGATGCAGG AAGCGGGCGT ACCCGATGGC GTCTATAACG TGGTGCACGG GTTCGGGCCA AACTCGGCGG GTGAGTTTCT GGTCAGCCAT CCGGGCGTGA ACGCGGTGAC TTTTACGGGG GAGTCGCAGA CGGGCGCCTC TATTATGCGG GCTTGCGCGC CGACCGTGAA GCCGGTTTCG TTTGAGTTGG GCGGCAAGAA TGCCGCCGTC ATTTTTGCTG ACTGTGATTT CGACGCGACC ATCGCGGGCA TGAGTGATGC GGTGTTCCTC AACACCGGCC AGGTTTGTCT GTGTGCAGAG CGCGTGTACG TGGAAAGACG AATCTTCGAT AGGTTTGTCG CGGCTCTGAC GGAGCGCGCG AAGAGTTATG ACCTGGGATG GCCGATGGAA CCGGCTACAT CGATGGGGCC TCTGATTTCG AAGGTGCATC GCGAGAAAGT TCTGTCTTAT TTCGACCTGG CGCGCGAAGA GGGCGCAACT GTCGTAATCG GCGGTGGCGT GCCGACGTTT GGTGATGGCC GCGATAGCGG CTTCTATGTA CAGCCAACGA TCTTCACGGG ACTGAAGGAA TCGGCGCGCT GCGTGAAAGA AGAAATCTTC GGACCGGTGT GCCACGTTGC ACCGTTCGAT TCCGAAGAAG AAGCAGTGGC GCTGGCGAAC GATACGCGGT ATGGTTTGGC GGCTTCAATT TGGACGAGTG ATTTGCAGAG GGCGCACCGC GTGGCGCCGC AGATGAATGC AGGCATCACG TGGGTGAATT GCTGGTTCCT GCGCGATTTG CGCACGCCAT TCGGCGGAGT TGGGCTATCG GGAATTGGGC GCGAGGGCGG GATGCACTCG CTGAATTTCT ATTCCGAGTT GAACAACATC TGCATTCGGA CCGAGTAA
|
Protein sequence | MKSLHSYIGG RFVPGKRGFA DINPADGSVA ATVTEASAEM VDEAVAAARK ALHGEWGKLG ARGRATVLYK IAEGIEKRFD CFVQAEVADT GKPLSLASRL DVPRAAANFR VFADVIKIAG LEAFETELPD GRRALNYTVR KPIGVVGIVT PWNLPLLLLT WKVAPALACG NAVVVKPSEE TPATATLLAE VMQEAGVPDG VYNVVHGFGP NSAGEFLVSH PGVNAVTFTG ESQTGASIMR ACAPTVKPVS FELGGKNAAV IFADCDFDAT IAGMSDAVFL NTGQVCLCAE RVYVERRIFD RFVAALTERA KSYDLGWPME PATSMGPLIS KVHREKVLSY FDLAREEGAT VVIGGGVPTF GDGRDSGFYV QPTIFTGLKE SARCVKEEIF GPVCHVAPFD SEEEAVALAN DTRYGLAASI WTSDLQRAHR VAPQMNAGIT WVNCWFLRDL RTPFGGVGLS GIGREGGMHS LNFYSELNNI CIRTE
|
| |