Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1531 |
Symbol | |
ID | 4073019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1868021 |
End bp | 1869466 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983540 |
Product | aldehyde dehydrogenase |
Protein accession | YP_590607 |
Protein GI | 94968559 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA TCGTCGCGGC CCCGGCGGCA TCGTATCCGT TCCTGCTCAA CGGCGAGTGG ATCTCCGACG GTAGTCCCGT CGAGATCCAC TCGCCTTTTG ACCACAAAGT TATCGGCCAG GTCTTCTACG GTTCTTCCGC TCACGTGGAA GCGGCCATCC GCGCAAGTGT TGAGGCGTTC CAGATCACCC GAAAGCTGGG CAGCTACGAG CGCGAACGAA TCCTGAGTGC GATCTCCCAA AAGCTTTCCG AACAGCGCGA AGACTTTGCG CACACGATTG CACTCGAAGC CGGCAAGCCA ATCAAGACGG CTCGGCAGGA AGTCGAACGT GCGATTTACA CATTCAAGGT TGCGGCAGAA GAAAGCACGC GCATCGAGGG AGAGTACCTT CACCTCGATA CGATTGAGGC GACAAAAGGG CGATGGGGAA TTGTGCGGCG CTTCCCTATC GGCCCGATCT TTGCGATCAC GCCGTTCAAT TTCCCGCTGA ACCTCGTGGC GCACAAGCTC GCGCCTGCAA TTGCCGCAGG GTGCCCAGTC ATCCTAAAGC CCGCACCGCA GACGCCGATC ACCGCTCTGA AGCTCGCGCG CGTGATTCAC GAATCAGGCT GGCCGGCAGG CGCACTCACC GTCATGCCGC TGTCGAACGA AGACGCAAGC CTGCTCGTCA CCGACGAACG CATCAAGCTA CTCACGTTCA CTGGCAGCTC CATTGGTTGG GACCTGAAGA GCAAAGCCGG CAAAAAGCGG GTGCTCCTCG AACTCGGCGG AAATGCCGCC ATCATTATCC ATTCCGACGC TGATCTCAGG TTCGCTGCCG AGCGATGCGC GCACGGCGCA TTCGGTTACG CCGGTCAAAG CTGCATTTCC GTTCAGCGGA TCCTGGTGGA AAAGAGCGTC TATAGCGAGT TCCGCCAAAT GCTCGTAAAC GCAGCCGGAA AACTAAAGAC CGGAGACCCT CTCGATGAGG CGACCGACGT CGGGCCACTC ATTCGCGAAT CGGACGCCTT GCGTGCGGAA TCATGGGTGA AGGAAGCGGT GGCTCAGGGC GCGACCTTGC TCTGCGGTGG GACCCGCAAA GGCAGCTTGC TCGAGCCCAC CGTGCTGACG AATACGCGCC CGGAGATGCT GGTCAATTGC CGCGAAATCT TCGCCCCCGT GGTCACCGTG GAAGCATACG ACGACTTCAA CGAAGCCCTG AGGCAAGTCA ACAATTCGCC ATTCGGTCTG CAAGCAGGCA TTTTGACTCG CGATGCGCAG CGCATCTTCA CGGCCTTTAA CGGGCTCGAT GTTGGCGGGG TTGTGGCAGG CGACGTACCG ACCTTCCGCA TTGACCACAT GCCCTACGGC GGGATCAAAG ATTCAGGTCT CGGACGCGAA GGCGTGCGCT ATACGATTGA GGAAATGACC GAGCCGAAGT TACTCGTGAT GAACCTCGGC GCGTAG
|
Protein sequence | MSEIVAAPAA SYPFLLNGEW ISDGSPVEIH SPFDHKVIGQ VFYGSSAHVE AAIRASVEAF QITRKLGSYE RERILSAISQ KLSEQREDFA HTIALEAGKP IKTARQEVER AIYTFKVAAE ESTRIEGEYL HLDTIEATKG RWGIVRRFPI GPIFAITPFN FPLNLVAHKL APAIAAGCPV ILKPAPQTPI TALKLARVIH ESGWPAGALT VMPLSNEDAS LLVTDERIKL LTFTGSSIGW DLKSKAGKKR VLLELGGNAA IIIHSDADLR FAAERCAHGA FGYAGQSCIS VQRILVEKSV YSEFRQMLVN AAGKLKTGDP LDEATDVGPL IRESDALRAE SWVKEAVAQG ATLLCGGTRK GSLLEPTVLT NTRPEMLVNC REIFAPVVTV EAYDDFNEAL RQVNNSPFGL QAGILTRDAQ RIFTAFNGLD VGGVVAGDVP TFRIDHMPYG GIKDSGLGRE GVRYTIEEMT EPKLLVMNLG A
|
| |