Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1381 |
Symbol | |
ID | 4068916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1672429 |
End bp | 1673919 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983390 |
Product | aldehyde dehydrogenase (acceptor) |
Protein accession | YP_590457 |
Protein GI | 94968409 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.514457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTCG TGTCGGCCGT AGAACTGAAC AGCAACGTCA GCCAATTCAT CACAAAGCCG CGAAAGATGC TGATCGGCGG GAACTGGATC GATAGTGCGT CGGGTAAATT CTTTGAGACC CTGAATCCGG CGACTGGCGA AGTACTGGCG CGAGTTGCCG AAGGCGATCG TGCCGACATT GATCTCGCCG TCGCCGCGGC GCGGAAGGCA TTTGAGAGCG GACCGTGGTC GAAAATGTCG CCGTCGCAAC GGGGACGTCT TTTGTGGAAA CTCGCGGACC TGCTCGAGCA GCACCTGGAA GAGTTCGCCG AGTTGGAATC GCTCGACAAT GGGAAGCCGC TGTCGGTGGC GCGGGTGGCC GACGTTCCGC TCGCGGTAGA CCTTTTCCGT TACATGGCTG GCTGGGCGAC GAAGGTGGAA GGCAACACGA TTCCGCTGGG CCCGCAGTTC CATGCTTATA CCTATCGCGA ACCGGTGGGC GTAATCGGCC AGATCATTCC CTGGAACTTT CCGCTGCTGA TGGCCGCGTG GAAACTGGGT CCGGCGCTGG CGGTTGGGTG CACGGTGGTG TTGAAGCCGG CGGAACAGAC ACCTCTCTCC GCGCTGCGCC TGGGCGAACT GATCATGGAA GCAGGCTTCC CCGATGGCGT GGTGAACGTT GTGCCGGGCT TCGGCGAAAC TGCCGGCGCT GCGCTGGCCG CCCATCCGGA CGTCGACAAG ATTGCGTTCA CCGGATCGAC AGAAGTCGGC AAATTGATTG TGCAGGCTGC CGCAGGCAAC CTGAAAAAAG TCTCTCTCGA ACTGGGCGGC AAGTCGCCGA ATATCGTGCT CGCTGATGCG GACCTGGACA TTGCGATATC AGGCAGCGCG AACGCGATCT TTTTCAATCA CGGCCAGTGC TGCTGCGCGG GCTCACGGCT GTTCGTACAC AAGAGCCAGT TCGACAAAGT GGTGGAGGGT GTGGCCGAAG CCGCAAAGAA CATTCGCTTG GGATCTGGGC TTGATCCGGC AACCAACATG GGTCCGCTGG TTTCGCAGGA GCAACTCGAT CGCGTGTGCG GGTATCTCGA ATCTGGGGTG CAACAAGGAG CAAAACCCCT GGTTGGCGGG AAGAAACAGA CGGGGCCGGG CTACTTCGTG GAGCCAACGG TGCTGGTGGA TGTGAAGCCG ACGATGAAAG TCGTTTGCGA AGAGATCTTC GGACCCGTGG TCACGGCGAT CCCGTTCAAC AGCGTGGACG AGGTGTTGAA CTCAGCCAAT GCGTCGAGCT ACGGTCTCGC GGCAGCGGTG TGGACGCGCG ACATTAACAA GGCGCATTCA CTGGCGGCAA AGCTGCGCGC CGGCACAGTG TGGGTGAATT GTTACAACGT GTTCGACGCC GCGCTGCCGT TTGGGGGTTA TAAGCAATCG GGCTGGGGAC GCGAGATGGG GCACGACGCA CTCGAGCTCT ACACCGAGAC CAAAGCGGTC TGTGTGCGCC TGGAAAACTA A
|
Protein sequence | MSVVSAVELN SNVSQFITKP RKMLIGGNWI DSASGKFFET LNPATGEVLA RVAEGDRADI DLAVAAARKA FESGPWSKMS PSQRGRLLWK LADLLEQHLE EFAELESLDN GKPLSVARVA DVPLAVDLFR YMAGWATKVE GNTIPLGPQF HAYTYREPVG VIGQIIPWNF PLLMAAWKLG PALAVGCTVV LKPAEQTPLS ALRLGELIME AGFPDGVVNV VPGFGETAGA ALAAHPDVDK IAFTGSTEVG KLIVQAAAGN LKKVSLELGG KSPNIVLADA DLDIAISGSA NAIFFNHGQC CCAGSRLFVH KSQFDKVVEG VAEAAKNIRL GSGLDPATNM GPLVSQEQLD RVCGYLESGV QQGAKPLVGG KKQTGPGYFV EPTVLVDVKP TMKVVCEEIF GPVVTAIPFN SVDEVLNSAN ASSYGLAAAV WTRDINKAHS LAAKLRAGTV WVNCYNVFDA ALPFGGYKQS GWGREMGHDA LELYTETKAV CVRLEN
|
| |