Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3495 |
Symbol | |
ID | 4072753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4122496 |
End bp | 4123605 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637985517 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_592570 |
Protein GI | 94970522 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1088] dTDP-D-glucose 4,6-dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.437184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA ATAAAGGAAG TTTCCGTTCG GCACTCATTC TGGGCGGAGC TGGCTTTATC GGCTCCAATC TGGCAAGTTG GCTCCTCCAA AATACCTCGG CTAAAGTACA CATTTTCGAC AACTTATCGC GTTTCGGCGT GCGCAACAAT TTGGATTGGC TGCAGGGAAT GGCGGCCACT TCTGGGCGGC TGCAGATCAC CGTTGGGGAC GTTCGCGACG CCGCCCACGT GGAGCGAGTG GTTCGGCACG CGACGGAGAT CTACCACTTC GCGGCGCAGG TTGCAGTAAC CACATCAATC TCCGATCCGA GGCACGATTT CGAGGTAAAT CTCGGGGGGA CGGTCAACGT ACTCGAAGCC GCGCGGAAAA GCGACAATCA GCCCTTCATC TTCTTTACTT CGACCAACAA GGTATACGGC GATTTCGGTG CGGAAGACCT TTATTTAGAC GGCAAGCGGT ATCGCAGCAA AAACGCAGCC GGAACTTCAG AAACGCAGCC GCTGGATTTC CACTCGCCCT ATGGATGCTC GAAGGGGGCG GCCGACCAAT ACGTTCGTGA CTATGCCCGG ATTTATGGAC TGAACACGGT AGTGTTCCGG ATGTCGTGTA TCGCGGGCCA GCAGCAGTTC GGCAATGAAG ACCAGGGATG GGTGGCACAT TTTCTGTACT CTGCTCTGCG CGGCGCGCCG ATCACGATCT ACGGCAATGG CAAACAAGTC CGCGATGTGC TCTGCGTGGA TGACCTGGTA CGCGCAATCG ATCTGGCGCG GCAGTTGCCG GCCTCATCCG AGGGGCGGAT TTACAACATC GGTGGCGGCG CGGAGAATGC GCTCTCGCTT CTCGAATTAA TGGACCTTGT AAAGAGCGTG ACGGGGCACG GCTGCGATGT GACCTACGAT GCCGCTCGGC CCGGTGACCA GCTTTACTAC GTCACCGACT TCGCGAAGTT CAAGCGGGAT TCAGGGTGGC AACCGGAGAT CAGCCCCGAA GGCACGCTCA AAAAGATCTA CGACTTCTAC AAGAAGAACC GCGATCTGTT CGCGCTGACT GCTGCAAGAC CTTCGATCTT GCCTGCGGCT TCCGCTTCGG AGCTGGCGCA ACCGGCATGA
|
Protein sequence | MTMNKGSFRS ALILGGAGFI GSNLASWLLQ NTSAKVHIFD NLSRFGVRNN LDWLQGMAAT SGRLQITVGD VRDAAHVERV VRHATEIYHF AAQVAVTTSI SDPRHDFEVN LGGTVNVLEA ARKSDNQPFI FFTSTNKVYG DFGAEDLYLD GKRYRSKNAA GTSETQPLDF HSPYGCSKGA ADQYVRDYAR IYGLNTVVFR MSCIAGQQQF GNEDQGWVAH FLYSALRGAP ITIYGNGKQV RDVLCVDDLV RAIDLARQLP ASSEGRIYNI GGGAENALSL LELMDLVKSV TGHGCDVTYD AARPGDQLYY VTDFAKFKRD SGWQPEISPE GTLKKIYDFY KKNRDLFALT AARPSILPAA SASELAQPA
|
| |