Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1589 |
Symbol | |
ID | 3832735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1623596 |
End bp | 1624537 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829518 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_430438 |
Protein GI | 83590429 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000469702 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.717874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTTC TGATAACTGG CGGGTCTGGT GACGTCGGAC GGTACCTGGT CCGGGATTTA GCCGGCCGCG GCCATCGGGT ACGGGTCCTG GACCGGGCTC TACCTAACGG TGATGGCCTC CCTGTCAGCC AAGAGACTCT TTTTAAAGGC CAGCTGGAGG ATAAGGAACT AGTAGTCAGG GCTGTTAAAG GGGTTGAAGC GGTAATTCAC CTGGCCTGGA GCTTCAGCGA CGACCCCCTG GAGGTTTTCG GCGGCGACCT GATTGGGCAT ATAAATCTCT TAACAGCGGC TACCAGGGCT GGGGTGAAGC ACTTTATTTA TGCCAGTACC GCTACAGTTT ACGGCCGGGC TGCCGGGCAT CCAGTTGTAG AAGAACATCC CTGCCTGGTG GGAGAAGCGC GTAAACCCCT TTATGCCCTG GGCAAGTTTG CCGCCGAGGA GCTGTGCCGT CAGTACTGCC GTGAACAGGG ATTGCCGGTG ACCATTTTCC GTTTCTGGTG GGCCTTTGGC GATGAGATTG GCGGTCGCCA TTTGCGCAAC CTCATACGGG CGGCCCTCAA TGAGGAACCC ATCAAGGTAC CAGTCGCTGC CGGGGGCACC TTTGTCAGTA TGGCTGACCT GGCTGCCGCC TGCCGGCTGG TCCTGGCGGG GGAAGGGGCT TGCGGCCAGG TCTATAACCT GGGCAGCCTG TATTTGACCT GGGAAGAGAT CGCCAGCAAG ATAATTGAAC TTACCGGTTC CGCGGGGGAG CTACAACTGG TACCCCAGAA TGAATGGACA GGACCGGCTT TCTTAAACGA AGTCTGGGAT CTCAGCTGGG AAAAGGCGGC CCGGGAATTG GGCTATCGAC CTACCCTCAC CGTCGATGAG GGTCGGTTGG CCTTTACCAG GGCATTGCTT CGCTGTGTGG ATAAAGTCCG AACGGAAATG GGGAAAAACT AG
|
Protein sequence | MELLITGGSG DVGRYLVRDL AGRGHRVRVL DRALPNGDGL PVSQETLFKG QLEDKELVVR AVKGVEAVIH LAWSFSDDPL EVFGGDLIGH INLLTAATRA GVKHFIYAST ATVYGRAAGH PVVEEHPCLV GEARKPLYAL GKFAAEELCR QYCREQGLPV TIFRFWWAFG DEIGGRHLRN LIRAALNEEP IKVPVAAGGT FVSMADLAAA CRLVLAGEGA CGQVYNLGSL YLTWEEIASK IIELTGSAGE LQLVPQNEWT GPAFLNEVWD LSWEKAAREL GYRPTLTVDE GRLAFTRALL RCVDKVRTEM GKN
|
| |