Gene Information |
Locus tag | Moth_0673 |
Symbol | |
ID | 3832160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 705290 |
End bp | 706303 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828611 |
Product | UDP-galactose 4-epimerase |
Protein accession | YP_429541 |
Protein GI | 83589532 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
|
|
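The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the values in this record, but the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 705290
END_BP = 706303


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1014 337
```

Both results match the Gene Length (1014 bp) and Protein Length (337 aa) fields.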
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000299597 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTAAGG TACTGGTTAC CGGCGGGGCG GGATATATCG GCAGCCACGT GGTGAAGGCT CTAGGGGAGA GGGGTTACCG GGTGCTGACC TACGACAGCC TGGTAACGGG TCACCCCTGG GCGGTCTTAT ACGGCGACCT GGTGGTGGGC GACCTTTTGG ACGCCGCCAA ACTGGAAGCG GTTATCCGGG ACTTCCGGCC CGATGCCGTC ATGCACTTTG CCGCCCACAT CGTCGTCCCC GAGTCCGTGG CCCAGCCCTT GAAGTATTAC ATAAACAATG TACAAGGCAC TCTGAATCTC CTCGCCTGTA TGCAGAAGAG CGGCGTTAAT AAGCTCATTT TCTCTTCCAG CGCCGCCGTC TACGGCATCC CGGAGCGCAT CCCGGTGCCG GAAGAAGCCC CCCTGCACCC CATCAACCCT TACGGCCACA GCAAGGCCAT GGTGGAGAGG ATACTGCAGG ACCTGTCCGC CGCCGGTGGG ATAACCTACG TATCCCTCCG GTATTTCAAT GTTGCCGGTG CCGACCGGGA CGGGAGAATT GGCGAGGGGA AAGAAGACGC CACCCACCTC ATCACACTGG CTACCCGGAC GGCGGCGGGC AAGAGACCTT ATTTAAGCGT CTTCGGCACG GACTATCCCA CCCCGGACGG CACCTGCATA CGCGACTATA TCCACGTCGA GGACCTGGCG GCAGCCCACG TCCTGGCCCT GGAGTACCTT TTGGACGGCG GGAAGAGCGA GGTGTTCAAC TGCGGCTACG GCCGGGGCTA CTCCGTCCTG GAGGTGATTG CAGCGGCCAA AAAGGTGACC GGTGTAGACT TCCCGGTGCG CTACGAAGGC CGGCGGCCGG GCGACCCGCC GGCTCTGGTG GCCGACGCCA GGAAAATCCG CGAGCGCCTG GGCTGGGTAC CGGCTTACGA CAACCTGGAG GGCATTATCT ATTCCGCCTG GCAGTGGGAA CGGAAGAGGA ATAGCGTCCC GGCTAGGCCC GCGACTATCT CAACCAGGCG GTGA
|
Protein sequence | MAKVLVTGGA GYIGSHVVKA LGERGYRVLT YDSLVTGHPW AVLYGDLVVG DLLDAAKLEA VIRDFRPDAV MHFAAHIVVP ESVAQPLKYY INNVQGTLNL LACMQKSGVN KLIFSSSAAV YGIPERIPVP EEAPLHPINP YGHSKAMVER ILQDLSAAGG ITYVSLRYFN VAGADRDGRI GEGKEDATHL ITLATRTAAG KRPYLSVFGT DYPTPDGTCI RDYIHVEDLA AAHVLALEYL LDGGKSEVFN CGYGRGYSVL EVIAAAKKVT GVDFPVRYEG RRPGDPPALV ADARKIRERL GWVPAYDNLE GIIYSAWQWE RKRNSVPARP ATISTRR
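The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the first blocks of the gene sequence could be translated and their GC fraction computed. It hard-codes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are illustrative, not taken from any library.

```python
# Amino-acid assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record above.
PREFIX = "ATGGCTAAGG TACTGGTTAC CGGCGGGGCG"
print(translate(PREFIX))              # MAKVLVTGGA
print(round(gc_fraction(PREFIX), 3))  # 0.633
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would allow the complete protein sequence and the reported GC content to be checked in the same way.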