Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1365 |
Symbol | |
ID | 3832287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1408612 |
End bp | 1409802 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829301 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_430221 |
Protein GI | 83590212 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0275215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTTG CCATGTTACA TTGGGCTTTC CCGCCGATCA TCGGGGGAGT GGAATCCCAC CTAGCCCTCC TGTGCCCTTA CCTCGTCCGG CAGGGGCACC AGGTAAGCCT CCTAACGGCC ACAGCCCCCG GTACCCCGGT GGAGGAAAGC TGGCAGGGAG TTGTAATTAA GCGCTCGCCC TTGCTGGATC TAAATTCCCT TACTCCAGCA GTTATTGAGG CCAGGGCCGG GGAGATCAAG GAACTCTTGG AAAACTTTTT ACTGGCAGTG CGGCCGGATG TGGTCCACGC CCATAACTTT CACTATTTCA GCTATGTACA CGCGGCCAGT CTCCAGGAAA TTTGCCGCCG TCACGGCTGG CCCCTGGTCC TCACGGCCCA TAATGTGTGG GATGACGAAC TGTGGACCAG GATGAATAGC CTGGCCAGGG GCTGGGACCT GGTTATTGCC GTCAGCCACT ACATACGCCA GGAATTGGTG GTTAATGGCT ATCCGCCGGA GCGGGTGACA GTTGTCTACC ATGGCACAGA TACTAATACC TTCCGGCCGC CCTCCCCGGA GGACAGGCAG GCCCTTTATA CCTCCTATCC GGAATGGCGG GGACGGCGGA TTATCTTCCA CCCGGCCAGG ATGAGCCGGG CCAAGGGCTG TGACGTCAGC ATTCGCGCCC TGGATCTCAT CCGCCGGGAA ATCCCCGACG TTCTCCTGGT ACTGGCCGGT ACCACCAACA CCGTTGACTG GGGCCAAAAA CAGCCGGCGG AAGTAGCCTC TCTCCAGGAT CTTATCGCCA GCTTGGGCCT GGAGGAAAAT GTCTTCATCC GTTTCTTCCC GTGGCAGGAG ATGCCTGCTG TTTACCAGGG GGCCGAGGTC TGCCTCTACC CATCGGCCTT CCAGGAGCCG TTTGGTTTGG TCATGCTGGA AGCCATGGCC ACGGCCAGGC CTATCATCGT CAGCCGCGCC GGCGGCATGC CGGAGATCAT TCGTCCCGGA TATAACGGCT TTTTGGTCTC TATGGGGGAT CACGAGGAAC TCGCCCGCTA TACCACTTTC CTTCTCCGTA ATCCGGAGGT GGCCAGGACC ATGGGCCAGG ACGGCCGCAG GCTGGTAGAA GAAAACTTTA CTACCGCCGT GATGGCCCGA AATACCCTGG AGGCATATAA CCAGTTGTTG GCCCTGCCCC GGGCCAGTTA G
|
Protein sequence | MRVAMLHWAF PPIIGGVESH LALLCPYLVR QGHQVSLLTA TAPGTPVEES WQGVVIKRSP LLDLNSLTPA VIEARAGEIK ELLENFLLAV RPDVVHAHNF HYFSYVHAAS LQEICRRHGW PLVLTAHNVW DDELWTRMNS LARGWDLVIA VSHYIRQELV VNGYPPERVT VVYHGTDTNT FRPPSPEDRQ ALYTSYPEWR GRRIIFHPAR MSRAKGCDVS IRALDLIRRE IPDVLLVLAG TTNTVDWGQK QPAEVASLQD LIASLGLEEN VFIRFFPWQE MPAVYQGAEV CLYPSAFQEP FGLVMLEAMA TARPIIVSRA GGMPEIIRPG YNGFLVSMGD HEELARYTTF LLRNPEVART MGQDGRRLVE ENFTTAVMAR NTLEAYNQLL ALPRAS
|
| |