Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1831 |
Symbol | |
ID | 3832800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1889192 |
End bp | 1890613 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829761 |
Product | glycosyltransferase |
Protein accession | YP_430674 |
Protein GI | 83590665 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000115115 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTGGG AAACGATAAG TCCAGCGCAA ATCCTTTATT TAATTACCGT GGGGTTCTAC CTGCTTTTTT TCGGCCTCTT CCTGAGATAT TTCTATTGGA AATGGTATGC CGTCAAGTAT CACTGGCGTA AACGCCTGCC CCTTGATGCT GAAAAGGTAA AGGCCCTGGC CGCCGCGAAG GGCCTGGAGA TCCCTTTCTT TACCATTATG GTACCGGCCC GCAACGAGTC CGAGGTTATA GCCAACACCA TCGAACACCT GGCCTCCTTA AACTACCCCA ATGATCGTTA TGAGATCCTG GTAATCACCG ATGAAAAGGA AGCCCTGGCC AAGGCCGAAG GCCAGGGCGA GGGGCCTACC ACCATGGAAG TGGTCGAGGC CAAGATCCGT GAGTTTGCAG CGCGCCCGGG TATGCCCCAG CTGAAGCATT GTACCGTTCC CTACGATTTT GACGGCCGCT TCCGGGGTTC ACGGCGGGGG CACAGCATCC CTTCCACCAA GGGCCGGGCC CTGAACTACG GCCTGGAGTT TGTCGACCCG CGGACGACTA TTTGTGGTTT CTATGACGCC GAGAGCCATC CAGAGGCTGA TGTTCTCCTT TACATAGCCT GGTCCTGGCT CCATGACCCG CGGGAGCGTA TCTGGCAGGG TCCCGTCTTC CAGGTGCGTA ATTTCTACCA GCTGGGTATT ATTACAAAGA TCGCCGCCAT CTACCAGGCC ATCTCCCATG AGATCTACCT GCCCATACTA ATGAAGAAGC TGCCCTTCGT AGGGGGCACC AATCTCTTCG TCGGCCGGCG CCTCCTGGAG CGTATCGGGG GTTATGATCA CCGCGCCCTG ACGGAAGACC TCGAGCTGGG GGTGCGGGCC TTCCTGGAGA CAGGGGTGTG GGCCGAGTAT TTCCCTTATT TCAGCACCGA ACAAACGCCG GCCACCCTGT ACGCCTTTTT CCGGCAGCGC TTGCGCTGGG GTAGCGGTCA CCTCCAGGTC TGTGATAAAT TCCGTTATGC CTACCAGTAT TCCTGGGATA AGAGGGGCCC ACTACTCCAC AACCTCTTCT GGAAGGGCCA GGGCGAGTGG CTCCTCTATC AGGGCGCGGT ACTGGTGCCT TTATCCATTG TCATCCTGGG GCTGAACGGC GGGCTTGATC CCTCGATCGT CCCTTTTAAA ATCCGCGTGG TCCTCCATTA CCTGGTTTTC ATCTACTTTG CCTTTACCTT TTATGCTTAC GGCCACTTCC ACCGCTTGAT GGCGCCGGTT AACTGGTGGC AGCAGTTTAT CGGGTTCCTG CAGCTCCTGG CCCTGCCCTT TGCCAGTTTC TTTTTGCCCC TGCCATATAC GGCGGCTTCC ATCATGAAGG CCCTGAACCG CCAGCCCCAG ACGTGGGTTA AAACTCCACG GACCAAAGAG GCGACCCGCT AG
|
Protein sequence | MDWETISPAQ ILYLITVGFY LLFFGLFLRY FYWKWYAVKY HWRKRLPLDA EKVKALAAAK GLEIPFFTIM VPARNESEVI ANTIEHLASL NYPNDRYEIL VITDEKEALA KAEGQGEGPT TMEVVEAKIR EFAARPGMPQ LKHCTVPYDF DGRFRGSRRG HSIPSTKGRA LNYGLEFVDP RTTICGFYDA ESHPEADVLL YIAWSWLHDP RERIWQGPVF QVRNFYQLGI ITKIAAIYQA ISHEIYLPIL MKKLPFVGGT NLFVGRRLLE RIGGYDHRAL TEDLELGVRA FLETGVWAEY FPYFSTEQTP ATLYAFFRQR LRWGSGHLQV CDKFRYAYQY SWDKRGPLLH NLFWKGQGEW LLYQGAVLVP LSIVILGLNG GLDPSIVPFK IRVVLHYLVF IYFAFTFYAY GHFHRLMAPV NWWQQFIGFL QLLALPFASF FLPLPYTAAS IMKALNRQPQ TWVKTPRTKE ATR
|
| |