Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1658 |
Symbol | |
ID | 3830946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1692222 |
End bp | 1693340 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637829583 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_430503 |
Protein GI | 83590494 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00025615 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00378247 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGCTA AAAAGGTGGC CCTGTTGACG ACCAATTTCT TTCACCCATC CAGTGAGAGA ATTATCATGG GCGGTGCCGA GCGTTACCAG GTCGACCTCT GCCGCCTGTT AAAGGAACTT GGCTATTATG TCGAGGTCTG GCAGATCGGT AGCGGCTGGA CGCGGGAGTT TGATGGGGTG CGTATTCGCA GCATCCCCGT GACCAAAAGC GAGTATAATA CCTTTCCCGA CCTGGCTACC GCTTTTTACG AAAACTCCAT GGCCTTTGAT TATGCCCTTT ACTTTATCCT CACCCTGGCC TATCCGATAG CCCGGGAGAA AAGCATTGCC ATCAGCCACG GGGTTTTCTG GGATTGGCCG GGGTTTGACC TCATGGCCGG GAAGCCGGAG GACCGCCAGG AATGGCTACG ACGCCTGAGT ATTGCCCTGG CCGGACCGCA GAAACTGGTA TCCGTCGATA CCAATACCAT TAACTATTTC AACGCTACCC TGGCCGGCTT TTATCATAAA TGGGAGTACA TCCCCAACTA TGTTGATACT GACCTCTTTA GCCCGCCGGC AGAAGAGCCA GCTGGCGACG ATACCGTCCG TGTTCTTTTT CCCCGCCGCC TGGTCCCGGT ACGGGGCATC AACGAAACCA TGAGGGCGGC GGAAAAACTG ACCTCCCGCT ACCCCTGGAT TGAGTTTCAC TTCTGCGGCC GCGGTCATGA TGATAACGCC GAGAGGCTCA TGAACCAATG GGCCGGTAAC CGGGAGCGCT GTTTCTATTA CTGGAAGCCC CTGGAGATGA TGCCAGAGAT CTACCGGCAG GCCGATATTG TCCTGATTCC CTCCCGCTCA ACAGAAGGTA CCAGCCTGGC CGCCCTGGAG GCCATGGCCT GCGGGAAGCC GGTAATTGCC GGCCTGGCGG GCGGTTTGAG CGACATTATC CTGCACGGCT ACAACGGTTA TCTTATTAAG CCGACAGTAG AAAACCTGGT GGCGGCCATT GAGGAATTGG CCCGGGATGA AGGTAAGAGG AAGCTCATGG GCCGGCGGGC CCGGGAAGTG GCCTTAACCT TTAACCGGAA AATATGGGCT GAGCGCTGGG CCAGGGTTCT GGAGGAGGTA TTTCGCTAA
|
Protein sequence | MSAKKVALLT TNFFHPSSER IIMGGAERYQ VDLCRLLKEL GYYVEVWQIG SGWTREFDGV RIRSIPVTKS EYNTFPDLAT AFYENSMAFD YALYFILTLA YPIAREKSIA ISHGVFWDWP GFDLMAGKPE DRQEWLRRLS IALAGPQKLV SVDTNTINYF NATLAGFYHK WEYIPNYVDT DLFSPPAEEP AGDDTVRVLF PRRLVPVRGI NETMRAAEKL TSRYPWIEFH FCGRGHDDNA ERLMNQWAGN RERCFYYWKP LEMMPEIYRQ ADIVLIPSRS TEGTSLAALE AMACGKPVIA GLAGGLSDII LHGYNGYLIK PTVENLVAAI EELARDEGKR KLMGRRAREV ALTFNRKIWA ERWARVLEEV FR
|
| |