Gene Information |
Locus tag | Moth_1661 |
Symbol | |
ID | 3830949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1694879 |
End bp | 1695979 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829586 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_430506 |
Protein GI | 83590497 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
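
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1694879, 1695979)
print(length)                  # 1101
print(protein_length(length))  # 366
```

Both results agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields.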
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000116715 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00132012 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGAAAGGG AGATAGCGAT TTTCACCCCC AACCTGTTAG ACTGGGAGGG GAAGAAACCG GTAATCGGCG GCCTGGAAAG GTATATCCGC GCCCTGGCCG AACTCCTGAC AGATATGGGC TATGCGGTAT CCTTTCACCA GAACGCACAC CGGGATTTCC AGATTACTTA CCTGGGGTGG CCGGTATACG GCTACCAGGC CGATCCCCAG CATTTAAATG TAACCATAGA GCGTATAGAG AAAAGCGTCC CCGGGAGGGT TTTATATTCC TCCATCCTGC AGCAGGTCTA TTACCGCCCC GGCTCTATCT GCATCTCCCA CGGCGTCTGG TGGGAACTGC CGGGATCCAG CCCTGCTCTG GCCCGGGCTG CCTATGAAAA CCACGTAGCC GCCGCCCTTT TCCAGGCCAG CTTGATAATA TCCTGCGACT ATAATTTCCT GAATGTCGCC CGGGCCATCT ATCCCGACCT GGCCGGCCGC AAGATCCAGG TGATCCCTAA CTTTGTCGAC CTGGAGCGTT TTTACCCTCG GGAGGGTAGC AGCAGGAAAG GGATACGCGT CCTTTACCCG CGCCGTCTGT CCCGCGAAAG GGGTTTTGAC CTTTTGCAGG CAGTTATCCC GTCTTTACTG GCTGCCTACC CCGAGCTGGA GTTCCAGTTC GCCATCGACA CCAATACCCC CCGCTACCTG GAGGCTTTCC ACGCCTGGCG GCAGGGGGAA GCCCACAACG AGCGCCTTCT TTACTGTCAT CCCGATTTTG ATGCCATGCC CGGCGTTTAC GCTGATGCGG ACATAGTTGT CATTCCGACT ATTTATTCCG AGGGCACCAG CTTCTCCTGC CTGGAGGCCA TGGCCATGGG CAAAGCCATC ATTGCCACCA ATGTCGGCGG CCTGACCAAC CTTATTATCG ATAATTATAA CGGCCTGCTT ATCCATCCCA CCGCGGAATA CCTGGCCCAG GCCCTCCGCT TTTTAATCGA ACACCCGCGG GAACGTGCCC GCCTGGGCAA AAACGCCGCG GCCACCGCTC GGGTCTTTGA CCGGAAGCTC TGGGAAGCCC GCTGGCGCCA GTTTATCAAC AAGGTCTATC CCTTGGGGTA G
|
Protein sequence | MEREIAIFTP NLLDWEGKKP VIGGLERYIR ALAELLTDMG YAVSFHQNAH RDFQITYLGW PVYGYQADPQ HLNVTIERIE KSVPGRVLYS SILQQVYYRP GSICISHGVW WELPGSSPAL ARAAYENHVA AALFQASLII SCDYNFLNVA RAIYPDLAGR KIQVIPNFVD LERFYPREGS SRKGIRVLYP RRLSRERGFD LLQAVIPSLL AAYPELEFQF AIDTNTPRYL EAFHAWRQGE AHNERLLYCH PDFDAMPGVY ADADIVVIPT IYSEGTSFSC LEAMAMGKAI IATNVGGLTN LIIDNYNGLL IHPTAEYLAQ ALRFLIEHPR ERARLGKNAA ATARVFDRKL WEARWRQFIN KVYPLG
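
The relationship between the gene sequence and the protein sequence can be sketched as a translation under translation table 11. This is a minimal illustration, not the database's own pipeline: it assumes table 11 uses the standard amino acid assignments and that a recognized alternative start codon (such as the TTG at the beginning of this gene) is read as methionine in the first position. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (standard assignments, assumed for table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence; whitespace between 10-mers is ignored."""
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGGAAAGGG AGATAGCGAT TTTCACCCCC"))  # MEREIAIFTP
```

The printed fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` should reproduce the complete protein under the same assumptions.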