Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2168 |
Symbol | |
ID | 3833017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2268244 |
End bp | 2269836 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637830090 |
Product | hypothetical protein |
Protein accession | YP_431000 |
Protein GI | 83590991 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00013257 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000197316 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTACCTGG TAACGGCAGC AGAGATGGGA CAGCTGGATC GCCTGGCGTC CAGCGAGTAC ATGATACCCA GTATCGTCTT AATGGAAAAC GCCGGCTTGC GGGTGGTAGA ATCCATCGAG CGCCACTTTC AGGGCCAGGT AGCCAACCGC CGGATTTTAA TCTTCTGTGG TAAAGGCAAC AATGGCGGCG ACGGCCTGGT GGTCGCCCGC CATCTCCTGA ACCGGGGGGC CGAGGTCAAG GTCTTTCTTC TGGCCCGGCC GGAGGATATA AGGGGCGACG CCAGGACCAA CCTGGAGATT TACCAGAAAA TGGGCGGCAA GCTGCTGTTG CTCCTCGGGG AGAGCCACCT GCAGCGGGCC GACATCGCCC TGCTCTATGC CGACCTGGTG GTGGACGCCA TCTTTGGTAC GGGCTTTAAA GGGGCGGCCA TGGGGCTGCC GGCCGCCGTC ATTAATATGA TCAATAAAGC CCACCGGGAG ACGGTGGCCG TGGACCTGCC CTCCGGGCTG GAGGCGGATA CGGGGCGTTG CTTCGGACCC TGCATCCAGG CCACCTGGAC GGTTACCTTC GCCCTGCCTA AACTCGGCCT GGTCGTCGAG CCAGGAGCCA GCCTGACCGG CCGCCTGGAG GTAGCCGATA TCGGCATTCC CCAGAAACTC GTAGCCACCC AGCATTTTAA CCGGCGGCTC CTGACGGCCG CCTGGTGCCG CTCCCAGTTG CCACGTCGGG AGGCCAGCGG CCACAAGGGT TTATATGGTA GGGTCCTGGC GGTGGGCGGT TCACCGGGTC TTACCGGCGC TATTACCCTG GCGGCTACGG CCGCTTTAAA GGCCGGGGCC GGCCTGGTAA CGGCTGCCGT CCCCCGGGGG GTTCAGGGTA TCCTGGCCAT GAAAACTACC GAGATCATGA CCATGTCCCT GCCGGAGACG CCGGCGGGGG CCTTAAGCCG TGACGCCCTG GACCCGCTCC TGGAGCGCCT GGCAGAAGTC GACGTCCTGG CCATCGGCCC GGGCCTTTCC CGGGACCCGG CTACGGTAGA CCTGGTAAAA GAGTTGCTTC CCCGGGTACA GGTGCCGGCG GTGGTAGACG CCGATGCCCT GAACGCCCTG GCGACAGATA CGAGGGTCCT GACCGGCGAT CATGGCCCCC TGGTCCTGAC CCCGCACCCC GGAGAAATGG CCCGCCTGCT GGGAACTACC GCCGCCAAGA TCCAGGAAGA CCGCCTGGAG ATAGCCGCCA AGTACGCCCG GGAATGGCAG GCGGTCCTGC TGTTGAAGGG TGCCCGGACA GTTATTGCCT GGCCGGACGG GCAGGTATAT ATCAATCCTA CCGGTAACCC CGGCATGGCT ACCGCCGGCA GCGGCGATGT ATTGACAGGG ATTATTGCCG GGCTTGCAGG TCAGGGGCTT AAGCCCGGGG TGGCTGCCGC CCTGGGAGCC TATCTCCACG GGGCGGCCGG GGATGAAGCA GCCAGGCAGC GGGGCCAGCG GGCCATGATG GCCGGGGATC TGTTGGACTT TTTGCCATAC GTCTTGCGTA ACCTGGAGGA GGAGGTAGAG ACTATTGTCG CGGCCGGTTT GGGCCGAGAT TGA
|
Protein sequence | MYLVTAAEMG QLDRLASSEY MIPSIVLMEN AGLRVVESIE RHFQGQVANR RILIFCGKGN NGGDGLVVAR HLLNRGAEVK VFLLARPEDI RGDARTNLEI YQKMGGKLLL LLGESHLQRA DIALLYADLV VDAIFGTGFK GAAMGLPAAV INMINKAHRE TVAVDLPSGL EADTGRCFGP CIQATWTVTF ALPKLGLVVE PGASLTGRLE VADIGIPQKL VATQHFNRRL LTAAWCRSQL PRREASGHKG LYGRVLAVGG SPGLTGAITL AATAALKAGA GLVTAAVPRG VQGILAMKTT EIMTMSLPET PAGALSRDAL DPLLERLAEV DVLAIGPGLS RDPATVDLVK ELLPRVQVPA VVDADALNAL ATDTRVLTGD HGPLVLTPHP GEMARLLGTT AAKIQEDRLE IAAKYAREWQ AVLLLKGART VIAWPDGQVY INPTGNPGMA TAGSGDVLTG IIAGLAGQGL KPGVAAALGA YLHGAAGDEA ARQRGQRAMM AGDLLDFLPY VLRNLEEEVE TIVAAGLGRD
|
| |