Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1050 |
Symbol | |
ID | 3831856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1079707 |
End bp | 1082418 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828978 |
Product | translation initiation factor 2 |
Protein accession | YP_429907 |
Protein GI | 83589898 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000107022 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000303018 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGCCAAAA CACGTGTTTA CGAATTAGCC AAGGAATTAA AGGTGACCAA TAAGGATCTT ATAGATACCA TGGCCCGGCT GGGGATTTAT ACCCGTTCCC ACATGAGCGT CCTGGAAAAT GGGGAAGTTA TTAAAGTACG CAATCATTAC CGGCAGCAGT GGCGGGCGGC TAAATTGGCC CGGATGCACC GGGAACAAGC TACCCTGAAG GGAGAAGGGC CGGTACCTAC CCGCGGGGTG CCTGATGCTC CGCGGGCAGA AGAGGTGCCT TCGCGTCAAG TACCAGCTCC CGAAACTGGA GCACCGGCGC AAGCAACCGG AGCAACCAGG CAGCCTGCCA CCGGGTCAGC CAGGCCTGCT AACACCGATG AAACCCGCGT CCAGGAGCAC AAACCGGCTA CAGCTGCCAG ACAGGCCGGT GATGCACCGG CAGCAGAAGG AACGGCGGCC GCCGGGCAGC CCCTGGATGG TAAAGAGCTC CGGCAGGCCG TTGCCAATGG TCAGGGGACG GAAGAGCCCC GGCAGCAGCC GGCAGCAACA GGCCAGCGGG CCCAGGGCGG CGAGGCCGGG CGGGGACAAC AGTCACGGCA GAAGAAAAAA CGCAGGGGAG AAGGTCGTTC CCGGCAGGAT GAAAATAAGG GTTCGGCCCG GGAAGACCAG GCAAATCGGT TTGCAACCAG GGACAAAGAG GCCGCTCCAT CTGCCGGCCA GCAATCGCCG GCAGAGAAGG GCCAGCGCCG GCCGGCCCAC AGCAAGCCCC TACGGATTCC CAAGCCCCCC GAGGCCGTCA CCAAGGATTT ACCGGAAAAG CGGCGTGACC GATCGAACGC CAGGCCCGGT GCCAAACCTG CCGAGTCCGG CCGCAGCCGT AAGCGGGAAA TGGAGAACCA GCTCGAAGAG CGCCTGATGC GTCGGGATAA GAACAAGGGG AAAGCCCAGA AGCATAAAGA GACTCCCAAA GTGGTATTCA AGATTACCCT GACCGGTAGC ATCACCGTCC AGGAGCTGGC GAAAAGGATT GGCAAAACGG CGGCCGAGGT TATCAAGTAC CTCATGGGCC AGGGAATTAT GGCAACTATC AACCAGGAAC TGGACCTGGA GACTGCAGCC CTGGTAGCCC AGGACCTGGG TGCCATAGTG GAGATCAAGG CCGAAAAACC GATTACCGAG CTGGAAGACC TGGTCGATCC CCCGGAGACC CTCCGGGAGA GGCCGCCGGT CGTCACTGTC ATGGGCCACG TCGACCACGG TAAGACCTCC TTGCTGGACG CCATCCGGCG TACCAACGTT ACGGCCAGCG AGGCCGGTGG TATTACCCAG CATATCGGCG CCTACCAGGT AAGGCTGAAG AACAGAAAGA TCACCTTCCT GGATACCCCC GGCCATGCTG CCTTTACCGC CATGCGGGCC CGTGGTGCCC AGGCGACAGA TATCGCTATC CTGGTGGTAG CCGCCGATGA CGGCGTCATG CCCCAGACCA TTGAGGCTAT TAACCATGCC AAAGCTGCCG GGGTACCTAT TGTAGTGGCC ATTAACAAGA TCGATCGTCC TGAGGCCAAT CCGGAACGGG TGAAGCAGCA ATTGACGGAA TACGGCCTGG TCCCGGAGGA ATGGGGCGGC GATACCATCA TGGTCCCGGT GTCGGCGGTA ACTAAAGAAG GCATTAATGA TCTCCTGGAA ATGGTCTTGC TGACGGCCGA TGTAGCTGAA CTCAAGGCCA ACCCCGATCG TCCGGCCCGG GGTATCGTCA TTGAAGCCAA GCTCGACAGG GGCCGCGGCC CGGTGGCTAC CATGCTGGTC CAAAAGGGTA CCCTGAAGAT AGGCGATAAC CTGGTAGCCG GTTCGGTTTA TGGTCGCGTC CGGGCCATGA TTGATGACCG GGGCGAGAGG GTGAACAGCG CCCCGCCTTC CACACCGGTG GAAGTCCTGG GCCTGTCGGA ATTGCCCGAG GCCGGCGATA TCTTCCAGGT GGTGGAGGAT GAAAAGCTGG CCCGCCAGAT TGCCTCTTCC CGCCAGGAAG AAAAACGACA GGAAGAACTA AAGGCCGCGA GCAAGACCAC CCTGGACGAC CTGTTCAAGC AGATGGAAGC CGGAGAAGTC AAGGAACTTA ACCTGGTCAT TAAGGGGGAT GTCCAGGGTT CGGTAGAAGC CCTGCGGGGC GCCCTGGAGC AACTTTCGAC CAGTGAGGTT AAGGTCAACC TCCTCCACGG CGGTGTGGGG GCCATTACCG AGACCGACGT CATGCTGGCG GCGGCCTCGA AGGCGATTAT CATCGGCTTT AACGTGCGCC CCGAGGCTAA CGTGCGCAAG GCGGCCGAGG AAGCTGGCGT TGAAATCAGG CTTTACCGGG TTATTTATGA GGTTATCGAC GATGTCAAGG CGGCCATGTC CGGCCTTCTG GAACCGGAGG AGCGCGAAGT TATCCTGGGA CGGGCCGAAG TCCGGGCTAC CTTCAAGGTA CCCAAAGCCG GGACGGTTGC CGGGTGCTTT GTTACCGAAG GCAAAATCCA GAATCGCGCC CTGGCCCGGG TTATAAGGGA TGGTGTGGTA GTCTTTGAAG GCCGGATTGA ATCCTTAAAA CGCTTTAAGG ACGATGTGCG TGAGGTAGCC CAAGGCTACG AGTGCGGCGT TGGCCTGGAG AAGTTTAACG ATATTAAAGA AGGCGACGTC ATCGAAGCCT ATACTATCGA AGAGATTCAA AGGGAATTGT AG
|
Protein sequence | MAKTRVYELA KELKVTNKDL IDTMARLGIY TRSHMSVLEN GEVIKVRNHY RQQWRAAKLA RMHREQATLK GEGPVPTRGV PDAPRAEEVP SRQVPAPETG APAQATGATR QPATGSARPA NTDETRVQEH KPATAARQAG DAPAAEGTAA AGQPLDGKEL RQAVANGQGT EEPRQQPAAT GQRAQGGEAG RGQQSRQKKK RRGEGRSRQD ENKGSAREDQ ANRFATRDKE AAPSAGQQSP AEKGQRRPAH SKPLRIPKPP EAVTKDLPEK RRDRSNARPG AKPAESGRSR KREMENQLEE RLMRRDKNKG KAQKHKETPK VVFKITLTGS ITVQELAKRI GKTAAEVIKY LMGQGIMATI NQELDLETAA LVAQDLGAIV EIKAEKPITE LEDLVDPPET LRERPPVVTV MGHVDHGKTS LLDAIRRTNV TASEAGGITQ HIGAYQVRLK NRKITFLDTP GHAAFTAMRA RGAQATDIAI LVVAADDGVM PQTIEAINHA KAAGVPIVVA INKIDRPEAN PERVKQQLTE YGLVPEEWGG DTIMVPVSAV TKEGINDLLE MVLLTADVAE LKANPDRPAR GIVIEAKLDR GRGPVATMLV QKGTLKIGDN LVAGSVYGRV RAMIDDRGER VNSAPPSTPV EVLGLSELPE AGDIFQVVED EKLARQIASS RQEEKRQEEL KAASKTTLDD LFKQMEAGEV KELNLVIKGD VQGSVEALRG ALEQLSTSEV KVNLLHGGVG AITETDVMLA AASKAIIIGF NVRPEANVRK AAEEAGVEIR LYRVIYEVID DVKAAMSGLL EPEEREVILG RAEVRATFKV PKAGTVAGCF VTEGKIQNRA LARVIRDGVV VFEGRIESLK RFKDDVREVA QGYECGVGLE KFNDIKEGDV IEAYTIEEIQ REL
|
| |