Gene Moth_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1050 
Symbol 
ID3831856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1079707 
End bp1082418 
Gene Length2712 bp 
Protein Length903 aa 
Translation table11 
GC content59% 
IMG OID637828978 
Producttranslation initiation factor 2 
Protein accessionYP_429907 
Protein GI83589898 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0532] Translation initiation factor 2 (IF-2; GTPase) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00487] translation initiation factor IF-2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000107022 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000303018 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCCAAAA CACGTGTTTA CGAATTAGCC AAGGAATTAA AGGTGACCAA TAAGGATCTT 
ATAGATACCA TGGCCCGGCT GGGGATTTAT ACCCGTTCCC ACATGAGCGT CCTGGAAAAT
GGGGAAGTTA TTAAAGTACG CAATCATTAC CGGCAGCAGT GGCGGGCGGC TAAATTGGCC
CGGATGCACC GGGAACAAGC TACCCTGAAG GGAGAAGGGC CGGTACCTAC CCGCGGGGTG
CCTGATGCTC CGCGGGCAGA AGAGGTGCCT TCGCGTCAAG TACCAGCTCC CGAAACTGGA
GCACCGGCGC AAGCAACCGG AGCAACCAGG CAGCCTGCCA CCGGGTCAGC CAGGCCTGCT
AACACCGATG AAACCCGCGT CCAGGAGCAC AAACCGGCTA CAGCTGCCAG ACAGGCCGGT
GATGCACCGG CAGCAGAAGG AACGGCGGCC GCCGGGCAGC CCCTGGATGG TAAAGAGCTC
CGGCAGGCCG TTGCCAATGG TCAGGGGACG GAAGAGCCCC GGCAGCAGCC GGCAGCAACA
GGCCAGCGGG CCCAGGGCGG CGAGGCCGGG CGGGGACAAC AGTCACGGCA GAAGAAAAAA
CGCAGGGGAG AAGGTCGTTC CCGGCAGGAT GAAAATAAGG GTTCGGCCCG GGAAGACCAG
GCAAATCGGT TTGCAACCAG GGACAAAGAG GCCGCTCCAT CTGCCGGCCA GCAATCGCCG
GCAGAGAAGG GCCAGCGCCG GCCGGCCCAC AGCAAGCCCC TACGGATTCC CAAGCCCCCC
GAGGCCGTCA CCAAGGATTT ACCGGAAAAG CGGCGTGACC GATCGAACGC CAGGCCCGGT
GCCAAACCTG CCGAGTCCGG CCGCAGCCGT AAGCGGGAAA TGGAGAACCA GCTCGAAGAG
CGCCTGATGC GTCGGGATAA GAACAAGGGG AAAGCCCAGA AGCATAAAGA GACTCCCAAA
GTGGTATTCA AGATTACCCT GACCGGTAGC ATCACCGTCC AGGAGCTGGC GAAAAGGATT
GGCAAAACGG CGGCCGAGGT TATCAAGTAC CTCATGGGCC AGGGAATTAT GGCAACTATC
AACCAGGAAC TGGACCTGGA GACTGCAGCC CTGGTAGCCC AGGACCTGGG TGCCATAGTG
GAGATCAAGG CCGAAAAACC GATTACCGAG CTGGAAGACC TGGTCGATCC CCCGGAGACC
CTCCGGGAGA GGCCGCCGGT CGTCACTGTC ATGGGCCACG TCGACCACGG TAAGACCTCC
TTGCTGGACG CCATCCGGCG TACCAACGTT ACGGCCAGCG AGGCCGGTGG TATTACCCAG
CATATCGGCG CCTACCAGGT AAGGCTGAAG AACAGAAAGA TCACCTTCCT GGATACCCCC
GGCCATGCTG CCTTTACCGC CATGCGGGCC CGTGGTGCCC AGGCGACAGA TATCGCTATC
CTGGTGGTAG CCGCCGATGA CGGCGTCATG CCCCAGACCA TTGAGGCTAT TAACCATGCC
AAAGCTGCCG GGGTACCTAT TGTAGTGGCC ATTAACAAGA TCGATCGTCC TGAGGCCAAT
CCGGAACGGG TGAAGCAGCA ATTGACGGAA TACGGCCTGG TCCCGGAGGA ATGGGGCGGC
GATACCATCA TGGTCCCGGT GTCGGCGGTA ACTAAAGAAG GCATTAATGA TCTCCTGGAA
ATGGTCTTGC TGACGGCCGA TGTAGCTGAA CTCAAGGCCA ACCCCGATCG TCCGGCCCGG
GGTATCGTCA TTGAAGCCAA GCTCGACAGG GGCCGCGGCC CGGTGGCTAC CATGCTGGTC
CAAAAGGGTA CCCTGAAGAT AGGCGATAAC CTGGTAGCCG GTTCGGTTTA TGGTCGCGTC
CGGGCCATGA TTGATGACCG GGGCGAGAGG GTGAACAGCG CCCCGCCTTC CACACCGGTG
GAAGTCCTGG GCCTGTCGGA ATTGCCCGAG GCCGGCGATA TCTTCCAGGT GGTGGAGGAT
GAAAAGCTGG CCCGCCAGAT TGCCTCTTCC CGCCAGGAAG AAAAACGACA GGAAGAACTA
AAGGCCGCGA GCAAGACCAC CCTGGACGAC CTGTTCAAGC AGATGGAAGC CGGAGAAGTC
AAGGAACTTA ACCTGGTCAT TAAGGGGGAT GTCCAGGGTT CGGTAGAAGC CCTGCGGGGC
GCCCTGGAGC AACTTTCGAC CAGTGAGGTT AAGGTCAACC TCCTCCACGG CGGTGTGGGG
GCCATTACCG AGACCGACGT CATGCTGGCG GCGGCCTCGA AGGCGATTAT CATCGGCTTT
AACGTGCGCC CCGAGGCTAA CGTGCGCAAG GCGGCCGAGG AAGCTGGCGT TGAAATCAGG
CTTTACCGGG TTATTTATGA GGTTATCGAC GATGTCAAGG CGGCCATGTC CGGCCTTCTG
GAACCGGAGG AGCGCGAAGT TATCCTGGGA CGGGCCGAAG TCCGGGCTAC CTTCAAGGTA
CCCAAAGCCG GGACGGTTGC CGGGTGCTTT GTTACCGAAG GCAAAATCCA GAATCGCGCC
CTGGCCCGGG TTATAAGGGA TGGTGTGGTA GTCTTTGAAG GCCGGATTGA ATCCTTAAAA
CGCTTTAAGG ACGATGTGCG TGAGGTAGCC CAAGGCTACG AGTGCGGCGT TGGCCTGGAG
AAGTTTAACG ATATTAAAGA AGGCGACGTC ATCGAAGCCT ATACTATCGA AGAGATTCAA
AGGGAATTGT AG
 
Protein sequence
MAKTRVYELA KELKVTNKDL IDTMARLGIY TRSHMSVLEN GEVIKVRNHY RQQWRAAKLA 
RMHREQATLK GEGPVPTRGV PDAPRAEEVP SRQVPAPETG APAQATGATR QPATGSARPA
NTDETRVQEH KPATAARQAG DAPAAEGTAA AGQPLDGKEL RQAVANGQGT EEPRQQPAAT
GQRAQGGEAG RGQQSRQKKK RRGEGRSRQD ENKGSAREDQ ANRFATRDKE AAPSAGQQSP
AEKGQRRPAH SKPLRIPKPP EAVTKDLPEK RRDRSNARPG AKPAESGRSR KREMENQLEE
RLMRRDKNKG KAQKHKETPK VVFKITLTGS ITVQELAKRI GKTAAEVIKY LMGQGIMATI
NQELDLETAA LVAQDLGAIV EIKAEKPITE LEDLVDPPET LRERPPVVTV MGHVDHGKTS
LLDAIRRTNV TASEAGGITQ HIGAYQVRLK NRKITFLDTP GHAAFTAMRA RGAQATDIAI
LVVAADDGVM PQTIEAINHA KAAGVPIVVA INKIDRPEAN PERVKQQLTE YGLVPEEWGG
DTIMVPVSAV TKEGINDLLE MVLLTADVAE LKANPDRPAR GIVIEAKLDR GRGPVATMLV
QKGTLKIGDN LVAGSVYGRV RAMIDDRGER VNSAPPSTPV EVLGLSELPE AGDIFQVVED
EKLARQIASS RQEEKRQEEL KAASKTTLDD LFKQMEAGEV KELNLVIKGD VQGSVEALRG
ALEQLSTSEV KVNLLHGGVG AITETDVMLA AASKAIIIGF NVRPEANVRK AAEEAGVEIR
LYRVIYEVID DVKAAMSGLL EPEEREVILG RAEVRATFKV PKAGTVAGCF VTEGKIQNRA
LARVIRDGVV VFEGRIESLK RFKDDVREVA QGYECGVGLE KFNDIKEGDV IEAYTIEEIQ
REL