Gene Cthe_0991 details


Gene Information

Locus tag: Cthe_0991
Symbol:
ID: 4811285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1183983
End bp: 1187090
Gene length: 3108 bp
Protein length: 1035 aa
Translation table: 11
GC content: 40%
IMG OID: 640106409
Product: translation initiation factor 2
Protein accession: YP_001037416
Protein GI: 125973506
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0532] Translation initiation factor 2 (IF-2; GTPase)
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00487] translation initiation factor IF-2
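
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates (which the record's own numbers imply) and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 1183983
end_bp = 1187090


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # prints: 3108 1035
```

Both results match the "Gene length" and "Protein length" fields of the record.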


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0315386
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCAAAAA AGAGAGTTTA TGAGCTTGCC AAAGAACTTA ACACGACAAG TAAGCGACTG 
ATGGAAAAAC TGGAAGAAAT AAATATAGTC GTCAAGAATC ACATGAGCTT TTTAGAGGAG
GATGAACTGG AAGCTTTATA TGACCATATA GGCGTTATCA GACATAAAGA TGACAAATCC
AATACGGATG ATAATAAAAC TGCTTCAGCT CATTCAGTAG CCCAACATAG CAGCGAGGCC
ATGAAAGAGC TCAAAAAAGA AGCAAAAAAA GCGCCGAGGA TTATTAGAAC AACGGAAATC
TATCTTGATT CAAAAGATGA GGAAATAAAG GAAATAAAGG CAAATGATGC CAAGAGTCAA
AAGAAGCCCG AAGAGAAACG TAAGAAAAAT GATTTTGTCA GAGTTGAGAC TGAGACTTCG
GGTTTAAGAC CCGGTCTCGT AAGGGAAACG AAGCCGGAAT ATATGAGGAT TCTTGAAGAA
CAGAAAAAAA GTGAAGCATC AAAAGCTCAA ACCAATGAGA AAAAGGATGC TGAAAAAAAC
AGTGTAAAAG AAGTTGTAAA AAAAGAGGAA GGTTCAAAAC AAACTGCTGA AATTAAGGAT
GGCTCTGTGA ATATGGAAGG CAAAGTGTTA GAAGAAGTAA AAGCTACCGT GGCGGATAGC
GCTACCAATG TTAATTTGAA TGAAAGCATT GACAAAGATA AAAAAACAAA TGACAACAGG
CAGGTTTCAA CGGATAACAG TGCGGTAAAC AATGAGGAAA ATGCTGCGGA CACTTTAAAT
AAAAAAGATA TGGATAAGAA AAATAATAAT AAAAAGAATG AAGCAAAAAA GAATGCTGAA
AAGAAAAACG AGGCAAAAAA GAATGAAAAA AATGACAACA AAGGTGGCAA TGCAAAGAAA
AATGAACATA GAAGCCCTGA TATGAAGAAA AATGATTCCA ATCGCCCGCA GGATGCAAAC
AAGCAGAATA GCAAAGCCGC TGCCGATAAA AATCGGGAGG AAGGTCGTAC TGGATCAAAG
AAGTCTTTGG AAATACCAAA GGTTGAACTT ACCACTTCCC AAAAAGAAGA GTTTAACTCA
CAGCGTGCCG AAAGACGTGA GTACAATAAA GATGCCGAAA AAGACTCCAA GCGGGAGCTT
AGAAAGGAAC AGCCGAGATC CGCAATCAGC GGAGGAAGAA ATAAAAATCA CAAAGTAATA
AAAAATGTTT TTAATTCCAG AAAAGGAGTT TCCGAAGTAT TATCCGATGA TTTTGAGATG
GATGATTTTT ACTTCGGTGG TTCAAAGAAA AGCAGGAAGA TAAAAAAGAA GAAAGAAGAG
AAAAAAGAGG AAAAACCGGC TCCGCCAAAG CCTGTGGTTA CGTCAATAAA AATTGCCGCG
CCCATTACTG TCAAGGAATT GGCAGAAGCT CTTAAAAAGA CATCAGCTGA AGTAATAAAG
AAACTAATGT CTTTGGGAAT CATGGCAACG TTAAATCAGG AACTGGATTT TGATACGGCG
GCTATAGTTG CGGACGAATT TGGGGTGAAG GCGGAAGAGG AAGTTGTTGT AAACGAGGAG
GATATCCTCT TTGACGATTC CGATGATCCG AACGATCCTG AGGCGGTGCC AAGACCTCCT
GTGGTGGTTG TTATGGGACA TGTTGACCAC GGAAAAACAT CGCTTCTGGA TGCAATCAAG
AAAACAAATG TTACGGAAAA AGAAGCCGGT GGAATAACTC AGCATATAGG TGCTTACATG
GTGAAAATAA ATAACAGAAA TATTACGTTC CTCGATACTC CGGGTCACGA AGCTTTTACG
GCCATGAGAG CAAGAGGTGC CCAGGTTACT GACATTGCCG TTTTGGTTGT GGCCGCGGAT
GACGGTGTCA TGCCTCAGAC AATCGAGGCA ATAAACCATG CAAAGGCCGC AAATGTTACG
ATCATTGTTG CAATTAACAA AATTGACAAG CCGACTGCAA ACCCGGAAAA AGTCAAGCAG
GAATTGACAG AATATGGACT CATTCCTGAG GAATGGGGCG GCGATACAAT TTTTGTTGAA
GTTTCTGCAA AAAAAGGAGT TAATATCGAC TATCTGCTGG AAATGATTCT TTTGGCTGCG
GATATGCTGG AGCTTAAGGC AAATCCGAAC AAACAGGCAA AAGGTACCGT TATTGAGGCA
AAACTTGACA AAGATAAAGG GCCTGTTGCA ACGGTGCTTG TACAGAGGGG AACATTATGT
GTTGGTGATT CCATTATAGT CGGCACCACT ACCGGAAGAA TAAGGGCAAT GACGGACGAT
AAAGGCCACA GAATCAAAAA GGCCGGACCT TCAACACCGG TTGAGATTCT TGGATTGCAT
GAAGTTCCCG AAGCTGGGGA AACATTCTAT GTGATAACCG ACGAAAAAAC TGCAAAACAA
TTAATAGAAA AGAGAAAACT AAAACAAAGA GAACAGCTGC TTAAAGCCAG CGCAAGAGTT
ACTCTCGATG ATCTGTTCAA TCAGATAAAA GAGGGTAAGG TAAAAGAACT GAATATCATT
GTAAAAGCCG ATGTTCAAGG TTCCGTTGAG GCTTTGAAAC AGTCTTTGGA GAAGCTTAGC
AATGATGAAG TCAGAGTAAA AATTATTCAC GGAGGAGTAG GTTCGGTAAC CGAAACCGAC
GTTACTTTGG CACAGGTGTC CAACGCCATT ATAATCGGAT TCAATGTAAG ACCTCCGGCC
AACGTTATTG ATGCGGCAAA GAAAGCAGGG GTTGATTTAA GGCTTTACAC AATTATATAC
AATGCAATTG AGGATATTGA AGCCGCTATG AAAGGAATGC TTGAACCAAC TTACAAGGAA
GTTGTAATCG GACATGTTGA AATAAGGCAG ATATTCAAAG TTTCCGGTGT AGGAACGGTT
GGCGGCGGCT ATGTAACTGA CGGAAAGATT ACAAGAAATG CCAATATCAG ACTTGTAAGG
GACGGAATAG TAGTTCATGA AGGCAAGCTT GGTTCATTAA AGAGATTTAA AGATGATGTG
AGAGAAGTTG CGGAAGGGTA TGAATGCGGA TTGTCTATAG AAAAGTTCAA TGATATAAAA
GAGGGAGACG TAGTTGAAGT CTATGTTATG GAAGAAGTAA AAGAGTAA
 
Protein sequence
MAKKRVYELA KELNTTSKRL MEKLEEINIV VKNHMSFLEE DELEALYDHI GVIRHKDDKS 
NTDDNKTASA HSVAQHSSEA MKELKKEAKK APRIIRTTEI YLDSKDEEIK EIKANDAKSQ
KKPEEKRKKN DFVRVETETS GLRPGLVRET KPEYMRILEE QKKSEASKAQ TNEKKDAEKN
SVKEVVKKEE GSKQTAEIKD GSVNMEGKVL EEVKATVADS ATNVNLNESI DKDKKTNDNR
QVSTDNSAVN NEENAADTLN KKDMDKKNNN KKNEAKKNAE KKNEAKKNEK NDNKGGNAKK
NEHRSPDMKK NDSNRPQDAN KQNSKAAADK NREEGRTGSK KSLEIPKVEL TTSQKEEFNS
QRAERREYNK DAEKDSKREL RKEQPRSAIS GGRNKNHKVI KNVFNSRKGV SEVLSDDFEM
DDFYFGGSKK SRKIKKKKEE KKEEKPAPPK PVVTSIKIAA PITVKELAEA LKKTSAEVIK
KLMSLGIMAT LNQELDFDTA AIVADEFGVK AEEEVVVNEE DILFDDSDDP NDPEAVPRPP
VVVVMGHVDH GKTSLLDAIK KTNVTEKEAG GITQHIGAYM VKINNRNITF LDTPGHEAFT
AMRARGAQVT DIAVLVVAAD DGVMPQTIEA INHAKAANVT IIVAINKIDK PTANPEKVKQ
ELTEYGLIPE EWGGDTIFVE VSAKKGVNID YLLEMILLAA DMLELKANPN KQAKGTVIEA
KLDKDKGPVA TVLVQRGTLC VGDSIIVGTT TGRIRAMTDD KGHRIKKAGP STPVEILGLH
EVPEAGETFY VITDEKTAKQ LIEKRKLKQR EQLLKASARV TLDDLFNQIK EGKVKELNII
VKADVQGSVE ALKQSLEKLS NDEVRVKIIH GGVGSVTETD VTLAQVSNAI IIGFNVRPPA
NVIDAAKKAG VDLRLYTIIY NAIEDIEAAM KGMLEPTYKE VVIGHVEIRQ IFKVSGVGTV
GGGYVTDGKI TRNANIRLVR DGIVVHEGKL GSLKRFKDDV REVAEGYECG LSIEKFNDIK
EGDVVEVYVM EEVKE
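
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative initiation codon and is read as methionine at the start position. The following is a minimal sketch of that translation, not the database's own pipeline: `translate_cds` is a hypothetical helper, and the codon table is built from the standard 64-letter amino acid string in TCAG order.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same as table 1 and table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace is ignored and translation stops at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "TTGGCAAAAA AGAGAGTTTA TGAGCTTGCC AAAGAACTTA ACACGACAAG TAAGCGACTG"
print(translate_cds(first_line))  # prints: MAKKRVYELAKELNTTSKRL
```

The output matches the first 20 residues of the protein sequence listed above. The sketch handles only unambiguous A/C/G/T input; codons containing other symbols would raise a `KeyError`.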