Gene Information |
Locus tag | Cthe_0991 |
Symbol | |
ID | 4811285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1183983 |
End bp | 1187090 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106409 |
Product | translation initiation factor 2 |
Protein accession | YP_001037416 |
Protein GI | 125973506 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
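The length fields above can be cross-checked against the coordinates. The sketch below is illustrative and not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions agree with the values listed here. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1183983, 1187090)
print(length)                  # 3108
print(protein_length(length))  # 1035
```

Because the gene lies on the minus strand, the start and end values give its extent on the replicon; the length calculation is the same for either strand.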
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0315386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCAAAAA AGAGAGTTTA TGAGCTTGCC AAAGAACTTA ACACGACAAG TAAGCGACTG ATGGAAAAAC TGGAAGAAAT AAATATAGTC GTCAAGAATC ACATGAGCTT TTTAGAGGAG GATGAACTGG AAGCTTTATA TGACCATATA GGCGTTATCA GACATAAAGA TGACAAATCC AATACGGATG ATAATAAAAC TGCTTCAGCT CATTCAGTAG CCCAACATAG CAGCGAGGCC ATGAAAGAGC TCAAAAAAGA AGCAAAAAAA GCGCCGAGGA TTATTAGAAC AACGGAAATC TATCTTGATT CAAAAGATGA GGAAATAAAG GAAATAAAGG CAAATGATGC CAAGAGTCAA AAGAAGCCCG AAGAGAAACG TAAGAAAAAT GATTTTGTCA GAGTTGAGAC TGAGACTTCG GGTTTAAGAC CCGGTCTCGT AAGGGAAACG AAGCCGGAAT ATATGAGGAT TCTTGAAGAA CAGAAAAAAA GTGAAGCATC AAAAGCTCAA ACCAATGAGA AAAAGGATGC TGAAAAAAAC AGTGTAAAAG AAGTTGTAAA AAAAGAGGAA GGTTCAAAAC AAACTGCTGA AATTAAGGAT GGCTCTGTGA ATATGGAAGG CAAAGTGTTA GAAGAAGTAA AAGCTACCGT GGCGGATAGC GCTACCAATG TTAATTTGAA TGAAAGCATT GACAAAGATA AAAAAACAAA TGACAACAGG CAGGTTTCAA CGGATAACAG TGCGGTAAAC AATGAGGAAA ATGCTGCGGA CACTTTAAAT AAAAAAGATA TGGATAAGAA AAATAATAAT AAAAAGAATG AAGCAAAAAA GAATGCTGAA AAGAAAAACG AGGCAAAAAA GAATGAAAAA AATGACAACA AAGGTGGCAA TGCAAAGAAA AATGAACATA GAAGCCCTGA TATGAAGAAA AATGATTCCA ATCGCCCGCA GGATGCAAAC AAGCAGAATA GCAAAGCCGC TGCCGATAAA AATCGGGAGG AAGGTCGTAC TGGATCAAAG AAGTCTTTGG AAATACCAAA GGTTGAACTT ACCACTTCCC AAAAAGAAGA GTTTAACTCA CAGCGTGCCG AAAGACGTGA GTACAATAAA GATGCCGAAA AAGACTCCAA GCGGGAGCTT AGAAAGGAAC AGCCGAGATC CGCAATCAGC GGAGGAAGAA ATAAAAATCA CAAAGTAATA AAAAATGTTT TTAATTCCAG AAAAGGAGTT TCCGAAGTAT TATCCGATGA TTTTGAGATG GATGATTTTT ACTTCGGTGG TTCAAAGAAA AGCAGGAAGA TAAAAAAGAA GAAAGAAGAG AAAAAAGAGG AAAAACCGGC TCCGCCAAAG CCTGTGGTTA CGTCAATAAA AATTGCCGCG CCCATTACTG TCAAGGAATT GGCAGAAGCT CTTAAAAAGA CATCAGCTGA AGTAATAAAG AAACTAATGT CTTTGGGAAT CATGGCAACG TTAAATCAGG AACTGGATTT TGATACGGCG GCTATAGTTG CGGACGAATT TGGGGTGAAG GCGGAAGAGG AAGTTGTTGT AAACGAGGAG GATATCCTCT TTGACGATTC CGATGATCCG AACGATCCTG AGGCGGTGCC AAGACCTCCT GTGGTGGTTG TTATGGGACA TGTTGACCAC GGAAAAACAT CGCTTCTGGA TGCAATCAAG AAAACAAATG TTACGGAAAA AGAAGCCGGT GGAATAACTC AGCATATAGG TGCTTACATG GTGAAAATAA ATAACAGAAA TATTACGTTC CTCGATACTC CGGGTCACGA AGCTTTTACG 
GCCATGAGAG CAAGAGGTGC CCAGGTTACT GACATTGCCG TTTTGGTTGT GGCCGCGGAT GACGGTGTCA TGCCTCAGAC AATCGAGGCA ATAAACCATG CAAAGGCCGC AAATGTTACG ATCATTGTTG CAATTAACAA AATTGACAAG CCGACTGCAA ACCCGGAAAA AGTCAAGCAG GAATTGACAG AATATGGACT CATTCCTGAG GAATGGGGCG GCGATACAAT TTTTGTTGAA GTTTCTGCAA AAAAAGGAGT TAATATCGAC TATCTGCTGG AAATGATTCT TTTGGCTGCG GATATGCTGG AGCTTAAGGC AAATCCGAAC AAACAGGCAA AAGGTACCGT TATTGAGGCA AAACTTGACA AAGATAAAGG GCCTGTTGCA ACGGTGCTTG TACAGAGGGG AACATTATGT GTTGGTGATT CCATTATAGT CGGCACCACT ACCGGAAGAA TAAGGGCAAT GACGGACGAT AAAGGCCACA GAATCAAAAA GGCCGGACCT TCAACACCGG TTGAGATTCT TGGATTGCAT GAAGTTCCCG AAGCTGGGGA AACATTCTAT GTGATAACCG ACGAAAAAAC TGCAAAACAA TTAATAGAAA AGAGAAAACT AAAACAAAGA GAACAGCTGC TTAAAGCCAG CGCAAGAGTT ACTCTCGATG ATCTGTTCAA TCAGATAAAA GAGGGTAAGG TAAAAGAACT GAATATCATT GTAAAAGCCG ATGTTCAAGG TTCCGTTGAG GCTTTGAAAC AGTCTTTGGA GAAGCTTAGC AATGATGAAG TCAGAGTAAA AATTATTCAC GGAGGAGTAG GTTCGGTAAC CGAAACCGAC GTTACTTTGG CACAGGTGTC CAACGCCATT ATAATCGGAT TCAATGTAAG ACCTCCGGCC AACGTTATTG ATGCGGCAAA GAAAGCAGGG GTTGATTTAA GGCTTTACAC AATTATATAC AATGCAATTG AGGATATTGA AGCCGCTATG AAAGGAATGC TTGAACCAAC TTACAAGGAA GTTGTAATCG GACATGTTGA AATAAGGCAG ATATTCAAAG TTTCCGGTGT AGGAACGGTT GGCGGCGGCT ATGTAACTGA CGGAAAGATT ACAAGAAATG CCAATATCAG ACTTGTAAGG GACGGAATAG TAGTTCATGA AGGCAAGCTT GGTTCATTAA AGAGATTTAA AGATGATGTG AGAGAAGTTG CGGAAGGGTA TGAATGCGGA TTGTCTATAG AAAAGTTCAA TGATATAAAA GAGGGAGACG TAGTTGAAGT CTATGTTATG GAAGAAGTAA AAGAGTAA
|
Protein sequence | MAKKRVYELA KELNTTSKRL MEKLEEINIV VKNHMSFLEE DELEALYDHI GVIRHKDDKS NTDDNKTASA HSVAQHSSEA MKELKKEAKK APRIIRTTEI YLDSKDEEIK EIKANDAKSQ KKPEEKRKKN DFVRVETETS GLRPGLVRET KPEYMRILEE QKKSEASKAQ TNEKKDAEKN SVKEVVKKEE GSKQTAEIKD GSVNMEGKVL EEVKATVADS ATNVNLNESI DKDKKTNDNR QVSTDNSAVN NEENAADTLN KKDMDKKNNN KKNEAKKNAE KKNEAKKNEK NDNKGGNAKK NEHRSPDMKK NDSNRPQDAN KQNSKAAADK NREEGRTGSK KSLEIPKVEL TTSQKEEFNS QRAERREYNK DAEKDSKREL RKEQPRSAIS GGRNKNHKVI KNVFNSRKGV SEVLSDDFEM DDFYFGGSKK SRKIKKKKEE KKEEKPAPPK PVVTSIKIAA PITVKELAEA LKKTSAEVIK KLMSLGIMAT LNQELDFDTA AIVADEFGVK AEEEVVVNEE DILFDDSDDP NDPEAVPRPP VVVVMGHVDH GKTSLLDAIK KTNVTEKEAG GITQHIGAYM VKINNRNITF LDTPGHEAFT AMRARGAQVT DIAVLVVAAD DGVMPQTIEA INHAKAANVT IIVAINKIDK PTANPEKVKQ ELTEYGLIPE EWGGDTIFVE VSAKKGVNID YLLEMILLAA DMLELKANPN KQAKGTVIEA KLDKDKGPVA TVLVQRGTLC VGDSIIVGTT TGRIRAMTDD KGHRIKKAGP STPVEILGLH EVPEAGETFY VITDEKTAKQ LIEKRKLKQR EQLLKASARV TLDDLFNQIK EGKVKELNII VKADVQGSVE ALKQSLEKLS NDEVRVKIIH GGVGSVTETD VTLAQVSNAI IIGFNVRPPA NVIDAAKKAG VDLRLYTIIY NAIEDIEAAM KGMLEPTYKE VVIGHVEIRQ IFKVSGVGTV GGGYVTDGKI TRNANIRLVR DGIVVHEGKL GSLKRFKDDV REVAEGYECG LSIEKFNDIK EGDVVEVYVM EEVKE
|
| |
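The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses only the Python standard library and is not the database's own method. The codon table is built from the usual compact amino-acid string in T, C, A, G order, and the set of initiator codons is the one commonly listed for translation table 11; treat that set as an assumption. The gene begins with TTG, which is read as methionine when it serves as the start codon. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, third base varying fastest, in T, C, A, G order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons assumed for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGGCAAAAA AGAGAGTTTA TGAGCTTGCC"))  # MAKKRVYELA
```

The printed residues match the first block of the protein sequence in this record. Passing the last 18 bases of the gene (`GAAGAAGTAA AAGAGTAA`) gives `EEVKE`, which matches the end of the protein sequence, with the final TAA read as the stop codon.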