Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_1827 |
Symbol | |
ID | 5877820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 1838828 |
End bp | 1841926 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641542180 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001663445 |
Protein GI | 167040460 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGGA AAGTTATATT TAACTTTGAT GATAATTTAG AATACCAAAA GAAAGCTATC AACTCAGTTG TAGCCCTCTT TAGAGGACAA GATCGGGAGC TGGGAGATGT AATTTATCGT GGGAAAACCC GACATATAGG ACGTATATCT GAGGAAATTA TAAGGAATAG ATTGGACATA GGAAGAAATG TTATTCTAGA AAATTTACGA GAAATACAAA CGCAAAATAT GCTATTTCCC TCTGCTGATT TACAGCCTAT ATATAATTTT TCTGTAGAAA TGGAAACAGG TACTGGTAAA ACTTACGTAT ATTTGCGGAC TATTTTGGAG CTCTATAAAA ATTATAATTT TTTAAAGTTT ATAATTGTAG TGCCTACAGT GGCTATTCGT AAAGGAGTAG AAAAAAATAT TGAAATTTTG AAAGAACATT TAAAAACCCT TTATGACCTT GACATTTCAA ATTATGCTTT CGTTTATGAT TCAAATAACT TAAATAAGTT AATGGACTTT GTGGAAGCTA GGGATTTACG CATTGTAATT ATGAATATAC AGGCATTTAA TAAAGACAGC AATAAAATTC GCAGAGAAGA TGAAAGAGGG CGAGTTTTGT GGGACCTAAT AAAATATACT AAACCAATTG TCATTATAGA TGAACCACAG CGACTTGAAG GAAATGGGAA AAAGAAAAGT GCTTCTTTAA AGGCTATTGA AGAATTGGAT CCGCTATTCA TTTTAAGGTA TTCAGCAACT CATAAAAAGC TTTACCATCA AGTATATAAA CTTGATTCGT ACCAGGCTTA TAAACAGGAT TTAGTAAAGA AAATTGAGGT TAAAACTGTT TATGGCAATA TCAGTAAAGA CTACCCTTAT GTTCGCTATA TAGAATTTAC CCGGGACTTA AAAGCAAAAA TTGAAATTTT CTATCGTGAA CCGGGCGGGC AGGTTAAATT ACGTACCTTC AATGTGCAAA AAGGGGTAAG CCTTTATGAA TTATCCGGAG AATTATCCCA ATATAGAGAT ATGATTGTAT TGGAAGACCC GCACAAAATT AACGGATTGA AAATTGGTTA TAAGGAAGGT ATCTTAATAT TAAAAGAAGG AGAAAATAAT TATAACTTAG ATGAACTGGA CTTAATCAGA ATATTAATCC GTTTAACTAT ACAAACGCAT TTAACAAAGC AACTAAAAAT TTTAGAAAAA GGCTATAAAA TCAAAGTTTT GTCCCTTTTC TTTATAGATC GGGTTAAAAA TTTTAGAGAT AGTGAAGCAC CTGACGGCCG AGGAATTTAT GCCCGCATTT TTGATGAGGA ATATGAGAAA GTAATTAAAG ATCCAAGATA TAATGAACTT TTTAAAAAAT ATCCTGATTT ATTTCCTGAA TACAAAAATG TGCAAAAGGT AAGAGAAGGT TATTTTGCAC GGGATAAGAA AAATAATGAA ACTGAGATAG AGGATTGGGA CTGGGGGAAA GATGAATTAG AGGTAAAAGC TAAATCTCAG GAAGATATTG AACGTGGAAT ACAACTTATT TTAGAGAAAA AGGATGAACT AATTTCTTTT GATGAACCAT TGGCATTTAT TTTTTCTCAT TCAGCTTTAA GAGAAGGCTG GGATAACCCT AATGTTTTCC AACTGTGTAC ATTAAAAAAA GGAAGTTCAG AAATTGCTAA AAAGCAAGAA ATAGGACGAG GCTTGCGACT GCCAGTGGAT ATTTATGGAA ACCGGTGTTT TGACAGTGAA ATAAATGTTT TGACTGTAAT TGCCAATGAC TATTACGATC ACTTCGCTGA AGCCTTGCAA AAAGATTTTA GATTGGAGGA AGGACTAGAT CAGGAAGAAG TTACTTTTGA AGTTATTTAT CAAACCTTGG CAGACGCTGG GATTCCAGAG GAATTTCTTG TGGCAGATAC AGTAGAAGCT TTTAGGACTG AACTTATTAA TGCTGGAATT ATTAATGCTA AAACTAACCA ACTAACGAAA GATGCAGCAG AGATAATATA TTATGATTTT ACAAATGCAA TTTTACAACC GTATCAGGAA GCTATTAAAG AGAAATTTAT AGAAAATATG AAAGCAAAAG GAAGTAAGCG AATTGAGATT AAAAATGGTG ATGAAGAACC TGTGGAAAAT GGATATAACA GTTTTGTAAC GGAAAAAGAA TTTCAAAAGC TTTTAGCAGA ATTACGTGAA CGAATAAACA AAAGAACAAT TTACCATGTA AACATTGATA AAGAGGAGTT TATTAAACAA AGTATAGAAG AAATCAATAA ACATCTTACT TATAAATCTA TTTATCGTCA ATATGAAGTA GCAACAGGAG GTATAGATTT CAAAAAATCT AATGAGTTGC AAATACATGA TCCTGCTAAA TTTACAATTA ATCTTGGAAC AGAAGCAATA GAAAATAATA AAAGTGATTT TGAAATAATA AATTATCTCA TGCATCAAAC TTTCTTGCCA AGGAAAGCTT TAATCAGAAT TTATTCAGCT TTAGAAAATA AAATTTTATT GCAGAAACAA GAGATTTTAG ATGAAGTAAT TAAGATTCTC CAGAATAAAT TGAATGAATT CCAGGCTAAG AATATAGAAT ATGAGGTTTT GGATGGAGTT GTGTTTGAAG AGAAGAATAT TTTTGCTGTG GATGAAATTG AACAATGGAT GCTAAATGAA AGTGCTAAAA AAGCATATAA AACAAAAGAA GAATACCGAA AAGCTCTCCA GAAATATATC CGTGTAGATA GCCATGGGGA GTATGAATTT GCTAGAAGCC TTGATGAAGA TCCTGATGTT CTTCTCTTCA CAAAATTAAA AAAGGGAGGA CTTGTAATTG AAACCCCTTA TGGAAATTAT ACCCCTGACT GGGCAATAAT CCATAAAGTT GATGATCATA GAGCAAAGTT GTATTTTATT GTAGAGTCAA AATTTGATAA AGAAAGAGTT AATTTAACCG ATGTGGAAAA AGCCAAAATT GAATGTGCCC GTAAACATTT TGCCACCGTA TCCAGTGATG TAGTTTTTGA TTGGGTCAAT AGCTATGAAA GATTTAAAGA TATTGTAAAT AGAAGTATAC AAAAAAATAA TGTAGTGAAA GGAAGTTAA
|
Protein sequence | MTRKVIFNFD DNLEYQKKAI NSVVALFRGQ DRELGDVIYR GKTRHIGRIS EEIIRNRLDI GRNVILENLR EIQTQNMLFP SADLQPIYNF SVEMETGTGK TYVYLRTILE LYKNYNFLKF IIVVPTVAIR KGVEKNIEIL KEHLKTLYDL DISNYAFVYD SNNLNKLMDF VEARDLRIVI MNIQAFNKDS NKIRREDERG RVLWDLIKYT KPIVIIDEPQ RLEGNGKKKS ASLKAIEELD PLFILRYSAT HKKLYHQVYK LDSYQAYKQD LVKKIEVKTV YGNISKDYPY VRYIEFTRDL KAKIEIFYRE PGGQVKLRTF NVQKGVSLYE LSGELSQYRD MIVLEDPHKI NGLKIGYKEG ILILKEGENN YNLDELDLIR ILIRLTIQTH LTKQLKILEK GYKIKVLSLF FIDRVKNFRD SEAPDGRGIY ARIFDEEYEK VIKDPRYNEL FKKYPDLFPE YKNVQKVREG YFARDKKNNE TEIEDWDWGK DELEVKAKSQ EDIERGIQLI LEKKDELISF DEPLAFIFSH SALREGWDNP NVFQLCTLKK GSSEIAKKQE IGRGLRLPVD IYGNRCFDSE INVLTVIAND YYDHFAEALQ KDFRLEEGLD QEEVTFEVIY QTLADAGIPE EFLVADTVEA FRTELINAGI INAKTNQLTK DAAEIIYYDF TNAILQPYQE AIKEKFIENM KAKGSKRIEI KNGDEEPVEN GYNSFVTEKE FQKLLAELRE RINKRTIYHV NIDKEEFIKQ SIEEINKHLT YKSIYRQYEV ATGGIDFKKS NELQIHDPAK FTINLGTEAI ENNKSDFEII NYLMHQTFLP RKALIRIYSA LENKILLQKQ EILDEVIKIL QNKLNEFQAK NIEYEVLDGV VFEEKNIFAV DEIEQWMLNE SAKKAYKTKE EYRKALQKYI RVDSHGEYEF ARSLDEDPDV LLFTKLKKGG LVIETPYGNY TPDWAIIHKV DDHRAKLYFI VESKFDKERV NLTDVEKAKI ECARKHFATV SSDVVFDWVN SYERFKDIVN RSIQKNNVVK GS
|
| |