Gene Information |
Locus tag | Teth514_0120 |
Symbol | |
ID | 5877622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 114840 |
End bp | 117773 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641540466 |
Product | sigma-54 factor interaction domain-containing protein |
Protein accession | YP_001661778 |
Protein GI | 167038793 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG1221] Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain [COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type) [COG3933] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
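As a quick consistency check on the coordinates above, the following minimal Python sketch recomputes the gene and protein lengths from the start and end positions. It assumes the coordinates are 1-based and inclusive and that the final codon is a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 114840
end_bp = 117773

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and its last
# codon is a stop codon that does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # 2934 bp
print(protein_length_aa, "aa")  # 977 aa
```

Under those assumptions the computed values agree with the reported gene length (2934 bp) and protein length (977 aa).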
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAA GGATTTTTGA AATTATACAA AAAGAGGATA AAAAGAATCC TTTAACAGAT GACCAAATAG CGGCAATTCT CAATATTAAA AGAGAAGATG TGACACAATT TAGACTGAAA AATAATATTC CTGACTCAAG AGAAAGAAGA AAACCTTACC TTTTAGAAGA TGCAAAAAAA ATAATTTCAA AGGACCCAAA TATTTCAGAT AGAAATTTGA CAAAACAATT AAATAATTTA GGATATAATA TTTCTCGTTT TGTAGCTACA CAAATAAAAA AGGAAATTTT AAAAGATAAA ATTGCAGATA ATTTAGTGCA ACGAAGTACT ATTAAAAACG ATACTGTAAA TTTTGAAACT CCTAAAACAG CTAAAAGCTT ACTCTCATTT AAAGAGATTA TTGGAAGCGA AGGAAGTCTT AAAGTGCAGA TAAGCCTTGC AAAAGCTGCA GTACTTTATC CTCCCCATGG ATTGCATACT TTAATAGTAG GACCATCTGG ATCTGGAAAA AGCCAGCTGG CTGAAGCAAT GTACAACTAT GCAATAGAGT CTGGAAGATT TAATGAAAAT GCGCCTTTTG TAGTTTTTAA CTGTGCTGAT TATGCGGATA ATCCACAACT ACTCATGGCG CAACTTTTTG GTTATGTAAA AGGCGCTTTC ACAGGGGCTG ATACTGCAAA AGCGGGATTG GTAGAAAAAG CTGATGGTGG AATTTTGTTT TTGGACGAAG TCCACAGACT CCCCAGTGAA GGACAAGAGA TACTCTTTTA TTTGTTGGAC AAAGGAAAAT TCAGACGATT AGGGGAAACA GAAAGTACTA GAGAGGCACA AATTATGCTT ATTGCTGCTA CTACTGAAAA TCCTGAATCT TCGCTATTAC TTACTTTTAG GAGAAGAATT CCTATGGTGA TAGAACTTCC TTCTCTTTCA GAAAGACCTC CCCAAGAAAG ATATGAAATT ATACATCATT TTTTCTCAAA AGAGTCTTTT AGAATTAATA AATCGATTAT TGTAAAAAAA GAGGCAGTAA GAGCTCTTAT GCTTTATGAC TGTCCGGGGA ATATAGGGCA GCTAAGGAGC GATATTCAAG TTGCTTGTGC GAGGAGCTTT TTAAATTCAT TAGGGAGTAA AAGCTCTTCA TTGACAATAG ACCTTTCAGA TTTACCTAAC CACGTAAAAA TGGGGATAAT CAGAGTAAAT AAAAGAGATC CCGAAATAGA AAGATATTCA AATGAAGACA TTATGGTGTA TCCTGACAAG GAAATTAAGT TATATCCAAA AGAAGATAGA TATATGCTGC CTGATGAAAT ATATCAATTT ATAGAAGAAA GGTTTATTGA CTTAAAAAGA CAAGGACTTA CAAAAGAGGA GATAGATAAA ATTCTCGGGA AAGAGATAGA GGCGGAGCTG AAGAAATTTG CTGTAAATGT AAAATCTAAC ATAACAATTT CAAAAAAAGA ATTGACAAAC ATAGTAGGAG AAAAAATTGT AAATGCTGTT GAAAAAGCAT ACGAGATTGC TAAAAAAAGT TTTAAAAACT TAGAGGATAA TCTCTTTTAT TCCCTTGCTA TACACTTAAG TGCTGCTTAT GAAAGAATAT TAAGTGGCAA ACCAATAATT AATCCTCAAC TTGAAAACAT AGTGAAAGAA TATCCTGTTG AGTATTCTAT TGCTAAAATT ATGGCAAAAC AAATTAACAA AGAATTAGAA ATACAGCTGC CTGATGAAGA AGTGGGCTTT ATTGCTATGT ATCTAAGGAC ATTCTCAGGG GACAAAGAAA TAACTAAAGG CAGAGTGGGC 
GTAGTTGTGC TTACGCACGG CCATGTTGCA AGTGGCATGG CAGAGGTGGC AAATAAACTC TTAGGGGTAA ATCATGCAGT CGGTATAGAT ATGGCTCTTG ATGAAAGCCC AGAATCTGTT CTTGAGAGGA CAATTGAAGT AGTGAAAAGA ATAGACGAAG GCAAAGGCTG CATTATTTTG GTTGACATGG GTTCATTAAT TACCTTTGGT GAAATTATAA CCAAAAGGAC AGGGATACCT ACAAGAGTGG TAGGAAGAGT AGATACGGTT ATGGTGTTGG AAGCTGTCAG AAGAGCTATT ATACCGGATA CTAATTTAGA TGAAATAGCA GATGCGTTAG ACGTAGATAA AACATATATA GGAAGAGTTG AAGGTATAAA GAGTAAAGAT AAGCTTCCAA AAGCAATTGT AACTGTATGT ATTACAGGAG AGGGAACAGC GTTAAAAATA AAGAAATATA TTGAAGATGT CTTACCACAA CTAAAAGATG ATTACAAAAT AATACCTGTT GGAATGCTCA GGCAGGAAGA TATTGAAAAA GAAATTTCAA AAATAAGAGA AGAAAATGAA GTGGTAGCTT TTGTTGGTAC AATAAATCCC GGCATAAAAA GTATACCTTT TATATCTGTT GAAGATATTT TACATGGCAC AGGGATTGAA AAGCTTAAAA AAATCCTCAA TTTAAAGGTT GAAAATCCAT TGAAAGAAAT CATTGATGAA AACCTTATCT TATGTGATGT GGATATTTCT ATGAAAAGTG ATGTCATAGA TAAATTAGTA GAATTGCTTC AAGATAAAGG TTATGTAGAT GATAAGTTTT TGCTAAGTGT ATACAAAAGA GAATCAATGG GAGCCACCTG GATGAAAGGC GGTATAGCTA TACCACATGG ATACACGAAA AACGTTACGA AATCTGCCAT AGCAATTGCA AAATTAAAAA AACCTATTTT TTGGGAAGGA GAGCTTAAAG CGGATTTAGT TTTTATGATA GCGTTAAAAG AAGATTCGAA GGATTATATG CTTGATTTAT ATAAAGTAAT GACAGACGAA AAGATTGTAA ATGCTTTGAA GGGAGCTAAA AGTCCGGTTC AAATAAAAGA AATAATATTA AAAAATACAC TACCGGCCAA TTAA
|
Protein sequence | MIERIFEIIQ KEDKKNPLTD DQIAAILNIK REDVTQFRLK NNIPDSRERR KPYLLEDAKK IISKDPNISD RNLTKQLNNL GYNISRFVAT QIKKEILKDK IADNLVQRST IKNDTVNFET PKTAKSLLSF KEIIGSEGSL KVQISLAKAA VLYPPHGLHT LIVGPSGSGK SQLAEAMYNY AIESGRFNEN APFVVFNCAD YADNPQLLMA QLFGYVKGAF TGADTAKAGL VEKADGGILF LDEVHRLPSE GQEILFYLLD KGKFRRLGET ESTREAQIML IAATTENPES SLLLTFRRRI PMVIELPSLS ERPPQERYEI IHHFFSKESF RINKSIIVKK EAVRALMLYD CPGNIGQLRS DIQVACARSF LNSLGSKSSS LTIDLSDLPN HVKMGIIRVN KRDPEIERYS NEDIMVYPDK EIKLYPKEDR YMLPDEIYQF IEERFIDLKR QGLTKEEIDK ILGKEIEAEL KKFAVNVKSN ITISKKELTN IVGEKIVNAV EKAYEIAKKS FKNLEDNLFY SLAIHLSAAY ERILSGKPII NPQLENIVKE YPVEYSIAKI MAKQINKELE IQLPDEEVGF IAMYLRTFSG DKEITKGRVG VVVLTHGHVA SGMAEVANKL LGVNHAVGID MALDESPESV LERTIEVVKR IDEGKGCIIL VDMGSLITFG EIITKRTGIP TRVVGRVDTV MVLEAVRRAI IPDTNLDEIA DALDVDKTYI GRVEGIKSKD KLPKAIVTVC ITGEGTALKI KKYIEDVLPQ LKDDYKIIPV GMLRQEDIEK EISKIREENE VVAFVGTINP GIKSIPFISV EDILHGTGIE KLKKILNLKV ENPLKEIIDE NLILCDVDIS MKSDVIDKLV ELLQDKGYVD DKFLLSVYKR ESMGATWMKG GIAIPHGYTK NVTKSAIAIA KLKKPIFWEG ELKADLVFMI ALKEDSKDYM LDLYKVMTDE KIVNALKGAK SPVQIKEIIL KNTLPAN
|
| |
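The gene and protein sequences above can be cross-checked against each other. The sketch below is a minimal, hypothetical helper (not part of the database) that translates the first 30 and last 15 nucleotides of the gene sequence and computes GC content for a short fragment. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard table for sense and stop codons; the function names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
# Assumption: table 11 uses these same assignments (it differs in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 15 nucleotides, copied from the gene sequence above.
first_30 = "ATGATTGAAA GGATTTTTGA AATTATACAA"
last_15 = "CTACCGGCCAATTAA"

print(translate(first_30))  # MIERIFEIIQ
print(translate(last_15))   # LPAN*
print(gc_percent("ATGATTGAAA"))  # 20.0 for this 10-base fragment only
```

The two translated fragments match the start (`MIERIFEIIQ`) and end (`LPAN`) of the listed protein sequence. Passing the full gene sequence to `gc_percent` would be the way to compare against the reported GC content; the fragment value shown here is not representative of the whole gene.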