Gene Teth514_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_0120 
Symbol 
ID5877622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp114840 
End bp117773 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content34% 
IMG OID641540466 
Productsigma-54 factor interaction domain-containing protein 
Protein accessionYP_001661778 
Protein GI167038793 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1221] Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain
[COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type)
[COG3933] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGAAA GGATTTTTGA AATTATACAA AAAGAGGATA AAAAGAATCC TTTAACAGAT 
GACCAAATAG CGGCAATTCT CAATATTAAA AGAGAAGATG TGACACAATT TAGACTGAAA
AATAATATTC CTGACTCAAG AGAAAGAAGA AAACCTTACC TTTTAGAAGA TGCAAAAAAA
ATAATTTCAA AGGACCCAAA TATTTCAGAT AGAAATTTGA CAAAACAATT AAATAATTTA
GGATATAATA TTTCTCGTTT TGTAGCTACA CAAATAAAAA AGGAAATTTT AAAAGATAAA
ATTGCAGATA ATTTAGTGCA ACGAAGTACT ATTAAAAACG ATACTGTAAA TTTTGAAACT
CCTAAAACAG CTAAAAGCTT ACTCTCATTT AAAGAGATTA TTGGAAGCGA AGGAAGTCTT
AAAGTGCAGA TAAGCCTTGC AAAAGCTGCA GTACTTTATC CTCCCCATGG ATTGCATACT
TTAATAGTAG GACCATCTGG ATCTGGAAAA AGCCAGCTGG CTGAAGCAAT GTACAACTAT
GCAATAGAGT CTGGAAGATT TAATGAAAAT GCGCCTTTTG TAGTTTTTAA CTGTGCTGAT
TATGCGGATA ATCCACAACT ACTCATGGCG CAACTTTTTG GTTATGTAAA AGGCGCTTTC
ACAGGGGCTG ATACTGCAAA AGCGGGATTG GTAGAAAAAG CTGATGGTGG AATTTTGTTT
TTGGACGAAG TCCACAGACT CCCCAGTGAA GGACAAGAGA TACTCTTTTA TTTGTTGGAC
AAAGGAAAAT TCAGACGATT AGGGGAAACA GAAAGTACTA GAGAGGCACA AATTATGCTT
ATTGCTGCTA CTACTGAAAA TCCTGAATCT TCGCTATTAC TTACTTTTAG GAGAAGAATT
CCTATGGTGA TAGAACTTCC TTCTCTTTCA GAAAGACCTC CCCAAGAAAG ATATGAAATT
ATACATCATT TTTTCTCAAA AGAGTCTTTT AGAATTAATA AATCGATTAT TGTAAAAAAA
GAGGCAGTAA GAGCTCTTAT GCTTTATGAC TGTCCGGGGA ATATAGGGCA GCTAAGGAGC
GATATTCAAG TTGCTTGTGC GAGGAGCTTT TTAAATTCAT TAGGGAGTAA AAGCTCTTCA
TTGACAATAG ACCTTTCAGA TTTACCTAAC CACGTAAAAA TGGGGATAAT CAGAGTAAAT
AAAAGAGATC CCGAAATAGA AAGATATTCA AATGAAGACA TTATGGTGTA TCCTGACAAG
GAAATTAAGT TATATCCAAA AGAAGATAGA TATATGCTGC CTGATGAAAT ATATCAATTT
ATAGAAGAAA GGTTTATTGA CTTAAAAAGA CAAGGACTTA CAAAAGAGGA GATAGATAAA
ATTCTCGGGA AAGAGATAGA GGCGGAGCTG AAGAAATTTG CTGTAAATGT AAAATCTAAC
ATAACAATTT CAAAAAAAGA ATTGACAAAC ATAGTAGGAG AAAAAATTGT AAATGCTGTT
GAAAAAGCAT ACGAGATTGC TAAAAAAAGT TTTAAAAACT TAGAGGATAA TCTCTTTTAT
TCCCTTGCTA TACACTTAAG TGCTGCTTAT GAAAGAATAT TAAGTGGCAA ACCAATAATT
AATCCTCAAC TTGAAAACAT AGTGAAAGAA TATCCTGTTG AGTATTCTAT TGCTAAAATT
ATGGCAAAAC AAATTAACAA AGAATTAGAA ATACAGCTGC CTGATGAAGA AGTGGGCTTT
ATTGCTATGT ATCTAAGGAC ATTCTCAGGG GACAAAGAAA TAACTAAAGG CAGAGTGGGC
GTAGTTGTGC TTACGCACGG CCATGTTGCA AGTGGCATGG CAGAGGTGGC AAATAAACTC
TTAGGGGTAA ATCATGCAGT CGGTATAGAT ATGGCTCTTG ATGAAAGCCC AGAATCTGTT
CTTGAGAGGA CAATTGAAGT AGTGAAAAGA ATAGACGAAG GCAAAGGCTG CATTATTTTG
GTTGACATGG GTTCATTAAT TACCTTTGGT GAAATTATAA CCAAAAGGAC AGGGATACCT
ACAAGAGTGG TAGGAAGAGT AGATACGGTT ATGGTGTTGG AAGCTGTCAG AAGAGCTATT
ATACCGGATA CTAATTTAGA TGAAATAGCA GATGCGTTAG ACGTAGATAA AACATATATA
GGAAGAGTTG AAGGTATAAA GAGTAAAGAT AAGCTTCCAA AAGCAATTGT AACTGTATGT
ATTACAGGAG AGGGAACAGC GTTAAAAATA AAGAAATATA TTGAAGATGT CTTACCACAA
CTAAAAGATG ATTACAAAAT AATACCTGTT GGAATGCTCA GGCAGGAAGA TATTGAAAAA
GAAATTTCAA AAATAAGAGA AGAAAATGAA GTGGTAGCTT TTGTTGGTAC AATAAATCCC
GGCATAAAAA GTATACCTTT TATATCTGTT GAAGATATTT TACATGGCAC AGGGATTGAA
AAGCTTAAAA AAATCCTCAA TTTAAAGGTT GAAAATCCAT TGAAAGAAAT CATTGATGAA
AACCTTATCT TATGTGATGT GGATATTTCT ATGAAAAGTG ATGTCATAGA TAAATTAGTA
GAATTGCTTC AAGATAAAGG TTATGTAGAT GATAAGTTTT TGCTAAGTGT ATACAAAAGA
GAATCAATGG GAGCCACCTG GATGAAAGGC GGTATAGCTA TACCACATGG ATACACGAAA
AACGTTACGA AATCTGCCAT AGCAATTGCA AAATTAAAAA AACCTATTTT TTGGGAAGGA
GAGCTTAAAG CGGATTTAGT TTTTATGATA GCGTTAAAAG AAGATTCGAA GGATTATATG
CTTGATTTAT ATAAAGTAAT GACAGACGAA AAGATTGTAA ATGCTTTGAA GGGAGCTAAA
AGTCCGGTTC AAATAAAAGA AATAATATTA AAAAATACAC TACCGGCCAA TTAA
 
Protein sequence
MIERIFEIIQ KEDKKNPLTD DQIAAILNIK REDVTQFRLK NNIPDSRERR KPYLLEDAKK 
IISKDPNISD RNLTKQLNNL GYNISRFVAT QIKKEILKDK IADNLVQRST IKNDTVNFET
PKTAKSLLSF KEIIGSEGSL KVQISLAKAA VLYPPHGLHT LIVGPSGSGK SQLAEAMYNY
AIESGRFNEN APFVVFNCAD YADNPQLLMA QLFGYVKGAF TGADTAKAGL VEKADGGILF
LDEVHRLPSE GQEILFYLLD KGKFRRLGET ESTREAQIML IAATTENPES SLLLTFRRRI
PMVIELPSLS ERPPQERYEI IHHFFSKESF RINKSIIVKK EAVRALMLYD CPGNIGQLRS
DIQVACARSF LNSLGSKSSS LTIDLSDLPN HVKMGIIRVN KRDPEIERYS NEDIMVYPDK
EIKLYPKEDR YMLPDEIYQF IEERFIDLKR QGLTKEEIDK ILGKEIEAEL KKFAVNVKSN
ITISKKELTN IVGEKIVNAV EKAYEIAKKS FKNLEDNLFY SLAIHLSAAY ERILSGKPII
NPQLENIVKE YPVEYSIAKI MAKQINKELE IQLPDEEVGF IAMYLRTFSG DKEITKGRVG
VVVLTHGHVA SGMAEVANKL LGVNHAVGID MALDESPESV LERTIEVVKR IDEGKGCIIL
VDMGSLITFG EIITKRTGIP TRVVGRVDTV MVLEAVRRAI IPDTNLDEIA DALDVDKTYI
GRVEGIKSKD KLPKAIVTVC ITGEGTALKI KKYIEDVLPQ LKDDYKIIPV GMLRQEDIEK
EISKIREENE VVAFVGTINP GIKSIPFISV EDILHGTGIE KLKKILNLKV ENPLKEIIDE
NLILCDVDIS MKSDVIDKLV ELLQDKGYVD DKFLLSVYKR ESMGATWMKG GIAIPHGYTK
NVTKSAIAIA KLKKPIFWEG ELKADLVFMI ALKEDSKDYM LDLYKVMTDE KIVNALKGAK
SPVQIKEIIL KNTLPAN