Gene Teth514_1966 details

Gene Information

Locus tag: Teth514_1966
Symbol:
ID: 5878102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 1977170
End bp: 1978750
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 36%
IMG OID: 641542314
Product: sigma-54 factor interaction domain-containing protein
Protein accession: YP_001663578
Protein GI: 167040593
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
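
The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1977170, 1978750)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under these assumptions the results agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields.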


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAG CTTTTATTGA TTCTTTTATT GATTCTCTTA AAGAAACTTT TGGCGTTGTG 
ACCAATCTTC CGGAAAAATG TTTCTCATTT ATTTCACTTC CGAAAACAGG CTATCGTGTT
TGGGCACCAC CCTGTACGCC CACTTGCCCG GAAGGCACTG CTTGCCCTTA TGAGCTAATA
CTTTCCTTTA AACTAGAAGA AGATATTTTT GTGGTTGCAT TGCCTAAAAA CATTGATAGA
GATTTAAAGC TAACACCCGC ATTCCTTGTT AAATTTCTTT TGACTGTATT GGAAATGTTA
TATATTCCTG AACAAAAATC CGATAATTTG TTAATCGCGG AAGCATTGCA ATTTTTGGCA
AATCAATTTA ATATAGAGTT TTTCCTTTTT GACAATAAAG GACAACTAAT TTTAAGTCAA
ACTCATTCGC CTATCCAACT TTCTACCTTT ACCCTTTGGG AAAAGGTAAG TCAGACAGGT
GAGCCAGAAG GTTTTATAAA AACCACATGG GGAAACTTTT ATTATCGTAT TTTTACAAAA
GATGGAATAA CAACGGGTTC GATAGTCTGG AAAACAGAAC CAACCAAGCA GGAAGAAAAC
AAATTCCTTT TTCACGACGA TTCTTTTTTT ACTATTCCAG GTAAAAACCC TCGTTTTTTA
GAAATAAAAG ACCTTGTACG ACAAATCGCA GTAACCGACA GCACAGTACT ACTTCAAGGT
GAAAGTGGCA CTGGTAAAGA ACTCTTTGCC CGCTATATTC ACTACTTAAG TCCTCGGAAA
AACCATCCTT TTATTGCAAT AAACTGTGCT GCAATTCCCG ATAACCTTCT TGAAAGCGAA
CTTTTCGGCT ACGAAGACGG CTCATTTACA GGTGCTCGTC GTGGCGGCAA GCCTGGTAAA
TTTGAACTTG CTAATGGTGG AACAATTTTT CTTGATGAAA TTGGTGACAT GCCTCTACAA
TTACAAGCAA AACTTTTACG GGTTTTGGAG AACCGACAAG TAGAAAGAAT TGGAGCTACT
GTTTCCCGTC CAATTGATGT AAGAATTATT GCGGCAACTA ATAAAGATTT AAAAGAATTA
ATTGATGCTA AACTGTTCAG AGAAGACCTA TTCTTCCGCT TAAATGTTTT TCCAGTTTTC
ATACCACCAC TACGTGAGCG TCGAGATGAT ATACTTCTCC TTTTAGACTT TTATTTAAAA
AATATTTGTC TTGAACAGGA TAAGCCATTT AAAATTTTCA GCCCAGAAGC TATTGAAATT
CTTCAGAATT ATGACTGGCC AGGAAATATA CGAGAATTAA AAAATGTAGT AACTTACGCC
GTCTCTATTT GTAAAGGTGA TATAATTACT CTGCATTATC TACCAAAGTA CTTACAAAAC
GGTTTATCAT CCACTAAATC CCAGTTTGGT GAAGTTAAAA CTACTCATCC AATTATCCAG
GTCGATCAAT TAAAAGATCA ATTAGAAATT CTATTAGCTA AATACGGTAA TTCTACCCAG
TCCAAAAAGT TAATTGCCCA AGAGTTAGGT ATTAGTCTGG CAACTCTCTA TCGTTGGCTT
AGAAAATACC ATCTAAGGTA G
 
Protein sequence
MRKAFIDSFI DSLKETFGVV TNLPEKCFSF ISLPKTGYRV WAPPCTPTCP EGTACPYELI 
LSFKLEEDIF VVALPKNIDR DLKLTPAFLV KFLLTVLEML YIPEQKSDNL LIAEALQFLA
NQFNIEFFLF DNKGQLILSQ THSPIQLSTF TLWEKVSQTG EPEGFIKTTW GNFYYRIFTK
DGITTGSIVW KTEPTKQEEN KFLFHDDSFF TIPGKNPRFL EIKDLVRQIA VTDSTVLLQG
ESGTGKELFA RYIHYLSPRK NHPFIAINCA AIPDNLLESE LFGYEDGSFT GARRGGKPGK
FELANGGTIF LDEIGDMPLQ LQAKLLRVLE NRQVERIGAT VSRPIDVRII AATNKDLKEL
IDAKLFREDL FFRLNVFPVF IPPLRERRDD ILLLLDFYLK NICLEQDKPF KIFSPEAIEI
LQNYDWPGNI RELKNVVTYA VSICKGDIIT LHYLPKYLQN GLSSTKSQFG EVKTTHPIIQ
VDQLKDQLEI LLAKYGNSTQ SKKLIAQELG ISLATLYRWL RKYHLR
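
Both sequences are displayed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be joined, its GC fraction computed, and its codons translated. It assumes the amino-acid assignments of NCBI translation table 11 (the table named in the record) and reads codons from the first base. It ignores the table's alternative start codons, which does not matter here because the gene begins with ATG. All function and variable names are hypothetical.

```python
# Illustrative helpers; the names are hypothetical and not part of the source database.
BASES = "TCAG"
# Amino-acid assignments of NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to display a sequence in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGAGAAAAG CTTTTATTGA TTCTTTTATT GATTCTCTTA AAGAAACTTT TGGCGTTGTG"
dna = join_blocks(first_row)
print(len(dna), translate(dna))
```

The first row holds 60 bases and translates to MRKAFIDSFIDSLKETFGVV, which matches the first two blocks of the protein sequence. Passing the full joined gene sequence to `gc_fraction` would give a value to compare against the GC content field.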