Gene Moth_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1889 
Symbol 
ID3831234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1952445 
End bp1954703 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content57% 
IMG OID637829822 
Productsigma-54 dependent trancsriptional regulator 
Protein accessionYP_430732 
Protein GI83590723 
COG category[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGGA TAGTCAACAC TGCTACTGGA AGGTGCCGTC AGTGCTATTC CTGTGTCCGC 
AATTGCCCGG TCAAGGCCAT CAGAATAAAC AAGGGCCAGG CGGAGGTTAT CGCCGAACGC
TGCATCAGCT GCGGCATGTG CCTGGCTTTT TGCTCCCAGG GGGCCAAACA GGTAGCCGGC
AGCCAGGCGG CCGTCCTGGC AGCGTTAAAG GAGCACCAGG AGATGGTAGC CTGCCTGGCG
CCGTCATTTC CAGCGGCTTT TCCTGGTTGG ACCGCCGGCC AGGTGGCCGG CGCCCTGAAG
AAACTTGGTT TTGCCCGGGT ATGGGAGGTG GCCGTGGGGG CGCTGCTGGT TGCCAGGGAG
TATCAAAGGG TGCTAAAACA GAGGAATACT CCCGCCATCA GTACGGCCTG CTATGCGGTG
GTCAATCTGG TCGAGAGGCA CTTCCCGTCC CTCATTCCTT ACCTGTTACC GGTAGTCTCC
CCCTCCATAG CCCTGGGAAG GCTTCTTAAA AAACACCTGG GTCCCGTGAA AGTGGCTTTT
ATCGGCCCCT GTATCGCTAA AAAAGAAGAG ATTCTGGATC CGGAGGTAGC CGGCGCTGTA
GATTATGTAC TGACATTTGC GGAAATTAAG GAGTTACTTG CCGTTGAGCA TCTGGAACAT
CCCGGGGTTG CGGCAGCCCT GGACAGCCCG CCGGTGGCAG TCAGCCGGCT TTTTCCCCTG
CCCGGGGGAC TCAGCCGGAG CATGGGCGCG ATCCCGGATA TTGCCGACCA GGATCTTTTG
CTGGTTGAAG GGAAAGAAGG TGTGCTGGCG GCCCTGGAGG GCCTGGCACG GGGGGAGATC
CGGCCCCGCT TAATCGACGC CCTTTTTTGC GAAGGCTGCG TCATGGGTCC CGGAATGGGT
GTTGTGGTCA ACCAGGTAAA GAGAAAGGAG CTGGTAGCCG CCTACTACCG CCGCTGTCAG
GAGGCGCGCG AGCCGGAAAT CCTAGCCCCC GACCTGGCGC GGAGCTTTCA CAATAAACAA
TCGTCCCTGC CCCTCCCCGG CGAGGAAGAT ATTAAACGCA TCTTACGGCT GACCAACAAA
TTTACGCCGG CCGATGAACT GAACTGTGGC GCCTGCGGCT ATCACTCCTG CCGGGAGAAA
GCCATAGCCG TTTACCAGGG CCTGGCGGAG ATCGATATGT GCCTGCCCTA TCTCCTGGAA
CAGAAGAGCG ACCTGCTGTC CCGGGCGGCC AGCAACCTGA TGCATTTCGT CAATCTATAT
AAAAGTCCCG GCGACAGGCC CGGCCCCGGG GTCATGGAAT TGCTCCAGGA AAGAAACATT
ATTGTCGCCA GCCCGCGGAT GTTAAGGGTC CTCTACCTGG CGGAACGGGT AGCCAGGGTG
GATTCCACGG TGTTAATCCT GGGGGAATCC GGCGTCGGTA AGGAAGTAGT CGCCCGCCTG
ATCCATGCCT TAAGCGAGCG CGGCAAGGGG CCGTTTGTGA AAATAAACTG CGGCGCTATT
CCGGAAAACC TGCTGGAATC CGAGCTTTTT GGCTACGAAC GGGGGGCTTT TACCGGGGCC
AACCGGGAGG GAAAGATGGG CCAGCTGGAG TTGGGCGAGG GGGGAACGGT ATTCCTGGAC
GAAATCGCTG AACTCCCCTT AAAGCTACAG GTTAAGCTCC TGCAGGTCTT ACAGGAGCAG
CGCCTGGTAC GGGTGGGGGG GATCAGGGAG ATCAAACTCA ATATTCGCAT TATCTCGGCG
ACCAATAAAA ACCTCTTGCA GATGGTCCGG GAAGGGACCT TCCGGGAGGA TCTGTATTAC
CGCCTGAATG TAATCCCCCT GACCATCCCC CCTTTACGGG AACGGCCGGA AGATATCGAA
GCCCTCATCG ACCATTTTAT GGACCGGCTG AACCGGCGTT ACAAGCAAGA AAAAAGGATT
AGCCGCCGGG CCAGGAGGTA TCTCCTGGCC TATCCCTGGC CCGGCAATGT AAGGGAACTC
CATAACGTCA TCGAGCAGCT TTTCGTCCTG GTAGAAGGGA CGGAGATTCT ACCTGAGCAT
TTACCCTATT ATATCCGCGA CGACCCGGCG AGATATAGCT CCCATATGCT GGTAAAAGAT
ATTATACCCA TGAAAGAAGC CATTGAAGAG GTTGAAAAAC AGTTGCTGTT AAAGGCCCTG
GAAAAGTACA GGAGCACTTA CCAGGTTGCC GAAAAGCTGG GGGTAAACCA GTCGACTGTA
GTGCGCAAAA TCAAAAAGTA CGGGCTGGAG CATCAATAA
 
Protein sequence
MGGIVNTATG RCRQCYSCVR NCPVKAIRIN KGQAEVIAER CISCGMCLAF CSQGAKQVAG 
SQAAVLAALK EHQEMVACLA PSFPAAFPGW TAGQVAGALK KLGFARVWEV AVGALLVARE
YQRVLKQRNT PAISTACYAV VNLVERHFPS LIPYLLPVVS PSIALGRLLK KHLGPVKVAF
IGPCIAKKEE ILDPEVAGAV DYVLTFAEIK ELLAVEHLEH PGVAAALDSP PVAVSRLFPL
PGGLSRSMGA IPDIADQDLL LVEGKEGVLA ALEGLARGEI RPRLIDALFC EGCVMGPGMG
VVVNQVKRKE LVAAYYRRCQ EAREPEILAP DLARSFHNKQ SSLPLPGEED IKRILRLTNK
FTPADELNCG ACGYHSCREK AIAVYQGLAE IDMCLPYLLE QKSDLLSRAA SNLMHFVNLY
KSPGDRPGPG VMELLQERNI IVASPRMLRV LYLAERVARV DSTVLILGES GVGKEVVARL
IHALSERGKG PFVKINCGAI PENLLESELF GYERGAFTGA NREGKMGQLE LGEGGTVFLD
EIAELPLKLQ VKLLQVLQEQ RLVRVGGIRE IKLNIRIISA TNKNLLQMVR EGTFREDLYY
RLNVIPLTIP PLRERPEDIE ALIDHFMDRL NRRYKQEKRI SRRARRYLLA YPWPGNVREL
HNVIEQLFVL VEGTEILPEH LPYYIRDDPA RYSSHMLVKD IIPMKEAIEE VEKQLLLKAL
EKYRSTYQVA EKLGVNQSTV VRKIKKYGLE HQ