Gene Moth_1713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1713 
Symbol 
ID3833163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1754916 
End bp1757162 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content60% 
IMG OID637829638 
Productsigma-54 dependent trancsriptional regulator 
Protein accessionYP_430558 
Protein GI83590549 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000552259 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGAAA TAGTGAAAAC AATCACAGGC AAATGCCGGA TGTGTTATGC TTGCATCCGC 
AACTGCCCGG TCAAGGCGAT CAAGGTAGTC GACGGCCAGG CCCGGGTGGT ACCGGAGCTT
TGTATCGCCT GCGGCCACTG CGTCCAGGTC TGTGCCCAGG GAGCCAAGCT GGTCGAGCGG
GAAATAGACA AAGTAGAGGA ATTCCTGGCT GCCGGCCGGG TCATAGCCTG CCTGGCGCCT
TCTTTTGTCG CGGAATTTCA CCCCGCCTGG CCCGGCCAGG TGGTGGCGGC CCTGCGGAAG
CTGGGTTTTG ACGAGGTTTA TGAAGTGGCT TACGGCGCCC ATCTGGTTAC CCTGGAGTAT
CAAAAGTACC TCCAGGAAGC CGGCGACCGC CAGGGGCTCA TCAGCACCCC ATGCCCGGCG
ATAGTCAACC TGGTGACCAA ATACTACCCC CGCCTGCTGC CCCAGCTGGT GCCGGTGGTA
TCGCCCATGA TCGCCCTGGG ACGCTACCTG AAAGCCCGCA AGGGTCCCGA TGTCCGCCTG
GTCTTCATTG GTCCCTGCGT GGCCAAGAAA GGAGAGGCCA GGGAGGAAGA AGTAGCCGGG
GCGATTGACG CCGTCCTGAC CTTTCTGGAG CTAAAGGAGA TGCTGGTCGC CAGGGAGATC
GACATTAATC TCTGCGGTAA CAGCGATTTC GATAATGACC CGGCCTATCT GGGCCGCCTC
TACCCCGTCG GGGGCGGCCT CTTACGTAGC GCCGGCATCG ATGCCGATAT CCTGACCAAC
CAGGTCCTGG TAGTTGAAGG CCGTGAAGAC TGCCTGGATT TTTTAAAAGC AGTAAACGAG
GGCAAGATGG TGCCTCAGAT GGCTGATATG CTCTTCTGCA AGGGGTGCAT CAACGGCCCC
ATGCAGGAAA ACGATTTTTC GCCCTACCGG CGGCGGAAGA TTGTGGCCGA ATTTACCCGC
CAGGGATTGC AAAATACCCG TCCCCAGCCC CTGGATCTCA CAGGGATAAA CTTGCGCCGG
GAGTTTCATG ATCAGAGCGT TTTCTTTCCC CGGCCCAGCG AGGCGGATAT CCGGGCCATC
CTGGCAGAAC TGGGTAAGTA TACCCCGGAG GACGAGCTCA ATTGCGGTGC CTGTGGCTAT
CCCTCCTGCC GGGAGAAGGC CGTTGCCGTT TATCACGGCC TGGCGGAGAA TCGCATGTGC
CTGCCCTACT TCATTGCCCA GCTGGAAAAG AGCAACTTCC ACCTGCGCCA GGAGCTTAAA
GCCTTCCAGG GCAAGTTTGG CCAGCTGGTG GGCAACAGTG CAGCCATGCA GGAGGTCTAC
CGCCTGATAG AGAAGGTAGC TAAAACCGAT GCCACCGTCC TCATCCGGGG GGAGAGCGGT
ACCGGCAAGG AGCTGGTGGC TAGGGCCATA CATTATCACA GCCGCCGGAG CGAGGCGCCC
TTCGTCAGCA TCAACTGCGC CGCCCTGCCG GAAACCTTGC TGGAGAGCGA GCTCTTCGGC
CATGCCCGGG GCGCCTTTAC CGGGGCGGTC ACGGCTAAGA AAGGACTCTT TGAGGAGGCC
AACGAAGGCA CCATTTTCCT GGATGAAATC GGCGATATCA GCCTGGGACT CCAGAGCAAG
CTCCTGCGCG TCCTCCAGGA AGGGGAATTC TTACGGGTGG GAGAGACCAG GCCGACCAGG
GTGAACGTCC GCGTCCTGGC GGCCACCAAC CGGGACCTGG AGAAGGCCGT CAAGGATGGC
GCCTTCCGGC CCGAGCTCCT TTATCGCCTC AACGTCATTA CCATCTGGCT GCCGCCCTTG
CGGGAACGGC GGGATGATAT CCCCCTGCTG GCCCGTCACT TCCTGGAAAA ATCCTGCCAG
CGCCTGGGTA AAGAAGTCAA TGGCTTCACC CACGAGGCCA TGGACGCCCT GATACGGGCC
GACTGGCCGG GCAATGTTCG GGAACTGGAG AACGTCATCG AGCGGGCGGT TATTTTGACC
GAGAGTAAGA GAATCGAGCT GGGCGATTTA CCGCGGCAGT TCCGTAACAT GGCCGCCGGG
GCTTCCGGCT GGGCCTTCAC CCGGGGCATG AGTTTTAAAG AGGCTGTGGC CGCCTACGAA
TCGCGCCTTA TTCTGCAGGC TCTGGAGGAG AGCCAGGGGG TCCAGGCCCG GGCGGCAGAG
ATCCTGGGCC TGAAGCGGAG CACTTTGAAT GAGATGATAA AACGCTATAA TTTAATGGGC
CGCAAAAATG GTGGTACAAA TAAATGA
 
Protein sequence
MREIVKTITG KCRMCYACIR NCPVKAIKVV DGQARVVPEL CIACGHCVQV CAQGAKLVER 
EIDKVEEFLA AGRVIACLAP SFVAEFHPAW PGQVVAALRK LGFDEVYEVA YGAHLVTLEY
QKYLQEAGDR QGLISTPCPA IVNLVTKYYP RLLPQLVPVV SPMIALGRYL KARKGPDVRL
VFIGPCVAKK GEAREEEVAG AIDAVLTFLE LKEMLVAREI DINLCGNSDF DNDPAYLGRL
YPVGGGLLRS AGIDADILTN QVLVVEGRED CLDFLKAVNE GKMVPQMADM LFCKGCINGP
MQENDFSPYR RRKIVAEFTR QGLQNTRPQP LDLTGINLRR EFHDQSVFFP RPSEADIRAI
LAELGKYTPE DELNCGACGY PSCREKAVAV YHGLAENRMC LPYFIAQLEK SNFHLRQELK
AFQGKFGQLV GNSAAMQEVY RLIEKVAKTD ATVLIRGESG TGKELVARAI HYHSRRSEAP
FVSINCAALP ETLLESELFG HARGAFTGAV TAKKGLFEEA NEGTIFLDEI GDISLGLQSK
LLRVLQEGEF LRVGETRPTR VNVRVLAATN RDLEKAVKDG AFRPELLYRL NVITIWLPPL
RERRDDIPLL ARHFLEKSCQ RLGKEVNGFT HEAMDALIRA DWPGNVRELE NVIERAVILT
ESKRIELGDL PRQFRNMAAG ASGWAFTRGM SFKEAVAAYE SRLILQALEE SQGVQARAAE
ILGLKRSTLN EMIKRYNLMG RKNGGTNK