Gene Moth_2113 details

Gene Information

Locus tag: Moth_2113
Symbol:
ID: 3833264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2207418
End bp: 2209064
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 59%
IMG OID: 637830038
Product: two component AraC family transcriptional regulator
Protein accession: YP_430948
Protein GI: 83590939
COG category: [T] Signal transduction mechanisms
COG ID: [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain
TIGRFAM ID:
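
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 2207418
END_BP = 2209064


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon contributes one residue.
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(f"gene length: {n} bp, protein length: {protein_length(n)} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.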


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCCACGG ATATCAAGAT CCTGCTTGTC GACGACGAAC CCCTGGAGCG CCAGGCCATC 
CGCTTTTTGC TGGCCAGGGA GCGCCCTCAT TACCAGATTG CCGGGGAAGC GGGTAATGGA
GGCGAGGCGG TCAAACTGGC TGCCAGGTTG CGACCGGACA TCGTCTTCCT GGATATCAAG
ATGCCCGTCA TGGATGGGTT GACCGCCGGT CGGGAGATTC GGGCAATCCT ACCCGAGGCC
AGGTTGATTT TTGTTACTGC CTATGGCGAA TTCGATTATG CCCGGGAAGC TGTTGCCCTG
GGGGCATCCA AATATTTACT AAAGCCGGTG GCGGCCGAAG AAATGCTTCC CCTCCTGGAT
GAACTGGCTG CCGGCGTCGC CGCCGCTCGC CGGCGCCAGC AGGAGACAGC AAGGTTGCGG
GCCGCTCTGG AGGAAGCGAA GCCCTTTATT CGCCTGGGCT TTATCATGGA CCTGATCAAC
GGTAATATCA CCGACGCCGA AGCCGTCAGC CGGGCGCGCT TCCTGGGGAT CGCCACCTTG
CCCCGTCTGG CCATGTTGGT GGATATTGAT AATTTTGCCG CTCTGGCCCG GGAGGGGACA
GAGGTAGAAC GGCAGATTTT AAAGCAACAG GTCAAGGAAA GTCTGGAAAG GGCGACTGTG
TCCTGGCCCG GGGCCCTGGT CGCCCCGGTA ACCAGGGATG AGTTCGCCAT CCTCCTGCCC
CTGGACCACC TGGCCCCGGG TGCAGATAGC CACCAGGCCG CCATCGAGCT GGGAGAAGGC
ATTTGCCGGC AGGTACGCCG GGATACCAGG GCTACGGTAA CGGTGGGTAT CGGCCGGCCG
GTGGCAAAGG TTGCTGAACT GGCCCGTTCC TACGCCGAGG CGGTGGCGGC GGCAGAATTC
CGGCTATTTT ACGGCGGGGA CCAGGTTATC CATGCTGACG ACGTTATTGC CCGGCCCAGT
GCCGGCCAGT TCCTGCCGGC TCCCGAGGAG CAGGAATTAA CCCAGGCCAT CCGTATGGGT
GATAGGCAGG CTGCCTACCG CCAGGCTAAA AATATTTTGA TGCAACTCCT CCTGGAGCAG
GAAAAACGGC CGGCTATATT GAAGATGAAA CTCCTGGAAC TGAATACCCT GGCGGCCAGG
GCCGCCCTGG AGGGCGGTGC CGACCCGGAG GCGGTTTCCG ACCTTGCCCT GGCCAGCAGC
ACTGAGTTTC TTACCCTGGA CAACCTGGCT GATATGCGGG AGCGCATCCT GGAACGTTTA
ATGGCCCTGG TGGCCCAGGT GGCGGAAACC CGGGAGCAGC GCAATTCCTC CCTTATTGAC
CGGGCCAGCA AGTATATTGA GGCCAATTTC AGCCAGGATC TCACCCTGGA AGAGGTCGCC
CGGCAGGTAT ATCTTAGCCC CTGTTATTTC AGCAAGCTGT TCAAGCAGTT CAAGGGCTTG
AATTTCATAG ATTATCTAAC AAAGGTACGC CTCAAGGCGG CCAGGGAGTT ATTGCTGAAC
ACCAAGCTCC CGGTAGCGGA AATCGCCACT CGCGTTGGTT ATCGTGATGC TCGCTATTTT
GGGCAGGTGT TTAAAAAGCA GGAAGGCTAC ACGCCCAGTG TCTTCCGGAA AATAGGGGGT
GCCCACTTTG GCAAGAGTAC TAGTTGA
 
Protein sequence
MATDIKILLV DDEPLERQAI RFLLARERPH YQIAGEAGNG GEAVKLAARL RPDIVFLDIK 
MPVMDGLTAG REIRAILPEA RLIFVTAYGE FDYAREAVAL GASKYLLKPV AAEEMLPLLD
ELAAGVAAAR RRQQETARLR AALEEAKPFI RLGFIMDLIN GNITDAEAVS RARFLGIATL
PRLAMLVDID NFAALAREGT EVERQILKQQ VKESLERATV SWPGALVAPV TRDEFAILLP
LDHLAPGADS HQAAIELGEG ICRQVRRDTR ATVTVGIGRP VAKVAELARS YAEAVAAAEF
RLFYGGDQVI HADDVIARPS AGQFLPAPEE QELTQAIRMG DRQAAYRQAK NILMQLLLEQ
EKRPAILKMK LLELNTLAAR AALEGGADPE AVSDLALASS TEFLTLDNLA DMRERILERL
MALVAQVAET REQRNSSLID RASKYIEANF SQDLTLEEVA RQVYLSPCYF SKLFKQFKGL
NFIDYLTKVR LKAARELLLN TKLPVAEIAT RVGYRDARYF GQVFKKQEGY TPSVFRKIGG
AHFGKSTS
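
The gene and protein sequences above are printed in space-separated blocks of ten, and the gene begins with TTG rather than ATG. The sketch below shows one way to strip that layout, compute GC content, and translate a CDS so that an alternative start codon is read as methionine. It is an illustration only: the helper names are hypothetical, the codon assignments are the standard ones (which translation table 11 shares, as far as we know), and the start-codon set is an assumption based on the NCBI description of table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS that is assumed to begin at its annotated start codon.

    A first codon found in START_CODONS is read as M; translation stops
    at the first stop codon.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases and last 27 bases of the gene sequence above.
    print(translate("TTGGCCACGG ATATCAAGAT C"))
    print(translate("GCCCACTTTG GCAAGAGTAC TAGTTGA"))
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the Protein sequence block and the GC content field; the fragments used here only check the first and last residues.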