Gene Moth_0207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0207 
Symbol 
ID3831358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp201662 
End bp203338 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content60% 
IMG OID637828143 
Producttranscriptional regulator 
Protein accessionYP_429085 
Protein GI83589076 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000326759 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATAA CGGTAAGGGA AGCCTTGAAG CTGGGCGGCC TGCGCCGGGC TAAAGTAGTG 
GCCGGCGAGG CCGGGCTGGA AAGAATTATA AAATATGTGG ACATCCTGGA GATACCCGAT
CCCCGCGGGT GGTTCAGGGC CAACGAGCTT ATCATTACAA CTGGTTATGC TATCAGGAAC
GATCCCCGGG CGCAGGCGAA TCTACTGCTG GAGCTGGCCA GGACAAATGG GGCCGGCCTG
GCTGTAAAGT TCGGCCGTTT TATCGGCTCG GTACCGGAGG AGATGCGGCG CCTGGCGGAT
GAATTAAAGA TACCCCTCCT GGATGTTCCC GATGATATTC CCTACGTCGA GATTACCAAC
CCCCTCATGG CGGCCATTGT CAATGAACAG GCGCGGCAAC TGGAGTATTC GGAAAGGGTC
CACCGTGAGC TCACCAGGGT GGCCCTGGAG GCGAGCAACA TCCAGGCGGT GGCTGCGGCC
CTGGCGACCC TGGTGGAGCG GGAGGTGGTT ATTTGCGACG AGGAATTGCA GCCCCTGGCA
GTGTCAGGGG CAGTGTCAGG TAAGGGGTCT GCGGGCGGGT CCCTGCAATT GCCACCGGAG
GAAATCAAGA GGTTACATTC AGCCCAGAAA GCTGTGGAGA TAACCCTCTG GTCCAGGAAT
GTCCGGAGGC GGTATTTTGT CGCCCCCATT GATGTCAGGG AGCACCGCTA TGGCTATATA
CTGGTGGATG GGCAGACGCC CCTGAGTGAA CTCAATCAGA TCGCCCTGGA ACACGCCGTC
ACAGTGACGG CATTGCAGAT GGTCAAGGAA GAAGCCGTTG TCGAGGCCCG GAGGAGCCTG
CAGCGGGATT TACTGGAGGA CCTGATAGCC GGAGCCCTGC GCCACCGTGA ACTCGCCATC
AGCAGGGCCG AGGCCCTGGG GATTCTCCTC GAGGAACCCA AGGTCATTAT GGCCATTGAT
ATAGATGATT TCACCGGTTA CCTCCTGCAC CAGCCCGGTG CCCAGGAGGC CAATGCCGGC
GTGTTAAAGC GCCGCTTCCA CCGGGCGGTA AACTCCTGCT TCATGGCCTT TGACCGGCGG
GTGCTGACCG TGCAGCGCAG TGACAGTGTT GTCGGCATTT TGCCGGCAAA CGGCAGGGAG
ATCCGGGGGG CGGAAGACTG GCGGGTATTG CGCGGTATGC TGCAGGAACT GGCCGCTTCT
ATCCAGCTTA AAATCGCCCG GGAACTGGAT GGCGTGACCG TTACCATTGG GATAAGCTCC
ATAGCCGCGG ATCCTATGGA GATTAGCGAG CGCTACCAGG AAGCGAGGAC GGCCATCAGG
CTGGCCCGGC GTCTCAATGG GAAAGGCACC GTCGCCTTCT GGGAGGATGT GGAGTTATAT
CATGTCCTGG GACAATCGGG TGAGACCCTG GAGAGGTTTT ACCGTTCGGT TCTGGGGGAA
CTGGACCGGC CGGAAGTGAA AAACCGGGAA GAACTCCTGG AAACCCTGCG GGTCTATCTG
GAGTGCCAGG GGAATGTAAT GGCTTCCGCC GCGAAACTCT ATATCCACCG CAACACCATG
CGCTACCGTT TACAGCGGAT TGAGGAGCTC CTGGGGCGGG ACCTGGATTC GCCGGATGAA
CGGCTGGCCC TCTGGCTGGC CCTCAAAGCC CGGGGCCTCA TCAGGTCGGA ACAGTGA
 
Protein sequence
MGITVREALK LGGLRRAKVV AGEAGLERII KYVDILEIPD PRGWFRANEL IITTGYAIRN 
DPRAQANLLL ELARTNGAGL AVKFGRFIGS VPEEMRRLAD ELKIPLLDVP DDIPYVEITN
PLMAAIVNEQ ARQLEYSERV HRELTRVALE ASNIQAVAAA LATLVEREVV ICDEELQPLA
VSGAVSGKGS AGGSLQLPPE EIKRLHSAQK AVEITLWSRN VRRRYFVAPI DVREHRYGYI
LVDGQTPLSE LNQIALEHAV TVTALQMVKE EAVVEARRSL QRDLLEDLIA GALRHRELAI
SRAEALGILL EEPKVIMAID IDDFTGYLLH QPGAQEANAG VLKRRFHRAV NSCFMAFDRR
VLTVQRSDSV VGILPANGRE IRGAEDWRVL RGMLQELAAS IQLKIARELD GVTVTIGISS
IAADPMEISE RYQEARTAIR LARRLNGKGT VAFWEDVELY HVLGQSGETL ERFYRSVLGE
LDRPEVKNRE ELLETLRVYL ECQGNVMASA AKLYIHRNTM RYRLQRIEEL LGRDLDSPDE
RLALWLALKA RGLIRSEQ