Gene Moth_0223 details


Gene Information

Locus tag: Moth_0223
Symbol:
ID: 3831374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 219634
End bp: 220707
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 55%
IMG OID: 637828159
Product: diguanylate cyclase
Protein accession: YP_429101
Protein GI: 83589092
COG category: [T] Signal transduction mechanisms
COG ID: [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
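
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `check_gene_record` is hypothetical and is not part of any database API.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that coordinates, gene length, and protein length agree.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    span = end_bp - start_bp + 1              # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # One codon is assumed to be the stop codon, so it adds no residue.
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record above.
result = check_gene_record(219634, 220707, 1074, 357)
print(result)
```

Under these assumptions all three checks pass for this record: 220707 - 219634 + 1 = 1074 bp, and 1074 / 3 = 358 codons, of which 357 encode amino acids.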


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.133267
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAAA AGCTCGGGCG GGCCGACTGG TTGGCAGCGA TGCTGGTTAC AGTGGGGGGC 
GTTTTTTTGT CCCTGGCAGC GCCACCGGGC CCGACTGTCT ATTGGGGGTT GACCTGGAGT
ATATTTAACG GCCTCATCCT GGCTATCGGA ATTCTTAGCA CGGGTTTTAT CACCCGGGTT
TTGATCCTGG GACTGAACCT GATACTGATA GCTGGCTGGC AGTTAACAAG TGGCTGGCCG
TCAGCCACGT CACTTCCCTT GCTTCTCTTC TTACCGGTCC TGGTGCCCCT TTATCGGAAG
CAAAGGAAGG AAATCCTTGC AGGCCTGGTA GGGGGGCTGA TACTGGGAGG ATACAGCACT
TTAAAAGAAA ATCTGGCTAA TCCGTCTTCA TGGGCTATCC TGGGTGGTTG GGTTGCAATC
GCGGGCTTCT TTTATTACCT CATGGTGGGT CTGGTGGTTA AGGCCAGGCA GGCTGCTTCT
TTGCAGGCTG AGGTGGAATA TACCCGTCAC GAGTATCAAG AAGCCTGCAA GCGGCTGGCC
GCCATGGAGA TGGCGGCCAT TACTGATGAT TTAACCGGGA TTTATAACTA CCGCTACTTC
GTGCAGGCCT TTAGCAACCT GTTGAACTCC CGGCAGCAGC CCCGTTACCT GGCAGTTTTA
ATGCTGGATA TCGATTACTT TAAAGAGATA AATGATGCCT ACGGCCATCT CACCGGTAAC
AGGGTACTGG CGGAACTGGC CACCATCCTG AAGGAGTGCA CCCGTGAACA GGATGTTGTC
ACCCGTTTCG GCGGGGAGGA GTTCGCTCTC ATTTTGCCCG ATACAGATTA TCACGGTGCC
CTGCAGGTGG CGGAAAGGAT CCGCAAGGCC ATCGCCGAGC ATACCTTCCA AGCTGAAGGA
ACGGCCATCC ACGTTACTGT AAGTGCCGGT GTGGCGGTCT GGCCGGTAGA CGGGACCGAT
AAAAAGGATA TCATTGCCCG GGCCGACCGT GCCCTTTACC AGGCCAAGAC AACCGGGCGT
AACAGTGTCT GCGCCTATCA GTTCCTGAAA AAGGAACGGG GTGTCCATGA ATAA
 
Protein sequence
MGEKLGRADW LAAMLVTVGG VFLSLAAPPG PTVYWGLTWS IFNGLILAIG ILSTGFITRV 
LILGLNLILI AGWQLTSGWP SATSLPLLLF LPVLVPLYRK QRKEILAGLV GGLILGGYST
LKENLANPSS WAILGGWVAI AGFFYYLMVG LVVKARQAAS LQAEVEYTRH EYQEACKRLA
AMEMAAITDD LTGIYNYRYF VQAFSNLLNS RQQPRYLAVL MLDIDYFKEI NDAYGHLTGN
RVLAELATIL KECTREQDVV TRFGGEEFAL ILPDTDYHGA LQVAERIRKA IAEHTFQAEG
TAIHVTVSAG VAVWPVDGTD KKDIIARADR ALYQAKTTGR NSVCAYQFLK KERGVHE
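
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method: the names `translate` and `gc_percent` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so the standard assignments are used here. Whitespace in the input is ignored so that the 10-base blocks shown above can be pasted directly.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for each codon position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()        # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGGCGAAA AGCTCGGGCG GGCCGACTGG"))  # → MGEKLGRADW
```

Passing the full gene sequence to `translate` gives a string that can be compared with the protein sequence listed above, and `gc_percent` can be compared with the reported GC content.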