Gene Moth_2148 details

Gene Information

Locus tag: Moth_2148
Symbol:
ID: 3833148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2249150
End bp: 2250163
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 55%
IMG OID: 637830070
Product: diguanylate cyclase
Protein accession: YP_430980
Protein GI: 83590971
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain; [COG2202] FOG: PAS/PAC domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
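
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2249150, 2250163)
print(length, protein_length_aa(length))  # 1014 337
```

With the values from this record, the computed lengths agree with the Gene Length and Protein Length fields.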


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0891685
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGCTGGTGT CCACCGTAGA GATGGCCCTG AAGCTCCATG AAGCCCTCTC CCTGGCGCAA 
ATGTACCGCA GGATAGTCGA AGACTCCTTG ACGGAGGTCT ATATCTTTCA TCCCGATACG
TTGAAGTTCC TGGCGGTCAA CCGGGGGGCC AACGAGAACC TTGGCTACGC CGAAGAAGAA
CTTTTGGACA TGAACATCCT GCAGCTAATG CCGGAATTTG ACCGGGAGAG CTTCAGAGCC
CTTTTGACCC CTTTACAGCA GGGCCAAAAA GAAAAGATCA TCTTTGACGC CAAGCACCGC
CGGAAAGACG GCTCCCTCTA CCCGGTGGAA ACGCACCTGC AGCTCTTCGG CCATGGGAAA
GGCAGCATAT GCGTGGCCTT TATATTAGAT TTGACAGAAC GCAAGAAAAT GGAAGAAAAG
CTGAGGGAGC AAGGAGAGTT CCTGCGGTCG CTCCTGGCCG CCCTGCCCGT CGGGATCTTT
ATCATCGACC CCGTCTCCCA CCGCATCGAG AAGGTAAACC TGGAAGCGGC CGCCATGATT
GGAGCCGCAC CCGAAGAGAT CGAAGGCAGA TCCTGCTGGG AATTTTTCAT ACAATCCGCA
GGAAGCTGTC CTATTACTGC CTCGAATGAA GAGGTTGACC GCTCCGAACG GCTTTTACGC
CGGAAGGACG GGCTGGAGAT CCTCGTGCTA AAGACGGTCA AGCGCGTGCG GACGGACAGC
GGGGAGAAAC TGGTGGAAAC CTTTATAGAC ATCTCCGAAC GCAAACACCT GGAGGAAGAG
CTTTACCGCC TCTCCATCAC CGACCCTCTG ACCGGCGCTT ACAACCGCCG CTATTTTTTA
GAAATGCTGG AAAGAGAAGT TGAGCGTATA CGGCGGACCG GGAATCCCTT CTCCCTGATC
ATGTTTGACC TGGATCACTT CAAAAGTATA AATGACCATT TTGGACATGC CGCAGGAGAC
CGGGTGGATT CAGGTGGCCG CGCCGGCATA ATTGAAAGCC GCGCTGGTGT ATGA
 
Protein sequence
MLVSTVEMAL KLHEALSLAQ MYRRIVEDSL TEVYIFHPDT LKFLAVNRGA NENLGYAEEE 
LLDMNILQLM PEFDRESFRA LLTPLQQGQK EKIIFDAKHR RKDGSLYPVE THLQLFGHGK
GSICVAFILD LTERKKMEEK LREQGEFLRS LLAALPVGIF IIDPVSHRIE KVNLEAAAMI
GAAPEEIEGR SCWEFFIQSA GSCPITASNE EVDRSERLLR RKDGLEILVL KTVKRVRTDS
GEKLVETFID ISERKHLEEE LYRLSITDPL TGAYNRRYFL EMLEREVERI RRTGNPFSLI
MFDLDHFKSI NDHFGHAAGD RVDSGGRAGI IESRAGV
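
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is one way to do this in plain Python, under stated assumptions: the sequence is given 5' to 3' in the coding frame, internal codons follow the standard codon assignments (which translation table 11 shares), and a GTG or TTG initiation codon is read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical, and `START_CODONS` lists only a common subset of the alternative start codons.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a common subset of bacterial initiation codons, all read as M.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE.get(codon, "X")  # X marks an unrecognized codon
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCTGGTGT CCACCGTAGA GATGGCCCTG"))  # MLVSTVEMAL
```

The first ten residues match the start of the protein sequence listed above, where the leading GTG codon appears as M.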