Gene Moth_1471 details


Gene Information

Locus tag: Moth_1471
Symbol:
ID: 3832352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1520473
End bp: 1521792
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 51%
IMG OID: 637829404
Product: diguanylate cyclase
Protein accession: YP_430324
Protein GI: 83590315
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
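
The coordinate and length fields in the Gene Information record can be cross-checked with a little arithmetic. The sketch below is a minimal example, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1520473
end_bp = 1521792
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so N codons encode N - 1 residues.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1320
print(remainder)                # 0
print(expected_protein_length)  # 439
```

Under these assumptions the computed values agree with the listed gene length of 1320 bp and protein length of 439 aa.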


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAAATAC AATGGCAGGA AACTTACTTC AAACTATTAC ACCAGTACAG TCTACAGCAG 
AGTGAGGCCA TCCTTTATGA AGCCTCCAAG CTTAGCAAGG CCCTGGTGGA GAAGGGTGTA
GGCCCCGAAG ATATTGTTCA ATTTCACCTT GAGGCCCTGG AAAAGCTTTT CAAGGATATT
TCGCCCCTCC GGGTACCGGG TCTGATTCTT ATTTCCTTTG ATCTACTGCT GGAAGTGATG
ATGGCTTATG CTTTGAATTT CAGGGAATAT TTGGAAGTAA AAAACAAGCT CATTGAGCGA
CTGGAAAACT TTAATCAAGA ATTGGCGGCT GCCAACCGTG CCCTGGAAAT GAAAGTCAAA
GAGTTATCTG TTATCCAGGA ACTGACGAAA GAATTGGGAT CCTGCCTGGA CCTTGATCGG
ACGGCCGGGA TAATCACCAG GCACCTGCAG GAATTGCTTA ATTGCGAAGT CAATCTATAT
ATTATCAGTA GTAACGGTGA ATGGCAGGGC TATACCCCCG ATGATGACCC TGAGGCCGTC
AGTATAAGCA AAAATATTGA TGTCCCGCCC CCGGTCCTGG AAGCCAGGGG CGGGGAGGCC
GTCAGGGTGG AAGGTCGGGA CTTGACATTG CCCCTGGTAG TTGATCGGGA GGTTGGCGGC
GCAATCTATT TACAAAGGGA CGATAGCTTC AGCGCCGACG AGTTTCGGCT GGCGGATATT
ATCGCCGGCT ATGCTGCCCT GGCCATCGAG CGCGCCCGGC TGTATGAAGC CATGAAGTTC
CAGGCAACCA TTGACGCCAA AACCGGTTTG TATAATTACC AGCACATGAT GCACCTGCTG
GAAAAGGAGA TTGCCCGTGC CAGGCGTTAC CAGCGTACCT TTACCATCGC CATGCTTGAT
ATCGATGACT TTAAAATTTA CAACGATACC CATGGTCACC ACCAGGGAGA CAAGGCCCTG
CAGAAAATAG CGGCCCTCAT CAAGGCCAAC ATCCGGGAAG TAGATATAGC AGCGCGCTAT
GGTGGCGAGG AATTTGTCAT TATCATGCCG GAAACATCTG CTTTAGAAGC GAGTGTAGTG
GCCGAAAGGG TACGGCGAGC CATCATGAAT GCTGGTATCG CCAACGTAGG ATGCGGTCCG
GACAGGCTGC TGACGGTAAG CATCGGCCTT GGTACTTATC CCCACGATGC CACGACGGCC
GGGAAATTGA TTGACGCCGC CGATAGCGCC CTTTACGAAG CCAAGCGGTG GGGAAAGAAC
GTAGTGCGGG TTTACAGTAA GACTGACAGG CGGCGATCCG GCGATGCAGA AGTACTTTAA
 
Protein sequence
MEIQWQETYF KLLHQYSLQQ SEAILYEASK LSKALVEKGV GPEDIVQFHL EALEKLFKDI 
SPLRVPGLIL ISFDLLLEVM MAYALNFREY LEVKNKLIER LENFNQELAA ANRALEMKVK
ELSVIQELTK ELGSCLDLDR TAGIITRHLQ ELLNCEVNLY IISSNGEWQG YTPDDDPEAV
SISKNIDVPP PVLEARGGEA VRVEGRDLTL PLVVDREVGG AIYLQRDDSF SADEFRLADI
IAGYAALAIE RARLYEAMKF QATIDAKTGL YNYQHMMHLL EKEIARARRY QRTFTIAMLD
IDDFKIYNDT HGHHQGDKAL QKIAALIKAN IREVDIAARY GGEEFVIIMP ETSALEASVV
AERVRRAIMN AGIANVGCGP DRLLTVSIGL GTYPHDATTA GKLIDAADSA LYEAKRWGKN
VVRVYSKTDR RRSGDAEVL
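
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example under stated assumptions, not the tool that produced the record: `translate_cds` is a hypothetical helper, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits). Spaces and line breaks in the sequence are ignored, and translation stops at the first stop codon.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper: whitespace in the input is removed first, and
    any trailing bases that do not fill a codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 60-base lines of the gene sequence listed in this record.
first_line = "ATGGAAATAC AATGGCAGGA AACTTACTTC AAACTATTAC ACCAGTACAG TCTACAGCAG"
last_line = "GTAGTGCGGG TTTACAGTAA GACTGACAGG CGGCGATCCG GCGATGCAGA AGTACTTTAA"

print(translate_cds(first_line))  # MEIQWQETYFKLLHQYSLQQ
print(translate_cds(last_line))   # VVRVYSKTDRRRSGDAEVL
```

Each line of the gene sequence holds 60 bases, so every line starts in frame; the two translations match the beginning and the end of the listed protein sequence.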