Gene Athe_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0447 
Symbol 
ID7407524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp506654 
End bp508303 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content35% 
IMG OID643714834 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_002572352 
Protein GI222528470 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAA AAAATAAAGA CTTAAACTGT AAAGCAAATA ATTATGCAGC TAAAATATCA 
ATAATATATG CAGTTGTGAG CGCAATTTGG ATTTTAATCT CTGATGTGCT AACAACAATT
TTCTTTGCTA AAAAAGAGCT TGTTACTTTC ATTTCAGTAT TTAAAGGCTG GCTCTTTGTG
TTTATCACTG CAAGCCTTTT GTATTTGTTG ATTCGCAAAA AAATCTATTC GCTTTACCTT
TCTGAAAATA AGCTTCAAAA TGCTATCGAG GAACTTCAAA AGACAAATGA TGAGCTTTCC
AAAACACAGG AAAAGCTTGT TGTCCAGTAC AAAAAACTTG CAAAAAACCA GGAAAAGATA
AAAGAACTTG CTTACTTTGA CCAGCTAACT AATCTGCCAA ATAGAAATCA TTTTATGTTA
GCTCTCGAAA AGGCTATCAG GGATGCTCAT TTCAAAGGGC AGCAACTTGC TCTGATGTGT
ATCGATATTG ATAATTTCAG TAAAATTAAT AATACTCTTG GTCATGCGAC TGGTGACGTT
GTTTTAAAAG AAATTGCGCA AAGGCTTAAA GAAGCGGTGG GTAATAACGG GTTTGTTGCA
AGACTAACAG GAGATGAGTT TGGAATAATA ATTTATAATT TTCTTGATTT TAACGTCTTG
AATTACTTCA TCTACAAAAT TTTTAATTTA TTTTCAACAT CATGGGAAAT AATGGAATAT
AATTTTAATA TTACACCAAG TATGGGGATT GCTATATATC CTTCAGATGG GCAGGATAGT
ATATCCCTTT TGAAGAATGC CGATAAGGCC TTGAATCTTG CCAAGGAAAA AGGCAAAAAT
ACTTTCTGCT TTTACAATCT GGAAATGGAC AATATACTTC AGCAGAGGCT TGAATTCGAA
TCAGACTTGA GAAAGGCGAT TGAAAAAGAC CAGTTCGTTT TGTATTATCA GCCTATTGTA
GACCTTGAAA AAATGCAGCT GTGCGGAGCA GAAGGGCTTA TTCGCTGGAT ACATCCACAA
AAGGGGCTTA TATCGCCCAT GTCTTTCATC CCAATTGCTG AGCAGACAGG ACTCATATCA
CAAATTGGTC AATGGGTATT GACAAAGGTC ATTACGGATT TGAAGAGCAT CAGAGAAGTA
ACAAACCACA ACTTTTATAT TTCTTTCAAT GCGTCCTTAA GAGAGTTTTC AAGCGCAAAT
TTTGTTGATA ACGTACTTTA TACAATTGAA GCCTTAAAAG GTGACCCAAC TTCTTTGGGA
ATAGAGATTA CAGAATCGGT TGCAATGGCA GACCCTCAAA ATACCATAAA GTCGCTGAAC
ACCTTCAAAG AAAAAGGCAT TAAAGTTTTT CTTGACGACT TTGGCACAGG TTATTCTTCC
TTGAACTATC TAAAACAGCT TCCCATTGAT GTTGTTAAAA TCGACAGAAG TTTCATAGCC
AATATGAGCA CTGACATTAA AGAACAAAAA ATAGCCAAAA GCCTGATTAA CCTTTCTCAC
ATCTTAGATT TAAAAGTTGT GGCAGAGGGT ATTGAAAATA GTCAGCAGGC TGAGATATTA
AAATCTTTTG AGTGTGATTT TGGACAGGGA TATTTGTTTG GAAAACCTCT TCCAAAAGAC
CAATTTATAG AGTTTGCAAA GAGGTTTTGA
 
Protein sequence
MSEKNKDLNC KANNYAAKIS IIYAVVSAIW ILISDVLTTI FFAKKELVTF ISVFKGWLFV 
FITASLLYLL IRKKIYSLYL SENKLQNAIE ELQKTNDELS KTQEKLVVQY KKLAKNQEKI
KELAYFDQLT NLPNRNHFML ALEKAIRDAH FKGQQLALMC IDIDNFSKIN NTLGHATGDV
VLKEIAQRLK EAVGNNGFVA RLTGDEFGII IYNFLDFNVL NYFIYKIFNL FSTSWEIMEY
NFNITPSMGI AIYPSDGQDS ISLLKNADKA LNLAKEKGKN TFCFYNLEMD NILQQRLEFE
SDLRKAIEKD QFVLYYQPIV DLEKMQLCGA EGLIRWIHPQ KGLISPMSFI PIAEQTGLIS
QIGQWVLTKV ITDLKSIREV TNHNFYISFN ASLREFSSAN FVDNVLYTIE ALKGDPTSLG
IEITESVAMA DPQNTIKSLN TFKEKGIKVF LDDFGTGYSS LNYLKQLPID VVKIDRSFIA
NMSTDIKEQK IAKSLINLSH ILDLKVVAEG IENSQQAEIL KSFECDFGQG YLFGKPLPKD
QFIEFAKRF