Gene Athe_2238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2238 
Symbol 
ID7407657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2372643 
End bp2374358 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content32% 
IMG OID643716604 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_002574083 
Protein GI222530201 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0733409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAT CTTCTAATGA TTTTTGGGTT AATAATTTTA AAAACATAAT AGAAGGTAAT 
TATGAAAGAT TAGTACAACT AATTAAATCA GAAAGAGGAT TTTGGCTCTA TGATATAGAA
AAAAAAGGAG TTTACATTTC AAACGGCTTT GAATACATTT CTATCCAAAT AAATGGTAGC
ATTAATTTTA TCCAAGATGT AATGACTGAA AGTGAGTTTG AACATCTTAA AAAGCTTGTA
AGAGACAGCA TAGAAAAAGG GCAAAATGGT TTTTCGGGAA GAATAAAACT GAAGGATGGT
AGATGGATTT TTGTATGTGC TACTATTTTG TATAGTGAAC AGGAGCCTTT AAAGATAGTT
GGGACATTTG AGGATGTAAC TCCACATGTT TCGTGTGATT TGAGATTAAG TAGATATATT
GAACTTATAG CATATTATGA TGAAATAACC GGTCTTCCTA ACAGGAACTT TTTGAATGAA
GTCTTGAGAG AAAAAATTGA TAAGAGCAAA AAAGATAATT CTGGCTTTTG GGTAATATTT
ATAGAGGTAA GTAACTTTGG GTATATAAAT GAGCTATTTG GTCATTCAGT AGGGGACGAG
TTTTTAAAAG CTGTAACATT TGAGATAAAA AAGTTTTTGC CAAGGGATTG GACATTTTGC
AGATTTGGTG GAGATGAATT TGTTGTTTTG ACATCAAACA TACAGAGAAC TTCTGTTGCA
ATGGTAGTTG AAAACCTCAT AGAAAGATTT TCAAGACCAT GGAATGTGAT GGGAAAGTGG
TTATATGCAA ACATAAACGT TGGGATTTCA GGGTATCCGC AGGATGGAGA GTTTGCTGAT
ACGCTTTTAA AAAATGCAGA GATTGCTCTG ACCGCAGCTA AAAAACATGG AAGAGATTTT
GGAAAGAGCC AATATGAATT TTATAAAATC TCAATGGAAG AAGAGATTTT GAGAAGAGTT
GAGATTGAGT CAGAAATTTC AAGAGGAATT CAGGAAAGAC AGTTCTTTTT GGTTTATCAG
CCGAAGGTAT CTTATGATGG TAGAACGGTA GTCGGGTTTG AATCACTTCT TAGGTGGAAC
TCCCAAAAAG GAATTTTAAC TCCGGACAAA TTTATTCAAA TTGCAGAAGA GAGCGGGCTT
GTGTATGATT TGAACAGGCT TGTGCTTGAT ATTGTTTGTA AAGATATAAA GTACATTAAG
CAAAAAACTT CAAAAACTAT TCCTGTTGCT GTAAATTTGT CTGGTAAGGA ATTTGCAATG
TACAATATGA TTGGAGTTTT AAATGAACGT CTAAAAGCAC ATAGTCTTCT ACCTGAAGAT
ATTGAGATTG AAGTTACAGA AAGGGTAATA TTAAACAATG TGGAATTAAC AAAACAAATT
TTAAATAGCT TACATGAGAA AGGTATAAAA ATATCTATAG ATGACTTTGG CACAGGATAT
TCTTCATTTG AACTTCTTCT GCAGCTTCCA ATTTATGCTC TTAAAATTGA CAAGAAATTT
ATTAATAAAA TACACTTATT TGGCAATGAA TATATAATTG TAAAGAATAT AATTTACATG
GCAAAGGAAA TGAATTTAAA GACAATTGCA GAGGGTGTTG AAAATAAAGA ACAATATGAT
ATATTGCGAG AGCTTGGTTG TGACCAATTT CAGGGTTATT ATTTTTCAAA ACCTGTAAGT
ATAGAAGAGG TTGTAGTTAG CAATAATGAT AACTAA
 
Protein sequence
MSESSNDFWV NNFKNIIEGN YERLVQLIKS ERGFWLYDIE KKGVYISNGF EYISIQINGS 
INFIQDVMTE SEFEHLKKLV RDSIEKGQNG FSGRIKLKDG RWIFVCATIL YSEQEPLKIV
GTFEDVTPHV SCDLRLSRYI ELIAYYDEIT GLPNRNFLNE VLREKIDKSK KDNSGFWVIF
IEVSNFGYIN ELFGHSVGDE FLKAVTFEIK KFLPRDWTFC RFGGDEFVVL TSNIQRTSVA
MVVENLIERF SRPWNVMGKW LYANINVGIS GYPQDGEFAD TLLKNAEIAL TAAKKHGRDF
GKSQYEFYKI SMEEEILRRV EIESEISRGI QERQFFLVYQ PKVSYDGRTV VGFESLLRWN
SQKGILTPDK FIQIAEESGL VYDLNRLVLD IVCKDIKYIK QKTSKTIPVA VNLSGKEFAM
YNMIGVLNER LKAHSLLPED IEIEVTERVI LNNVELTKQI LNSLHEKGIK ISIDDFGTGY
SSFELLLQLP IYALKIDKKF INKIHLFGNE YIIVKNIIYM AKEMNLKTIA EGVENKEQYD
ILRELGCDQF QGYYFSKPVS IEEVVVSNND N