Gene Mlg_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1109 
Symbol 
ID4269816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1295509 
End bp1298337 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content68% 
IMG OID638125861 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_741951 
Protein GI114320268 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.339045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.881736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATGA GCGAAACCGG AACCGCGCGA CGCCGACTCC GCCTGGGCCT GACCTGGCGG 
GTCATCGCCC TGACCAGTCT GGTGCTGGTG GGGCTGGCAG CCCTGGTCAC CGCGCATGGG
CATGGCACCC TGGAGCGGCA GTTTCAGGAG GCCCGTGAGG CGCACCAGGC GCGCCAGCAT
CGCGAGATCC GCCTGGCGCT GGAGCGATCG GCGGAGGGAC TGCGCGAGCT GGCGGCACTG
GTGGCGTCGG CACCGGAGCT GGCCCGGGCC GTGCAGGCCG GTGATTCGGA GGGTGCGCAC
CGCGCTCTCG ACGGGCAGTG GCCCACTCTG CAGTTGGAGG CCGGGGTGGA GCGTATCGGC
GTCTACGCGC CAGACGGCAC GCAGGTGGCC GCCCGCGGCG GGAGGCCCGC GGTCGCGCCG
GAGGGGATCG AGCGCTGGCT GCATCAGGTC CGTCTGGACG ACAGGCCACT GACCGCATTG
ATCTGCCAGG CGGGCTGTCG TCAGTTCAGC GTGGTGCCCA TGCTGGTGGC GGGTGAGACC
GCCGGTATGG TGATGCTGTC GCGCTCCCTG GTGGACCTCA CCCGCCAGCT GCACGATGTC
TCCGGCAGTG ATGTGGCGCT GCTGCTACGC GGCGACACCA TGGTGGATCA GGCAGGGGAC
ATCCCCCGGT TGCCGGAGTG GGACGCTCAT CTGCTCGCCC TCACCCGCAG TGAGGTGACC
CTGCCCTGGC TGCAGGCCCT GGCCAGCGAG GTTAGCATGG AGGCCCTGGT AAGCGCCCCG
CGGCAGATCG CGATCGACGG ACGGGAGTTG GAGATCAACG CGGTGACGCT GGCCGAGCAT
GAGGCGCCCG GTGGCAGCGG CTACCTGCTG CTGATTACCG ATATCACCCG CCAGCTGGAT
GTCATCCAGT GGCACACCCG CAACCTCTTC TTTATGGGGC TGGCCGGATG GCTCGTGGCG
GAGGCGATCC TGCTGCTCAT CCTGTGGCGT CCGATGCTGC GTCTGCGCCG CCTGGCGCAG
GTCCTGCCGG GGCTGGCCGA GGGCGGCTTT CAGCGGGCGC GTACCGCCAT TCCGGTCGGG
CGCCCCCGGC TGGCCGATGA AATCGATCTG CTGGAGACCA CCACGCTCGG TCTGGCCGAT
CAGCTGGAGG CCCTCGAGCG CGAGGTGCGG CGCCGGCGTG AGCAGGTGGT CTCCCAGCTG
CGCGCCCTGG GGCGTGAGCG GGATTTCGTC AGCAGTCTGC TGGACACCGC CCGGGTCCTT
ATCGTGAGTC AGGACGCCGA GGGGCGGATC ACGCTAATCA ACGACTACGC CCAGGCGGCG
TTGGGCCGTC GCGAGGATGA GTTGGTGGGC GCGCACTTTG ACGCGGTCTT TCCGGGCCTG
GTCCCCATCG GTCGTAACAG CGGCCTGCCT CGGGAGGAGG AGCGCCCGCT GCACAGCCCC
GGCCGGGGGG AGCGGATCGT GGCCTGGTAC CACGCCCCGC TGGCGGCGGA GGAGGGGCGG
CCGGCGGGCC GCATCTCGGT GGGGCTGGAT ATCACCGAGC GCAAGGCCGC GGAGGCGCGG
CTGATCTGGC TGGCCCAGCG TGATCCGCTG ACCGAGCTCT ACAACCGCCG TTACTTTCAG
GAGGCCCTCG ACAGGGCGTT GGCCAAGGGG GTGCACGGGG CGGTGCTGCT GATGGACCTG
GACCAGTTCC GGGATGTCAA CGAGCTGAGT GGCCACCACG CGGGTGATGA GCTGTTGCGG
TTGGTGGCCG GGACGTTGCT GGATCATCTG GAGCACCGGG GCGTGATCGC CCGCCTGGGC
GGCGACGAGT TTGCACTGCT GCTGGAGGAG ACGGATGCCG ACGCGGCGGT CTCCGTGGCC
CAGTACATGG TCAAGCTCCT GGAAGACCTG GGCCTGAGTA TCGGCGAGCG CCGCCACCGG
GTCAGCGCCA GCATTGGCAT TGTGCTGTTC CCCGAACATG GCGAAACCCC GACCGATCTG
ATGGCCAGTG CCGATGTGGC CATGTACAAG GCGAAAGAGA CAGGGGTCCA GCGCTGGCAC
CTGCTCCACG CCCTGGACCA CGCCAAGGGC GAGCTGCAGG AGCGGGTCTA TTGGGTGGAG
CAGTTACACC AGGCGCTCCA GGGTGATGCC TTCGAGCTCA TGGTGCAGCC CATCGTGCGG
CTGCGGGACC GCAGTGTGCG CCACTACGAG GTGCTTGTGC GTATGCGCGA TCCGTCCGGT
GAACTGCTGC TGCCCGGCCG GTTCATCCCC TTCGCCGAGC ACAGTGGCCA GATCGTTCAG
CTGGACCGCT GGGTGCTGCG CGCGGCCCTG AAGGTGCTCC GCCGGGTGCA GTCACAGGGT
ATTGGCCTGG CAGTGAACCT GTCGGGGCAG TCGCTCCACG ACGATGGACT GACGACCTTC
CTGGCGGACG AGCTCCGCGC CAGTGGTGCG GACCCGGAGC ACCTGATACT GGAGATCACC
GAGACCGCGG CGGTTACCGA TTTTTCCACC GCCCGAGGGG TGTTGGAGGG CATCCGCGCC
TTGGGTTGTC AGACGGCATT GGACGATTTC GGGGTCGGGT TCAGCAGCTT CCATTACCTG
GGCCAGTTGC CGGTGGACTA TATCAAGATC GACGGTAGCT TTATCCGCAG CCTGCCCCAC
AACGAGGACA GCCGGATTAT CGTCAGGGCC ATCGCCGACA TTGCGGCCGG TTTCGGCAAG
GCGGCCATTG CCGAGTTCGT TGACCAGGAG GTCCTGGTGC CGATGCTGCG TGACTACGGC
ATCGCTTACG GCCAGGGCTA TCACCTGGGC CGGCCGGTGC CAGTGGAGGA GGCCTTCGGG
CCAGCCTGA
 
Protein sequence
MRMSETGTAR RRLRLGLTWR VIALTSLVLV GLAALVTAHG HGTLERQFQE AREAHQARQH 
REIRLALERS AEGLRELAAL VASAPELARA VQAGDSEGAH RALDGQWPTL QLEAGVERIG
VYAPDGTQVA ARGGRPAVAP EGIERWLHQV RLDDRPLTAL ICQAGCRQFS VVPMLVAGET
AGMVMLSRSL VDLTRQLHDV SGSDVALLLR GDTMVDQAGD IPRLPEWDAH LLALTRSEVT
LPWLQALASE VSMEALVSAP RQIAIDGREL EINAVTLAEH EAPGGSGYLL LITDITRQLD
VIQWHTRNLF FMGLAGWLVA EAILLLILWR PMLRLRRLAQ VLPGLAEGGF QRARTAIPVG
RPRLADEIDL LETTTLGLAD QLEALEREVR RRREQVVSQL RALGRERDFV SSLLDTARVL
IVSQDAEGRI TLINDYAQAA LGRREDELVG AHFDAVFPGL VPIGRNSGLP REEERPLHSP
GRGERIVAWY HAPLAAEEGR PAGRISVGLD ITERKAAEAR LIWLAQRDPL TELYNRRYFQ
EALDRALAKG VHGAVLLMDL DQFRDVNELS GHHAGDELLR LVAGTLLDHL EHRGVIARLG
GDEFALLLEE TDADAAVSVA QYMVKLLEDL GLSIGERRHR VSASIGIVLF PEHGETPTDL
MASADVAMYK AKETGVQRWH LLHALDHAKG ELQERVYWVE QLHQALQGDA FELMVQPIVR
LRDRSVRHYE VLVRMRDPSG ELLLPGRFIP FAEHSGQIVQ LDRWVLRAAL KVLRRVQSQG
IGLAVNLSGQ SLHDDGLTTF LADELRASGA DPEHLILEIT ETAAVTDFST ARGVLEGIRA
LGCQTALDDF GVGFSSFHYL GQLPVDYIKI DGSFIRSLPH NEDSRIIVRA IADIAAGFGK
AAIAEFVDQE VLVPMLRDYG IAYGQGYHLG RPVPVEEAFG PA