Gene Mlg_0665 details


Gene Information

Locus tag: Mlg_0665
Symbol:
ID: 4268277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 726956
End bp: 730111
Gene Length: 3156 bp
Protein Length: 1051 aa
Translation table: 11
GC content: 67%
IMG OID: 638125414
Product: diguanylate cyclase/phosphodiesterase with PAS/PAC and GAF sensor(s)
Protein accession: YP_741509
Protein GI: 114319826
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR00254] diguanylate cyclase (GGDEF) domain
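
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 726956
end_bp = 730111
gene_length_bp = 3156
protein_length_aa = 1051


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the listed gene length (3156 bp) and protein length (1051 aa) are consistent with the listed coordinates.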


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGCCT TGCGCAAACC GGGCCACGGC CCCGCCCTGA CCATCTGCCT GGCCTACGCG 
GCGCTCGGGC TGCTGTGGAT TTTCGGCTCA GACCGCCTGT TGCTGGTGCT CATAGAGGAC
ACCGAGCTGG TGGCCCGGCT GCAAACCGCC AAGGGCTGGT TCTACGTGCT GGTGACCGCC
GGGCTGCTCT ACCTAGTGAT CCGCTTTACC ATCAACACCA CCCGGCTGGA AGAGCAGGCC
CGGCGCCCTC GCGCCCCTGA TTCCGACCTG CTCCCCGTTG CCGGCGACTA CTGGGTGCCC
GTGGGCATCC TGACCGGGCT CGCCGGCGGA TTGGCGGCGC TCGGTTACCT CTTCTACCAC
CAGGTCTCCA GTACCCCCAC GCCCCCGCCG GAAGCGGATA TCCGGCACCT GGCGCGCTGG
AGCTTCACCA CCGGTCTGCT GCTGGTGGTC GCAGCGGGCA CGGGCATCAT CCTCTGGTGG
CGGGGCCATG TGCACGGTCT GGAGGCCCGA CGGCTACGCA CCGAGTTGAT GGCGCAGGCC
GACCTCGCCG CGCGAGAGCG GGCCTACGAG ACCCTGGCGG AAAACTCACC CGACATCATC
GTCCGACTCG ACCGCGACCT CCGGCATCTC TACGTCAACC GTGCCCTGGA GGCGTTCACC
GGCATCCCGC GGGAGCACTA CCTGGGCCGA ACACACGAGG AGGTCGGCTT CCGCGGCAAA
CTGCTCGCGA CCTGGGAGCG GCACCTGAAC CAGGCGCTGG AAAGCGGCGA ACCCGGGACC
ATGACCTTTT CCCATCCGGA CCGCGAGGGC CGGGTCCGCC ACTTCGAGGC GCTGATCGTC
CCGGAGACCG ATGACACCGG CAATATCGCC CATGTGCTCT GTATCAGCCA CGACGTGACC
GGCCACGTCC AGGCCCGCGA GGAGATCAGC CGACTGCGCG ATCTCTACGC TGCCCTCAGC
CAGGCCAATC AGGTCATCAT CCGTATCGAC GACCCCAAGG AATTGTTCCA GCGGACCTGC
GAGGTGGTGG TCCGGTATGG CCGACTCGAC ATGGCCTGGA TCGGGTTGGT GGATCAGGCC
AGTGGCCGCG TCACCCCGGT GAGCCGGGCC GGTCGCGACA TGGGTTATCT GGACGAGATC
CGTATCAGTA CCGATCCTGC GCGCCCGGAG AGCCGGGGCC CGGTAGGACG CGCCATCCGC
CAGCGGCGGG CCAGCGTGTT CGACCGGTTC CTGGATCACC CGGACGCCGG CCCCTGGCAG
GCCGCCGCGG CCCGCCGCGG GTTTCGCTCG GTCGCCGCCT TCCCCCTGTT CCAGAGCGAC
GACTGCATCG GCGCATTGGC CGTCTACTCC AGCGAGGAGC GCTTTTTTAG CGCGGATATC
ATCCAGCTCA TGGCGGAGAT GACCGGGGAC CTCTCGTTCG CGCTGGACAA CATAGAGCGG
GAACAGGCCC GCAAGGCGGC CGAAGAGCAG ATGCGCCTGG CCGAGGAGGT GTTCGAGCAC
AGCACCGAGG CCATCATCAT CACCGATGCC AGACAGCGGA TAATGCGGGT GAACCGGGCC
TTCACTGAAC GCACCGGCTA TGTCCCCGAC GAGGTGCTCG GCCGGCGCCC GCACATGCTC
CGCTCCCCCC ACCACACGCA ACAGTTCTAC CGCCGTATAC TCGCCCGGCT CCGCCGCCAC
GGCTTTTGGG AAGGCGAGAT CTGGAACCGG CGCAAGGGGG GCGAAAACGC GCCCCTTTGG
CTCACCCTAT CGGTGGTCCG GGACGAGGCA GGGCGGATCA CCCACTATAT CGCGGTAGGC
CTTGACCTTA CCGAGACCAA GGATCGCGAG GCGCACATCC GCTACCTGGC CCAGCACGAT
CCGGTCACCG GGCTGCCCAA TCGCGGGTTG CTCGCCGACC GATTGGGACA GGCGTTGCGC
CGCGCCCAGG ACCAGCACTA CCGGCTGGCC CTGCTGTCGC TGGACCTCGA CCGTTTCAAG
ATCATCAACG AAACCCTTGG TCATGTGGCC GGTGACGAGG TGCTGCAAAC GGTGGCGCAG
CGCCTGCAAG ACCACGTGAA CGGGGCGGGC ACGGTCTCCC GGATCGGCAG CGATCAGTTT
CTGATCCTGC TGCCGGAGAT CCACAAGCCC ACCGAGGCGG CCCAAGCCGC CGAACGCCTG
ATGGCCTGCG TCGCCGAGCC ACTGGACCTG AGCGGACAGG AGCTGACCCC CTCCAGCGTC
GTGGGCATCG CACTCTACCC CGAAGACGGG GATAACCTCG AGACCCTGCA CAGTCACGCG
GATGCGGCCA TGTCGATGGC CAAACAGGCC AGTGGCCATG AGGGCTACCG CTTTTTCAGC
AGCGACATGA CCAGCCGCGC CCGCGAGCGC CTGTCGCTGG AGAGCCGGCT GCGCCGCGCA
CTGGACCGGG GTGAATTCCG CCTACATTAC CAGCCGCAGG TAAGCCTGGC CGACGGCACC
CTCACCGGAA TGGAGGCCCT GCTGCGTTGG CACGCCGGGG ACGAGGGCGA CATCTCACCG
GCGCGCTTCA TCCCCATCGC CGAGGAGACG GGGCTGATCG TCCCACTGGG CCGCTGGGTG
CTCGGCGAGG CCTGCCGCCA GAACCGCGCC TGGCGGCAGG CCGGGCTTCC CGCCCTACCG
GTCTCGGTGA ATCTATCGGT GGTCCAGCTC CGTCGCAGCG ACGTGGTCGC AGACGTGCGC
CGGGCACTGG AGACCAGCGG CCTGCCTGGT GACGGGCTCC ACTTGGAGGT CACCGAAAGC
CTCTTTTTGA GTGAGGACGA CCCGGCGGTG GTCACCGGTT TCCAGACCCT TCGGGAAATG
GGCATCAGTC TGGCCATTGA CGATTTCGGG ACCGGTTATT CCAACCTCGG CTATCTCAAG
CGATTGCCCA TCGCCAAGCT GAAGATTGAC CAATCCTTCG TCCGCGGCAT AGGCGACAGC
GGCCACGACA CAGCGATCAA CCAGGCCATC ATCAGCATCG CCCGCAGCCT TGGCCTTGGG
GTCATCGCCG AAGGGGTGGA GACCGCGGCC GAGTTACGGG TATTACAGGC GCTGGGCTGC
GACGAAATCC AAGGCTTTTA CTACAGCCGC CCGCTGGATG CGGAGCGCGC CGGCCAACTG
ATGGCGGCTC CACCCCCACT GCATCATGGG GCCTGA
 
Protein sequence
MPALRKPGHG PALTICLAYA ALGLLWIFGS DRLLLVLIED TELVARLQTA KGWFYVLVTA 
GLLYLVIRFT INTTRLEEQA RRPRAPDSDL LPVAGDYWVP VGILTGLAGG LAALGYLFYH
QVSSTPTPPP EADIRHLARW SFTTGLLLVV AAGTGIILWW RGHVHGLEAR RLRTELMAQA
DLAARERAYE TLAENSPDII VRLDRDLRHL YVNRALEAFT GIPREHYLGR THEEVGFRGK
LLATWERHLN QALESGEPGT MTFSHPDREG RVRHFEALIV PETDDTGNIA HVLCISHDVT
GHVQAREEIS RLRDLYAALS QANQVIIRID DPKELFQRTC EVVVRYGRLD MAWIGLVDQA
SGRVTPVSRA GRDMGYLDEI RISTDPARPE SRGPVGRAIR QRRASVFDRF LDHPDAGPWQ
AAAARRGFRS VAAFPLFQSD DCIGALAVYS SEERFFSADI IQLMAEMTGD LSFALDNIER
EQARKAAEEQ MRLAEEVFEH STEAIIITDA RQRIMRVNRA FTERTGYVPD EVLGRRPHML
RSPHHTQQFY RRILARLRRH GFWEGEIWNR RKGGENAPLW LTLSVVRDEA GRITHYIAVG
LDLTETKDRE AHIRYLAQHD PVTGLPNRGL LADRLGQALR RAQDQHYRLA LLSLDLDRFK
IINETLGHVA GDEVLQTVAQ RLQDHVNGAG TVSRIGSDQF LILLPEIHKP TEAAQAAERL
MACVAEPLDL SGQELTPSSV VGIALYPEDG DNLETLHSHA DAAMSMAKQA SGHEGYRFFS
SDMTSRARER LSLESRLRRA LDRGEFRLHY QPQVSLADGT LTGMEALLRW HAGDEGDISP
ARFIPIAEET GLIVPLGRWV LGEACRQNRA WRQAGLPALP VSVNLSVVQL RRSDVVADVR
RALETSGLPG DGLHLEVTES LFLSEDDPAV VTGFQTLREM GISLAIDDFG TGYSNLGYLK
RLPIAKLKID QSFVRGIGDS GHDTAINQAI ISIARSLGLG VIAEGVETAA ELRVLQALGC
DEIQGFYYSR PLDAERAGQL MAAPPPLHHG A
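
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that, compute a GC fraction, and translate a coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard assignment, which translation table 11 shares for internal codons; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for internal codons ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCCAGCCT TGCGCAAACC GGGCCACGGC"
print(translate(clean_sequence(first_blocks)))  # MPALRKPGHG
```

The translated fragment matches the first block of the protein sequence in this record. Pasting the full gene sequence into `clean_sequence` would allow the same comparison over the whole CDS, and `gc_fraction` could then be compared against the GC content field.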