Gene Mlg_2683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2683 
Symbol 
ID4269558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3039469 
End bp3042540 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content65% 
IMG OID638127442 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_743513 
Protein GI114321830 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.000000849371 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCTGGG GGCTGGTCCT GCTGGCCCTT TACGGTAACT GGCAGGATAA GGTCGAGTTG 
CACCACGAGC GCCAGCAGGG CGCGCTCGAG ATTGCCTACC AGTCCACGGT GAACACCTTT
CAGTTGGCAG CGCAGGCCTA TCTCGACGAG GCGGTGGCGC GCCCCGAAGT GCTGGACCTG
ATGCGTGTCG GACAAGAGAC CTCCGACGCC CAGGCGCAGG CCGTGGTTCG GGGACGTCTC
TACCGGGCGC TCTGGCCCAC GTATCAGCGG CTGCATGACG GGGAACTGCG CGAGCTGCAC
CTGCACACCG CGGACGGCAG GAGCTTTCTG CGGTTCTTCG CACCGGAGTA TTACGGTGAC
GATCTGCTGA CCACCCGGCC ATTGGTGCGC CAGGTCCACC AGACCCGTCG GCCTGCATCG
GGCTTCGAGA CCGGCCGCTC GCTCTCCGGC TTCCGGTATG TCTACCCGCT GTTCGACGGC
GATGAGTTTC TCGGCAGCGT GGAGTTCAGT ATCCCCTTCC GCTACGTGCG GGAGATGATG
GATCGGTTGG ATCAGGGGCA TGAGTTTCAA CTCATGCTGC GCCGCGAGAG GGTCGAGGAG
AGGGTTCCGC CGGGGTTCCT GGCACTCTAC GAGCCCGCTC CTCTGCATCC GGGGTTCGTG
GTGGAGGACG CCGGGTTGCG GCTACCCGAT TCGCCACCGC CACCGTCCGC AGAGGTGCTC
GCCATCAGCG ACAAGCTGGG CGGCATGCGG GGCGTAAGGG CCACCATTGA TGCCGGCGCA
GCGGACGTTG TCCCGTTGGA TTTTGCCGGA CAGACCTGGG CCGCTGTCCT GCTGCCCATC
CACGACCCCG CCGGCGAGCT GGTGGCCTAC GTGGTGTCAT TTGCGCCGGA TCCTTTTTCC
CGGATGTTTC GGTGGGATTT CATACGGGCC GCGCTGGTGG CAACGCTGTT GCTGGGCGCG
GTGCTGATGC TGTTGCTGCG GGTGTTGCAC TCGCGCGAGG TGCTGCGTGG CGAGCGGGCG
TACCAGCAGG CGATCACCGA CACGATGGCC GATGGTCTGT ACGCCCAGGA CAAGAAGGGC
CGGCTCACCT TTATCAATCC CGTCGGCAGG GACCTGCTGG GTTACCGCGA GGAGGACGTC
CTTGGGCGGC GCGCTCACGC CCTGTTTCAC CAACACGACG ACGATGGCCT GGCGGGGTCG
GGGCAGTGTC CGCTCGAACG GCGCGTGGCG GCCGGCCGCT ATTTTGAAGG TGAGGAGACG
TTCCGCCGGC GGGACGGCAG CGAGATCCCG GTGGAAGTCT CCAGCCGCCC GCTCTACCAG
GGGGGGCATT GGGCTGGATC GGTCACCCTG TTTCGCGACA TCACCGAGCG CAGGCGGGCG
AGGGCACGGT TGAAGCTGGC AGCCAGCGTG TTCACCAGTG CTAACGAGGG GATCCTCATT
ACTGATCCCC AGGCCCGCAT CCTGATGGTG AACCAGGCTT TCACCCGCAT CACCGGCTAC
CACCAGGATG AGGTCGAGGG CCGCGACCCC AAGCTGCTGG CATCGGGGCG GCACGGGCCG
GAGTTTTTCC AGGAGCTTTG GCGCGTCCTG GATGAGCACG ATCATTGGCA GGGCGAGCTC
TGGAACCGGC GCAAGAGCGG AGAGCTCTAT CTCCAGTTGA TCACCATCAG CGCCGTGCGC
GATGAGTCCG GTCGCCTGAT CAATTACGTG GGGCTGTTGT CGGACATCAC CGATATGAAG
GCCTACCAGC GGAAGCTGGA GTTTCTGACC CATTACGACC CGCTCACCGA GCTGCCCAAC
CGCGTGCTGT TGCTTGACCG TCTGCGCAAG GCCATGCAGC TGGCTCAGCG GGAGCAGCGT
CGGGTCTGTG TCGCCTACGT GGACCTGGAT GATTTCAAGA CCATCAATGA CGTGCATGGC
CACCGGGTGG GGGATCAGGT GTTGCTGGAG CTTGCGAGCC GTCTCAGTGC GGCGCGTCGG
GAAGGGGACA CCGCCTCGCG CCTGAGCGGG GATGAGTTTG CGATGGTCTT CGTCAATCTG
GGCAGTGTCG CCGAATGCCA CCGGATGGCG GCCCGAGTGC TGGACGCAGT CGCCCGCCCG
ATAACGATTG AGGGCGTTGC GCTGCAGACC TCCGCCAGCA TAGGTGTGGC GCATTATCCC
CAGGACGAGG AGGTGGATGC CGAGCAACTG CTGCGCCAGG CGGATCAGGC CATGTATCAG
GCCAAGTTGC AGGGCAAGAA TCGCCACCAT GTCTTCGATG TTGCCGGCGA CCGCCAATTG
CGCGATCAGC ACGAGAGCCT TACGCGCCTG GAGCTTGCCC TGGCGAGGGG TGAGTTCGTG
CTTCACTACC AGCCCAAGGT GAACCTGCGG ACCCGGGAGG TGGTGGGCGC GGAGGCGCTG
ATCCGCTGGC AGCATCCGGA GCGCGGGCTG CTGCCCCCGG CTGCCTTCCT GGACGACCTG
AACGGACAAC CCCTGGAGGT GGCGGTCAGT CGATGGGTTA TGGCCCGGGC CTTGGAGCAG
GTCGAGACCT GGAGCGAGCA GGGGGTCCAC CTGCCGGTCA GCGTCAATGT GCCGGCCCTG
CATTTGCAGC AGGCCGATTT CGTGGATCAG ATCCGTGAGT TGTTGGCGCG CCACCCCGGC
CTGCCGCGGA ACAGCCTGGA GTTGGAGATC CTTGAGTCCA GCGCACTGGC CAGCCTGGAC
CATGTCTCCC GGGTCATTCA GGGGTGCGCA GCGCTGGGTG TGGATTTCTC ATTGGATGAC
TTCGGCACTG GCTACGCCTC GCTTTCGTAT CTCAAGCGGA TCCCGGTGCG CATCGTGAAG
ATCGATCGCA GTTTTGTGCG GGATATGCTC GAGGATGCCG ACGACCTGGC CCTGCTTGAG
GGCATCGTGC GCCTGACCCA GGTCTTTGAC CTTCAGCTCA TCGCCGAGGG CGTGGAGACA
CCGGAGCAGG GTGAGCGGTT GTTGGAGTTG GGCTGCGAGC AGGCCCAGGG ATACGGGATC
GGGCGCCCGA TGCCAGCAGG GTCACTGCTG GAGTGGCTTT CGGCCTGGCA ACGGCAGGGG
GGCACGGCCT GA
 
Protein sequence
MVWGLVLLAL YGNWQDKVEL HHERQQGALE IAYQSTVNTF QLAAQAYLDE AVARPEVLDL 
MRVGQETSDA QAQAVVRGRL YRALWPTYQR LHDGELRELH LHTADGRSFL RFFAPEYYGD
DLLTTRPLVR QVHQTRRPAS GFETGRSLSG FRYVYPLFDG DEFLGSVEFS IPFRYVREMM
DRLDQGHEFQ LMLRRERVEE RVPPGFLALY EPAPLHPGFV VEDAGLRLPD SPPPPSAEVL
AISDKLGGMR GVRATIDAGA ADVVPLDFAG QTWAAVLLPI HDPAGELVAY VVSFAPDPFS
RMFRWDFIRA ALVATLLLGA VLMLLLRVLH SREVLRGERA YQQAITDTMA DGLYAQDKKG
RLTFINPVGR DLLGYREEDV LGRRAHALFH QHDDDGLAGS GQCPLERRVA AGRYFEGEET
FRRRDGSEIP VEVSSRPLYQ GGHWAGSVTL FRDITERRRA RARLKLAASV FTSANEGILI
TDPQARILMV NQAFTRITGY HQDEVEGRDP KLLASGRHGP EFFQELWRVL DEHDHWQGEL
WNRRKSGELY LQLITISAVR DESGRLINYV GLLSDITDMK AYQRKLEFLT HYDPLTELPN
RVLLLDRLRK AMQLAQREQR RVCVAYVDLD DFKTINDVHG HRVGDQVLLE LASRLSAARR
EGDTASRLSG DEFAMVFVNL GSVAECHRMA ARVLDAVARP ITIEGVALQT SASIGVAHYP
QDEEVDAEQL LRQADQAMYQ AKLQGKNRHH VFDVAGDRQL RDQHESLTRL ELALARGEFV
LHYQPKVNLR TREVVGAEAL IRWQHPERGL LPPAAFLDDL NGQPLEVAVS RWVMARALEQ
VETWSEQGVH LPVSVNVPAL HLQQADFVDQ IRELLARHPG LPRNSLELEI LESSALASLD
HVSRVIQGCA ALGVDFSLDD FGTGYASLSY LKRIPVRIVK IDRSFVRDML EDADDLALLE
GIVRLTQVFD LQLIAEGVET PEQGERLLEL GCEQAQGYGI GRPMPAGSLL EWLSAWQRQG
GTA