Gene Mlg_1619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1619 
Symbol 
ID4269351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1847467 
End bp1849365 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content66% 
IMG OID638126376 
Productdiguanylate cyclase with PAS/PAC and GAF sensors 
Protein accessionYP_742455 
Protein GI114320772 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.702165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA CCCTGCTCGT TATCGGGCTG ATCGCCACGG TCCGGTTCAG GGCCCGTCAC 
GACGCTGGCC GTGACGTGGG CAACCAACCC CAGGCACTTC CCGATGAACC CGCGCTCTTT
CAGAGACTTG CCGGGAACAC CTCGGCCGGA CTCTTCCTGG CCCGGGCAGA TCGCCTGATC
GCGGTCAACC CCGCGCTCTG CCGTATCCTG CGGTGCCCGC CAGAGCAGCT GACCGGGGCT
GAATGGCGGC CGCTGATCCA TTCCGACCAC GCCGATGAGG TCGAGACCCA CGTCAGCGCC
CGCCTGCGCG GGCAACCGGC CAGCCCACTG CACCCGATCA GGGTCCAGCG GGGTGATGGC
ACCACCCGCT GGGTGGAGCT GAGCCTGGAA CCGGTGGCCC TGGGCGGGGA TCCCGCCATG
GTTGGCACCC TCGTCGACAT TGACCGCCAC TGCGCGCTAC AACAAGAGCT CCTGCTGAGT
GAGAGGAAAT ACCGACAACT CGTTGAAAAC GTAAATGATA TAATCTATAC ATTGACACCG
GATGGACGCC TGAGTTACGT CTCCCCCAAC TGGCCGGAGC TCCTCGGCCA CCCGGTCGAT
CAGGTCATCG GCCGGCCCAT TGCCCGGTTC GTGCACCCGG ACGACCTCCC CGGCTGCGAG
GAGTTTCTCA GGCGCCTGTT CCTCACCGGC AAGAAGCAAA GCGGCATTGA ATACCGGGTG
CGGCACAGTG ACGGTCACTG GCGCTGGCAC ACCGCCAATG CTTCACCCAT CACCGGCGAC
GACGGCACCG TCATCGCCTT TGTGGGTATC GCCCGCGATA TCACCGCCAG GAAGGGCATG
GAGGCCGAAC TGGTCTACCA GCTTCGGTTT AACGAGCTGG TCGCTGAACT CTCCACCCGG
TTCGTGCGCA GCCCTGCCGA GGAGACCGAT GCCCGGATCG ACGATCTGCT TCGGCGGGCG
GGCACCCTGT TCAATGCGGA TCGGGCCTAC CTCTACCTCT TCTCCGACGA TGGCGAGACC
ATGAGCAACA CCCACGAGTG GTGTGCGCCC GCGGTCCCCT CGCTGTTGGA GGACAGTCAG
CGGATGCCCG TCGCCCGCTA CCCGTGGTGG CAGCGACACA TGACGGCGCT GCGCGACCAG
CACCAGGTGT TGTTCATCAA TGACGTCTCA GCGCTGCCCG CCGAAGCAGC GGCGGAGCGG
GCGCTGCTGG AGGGGCAGCA GGTGCGCACC CTGGTCTGTG CCCCGGTGGC GACGCCGGAT
CGGGTCATCG GTTTTTTGGG GTTCGACTCG CTGCGTCCCA AGGAGTGGCG GAAGGATCAG
GGCGACCTGT TGGTCGTCCT CGGCAATCTC CTGGCCGCGG CACTGGTGCG ACGGCAACTG
GAGTTGGATC TGCGCAACCT GTCGGTGACC GACCCGCTCA CCGGCCTGTT CAACCGGCGC
TTCCTGCGGG CGCGCCTGCT GGCGCTCATT GAGGAGTATG AGCGCCACGG ACACCGGTTC
TGTGTCAGCC TGGTGGACCT GGATCACTTC AAGGCCCTGA ACGACCGTCA TGGCCATCTG
GCCGGTGACC GGATCCTGCA GGGCTTCGCC AACATCCTGC GGGACAATCA CCGGGTGTTC
GACATCGTCG CCCGCTACGG CGGTGAGGAG TTCGTGGTGA TCCTGGTGGG AACCGAAATC
GGGCAGGCGC GACGGGTGAC TCGCCGCGCC CTGACAACCA CCCGCAACCA TGACTTCCGT
TTCAACGAGA CCCCGCTGTC GATCACCGCC AGCGCCGGTC TCGCTGATAT CGCGGAGTTG
CCCGAAGACC ACCGCACCGT GGAGCATCTC CTGCAACTCG CGGACCACCG CCTCTATCGG
GCCAAGCAGC AGGGCCGCGA CTGCCTGGTG GCCGACTAG
 
Protein sequence
MASTLLVIGL IATVRFRARH DAGRDVGNQP QALPDEPALF QRLAGNTSAG LFLARADRLI 
AVNPALCRIL RCPPEQLTGA EWRPLIHSDH ADEVETHVSA RLRGQPASPL HPIRVQRGDG
TTRWVELSLE PVALGGDPAM VGTLVDIDRH CALQQELLLS ERKYRQLVEN VNDIIYTLTP
DGRLSYVSPN WPELLGHPVD QVIGRPIARF VHPDDLPGCE EFLRRLFLTG KKQSGIEYRV
RHSDGHWRWH TANASPITGD DGTVIAFVGI ARDITARKGM EAELVYQLRF NELVAELSTR
FVRSPAEETD ARIDDLLRRA GTLFNADRAY LYLFSDDGET MSNTHEWCAP AVPSLLEDSQ
RMPVARYPWW QRHMTALRDQ HQVLFINDVS ALPAEAAAER ALLEGQQVRT LVCAPVATPD
RVIGFLGFDS LRPKEWRKDQ GDLLVVLGNL LAAALVRRQL ELDLRNLSVT DPLTGLFNRR
FLRARLLALI EEYERHGHRF CVSLVDLDHF KALNDRHGHL AGDRILQGFA NILRDNHRVF
DIVARYGGEE FVVILVGTEI GQARRVTRRA LTTTRNHDFR FNETPLSITA SAGLADIAEL
PEDHRTVEHL LQLADHRLYR AKQQGRDCLV AD