Gene Mlg_1116 details


Gene Information

Locus tag: Mlg_1116
Symbol:
ID: 4269840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1305923
End bp: 1307407
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 66%
IMG OID: 638125868
Product: diguanylate cyclase with PAS/PAC sensor
Protein accession: YP_741958
Protein GI: 114320275
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR00254] diguanylate cyclase (GGDEF) domain
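
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1305923
END_BP = 1307407


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")                  # matches the Gene Length field
    print(protein_length(n_bp), "aa")  # matches the Protein Length field
```

Under these assumptions the computed values agree with the 1485 bp and 494 aa listed in the record.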


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.464201
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCA AGGATAGTGA GACCCCGCGG CCGCGCCAGC ACCCCGAGGC CATTCGGGAG 
CAGGGGACAG CTCCCCAGGT GATCGACGGC GCCTTCCCTG ACCGGGAGAT GGTGGCACGG
GCCGCGACCA ATGGACTCTA TCATGGGCGA GGGCCACTGG TGAGCGGGCG AGCGCTTACG
CCAGTGGCCA CACCGGTGGC GGCCGATCCG GTGTCGATCC TCCTCCTGGG CCTTGATCCC
GCAACGGCGG CCAGGGTCAT GGCGGTGGTC GCCACTCACT TCGGCCAGCC CTTCGACATG
GTATCGCGTG CCATTGATGC AAGGGCGATC CGTGCCGCGG ACATCGTGCT CTTCGACGTG
GCCGTTCCGG ACTACGCGAT CAGTCAGGCA CAACTCGCTG CCCCCGAGGC ACTGATCCTG
CCGTTGGACG GGGGCCGGCT GGAAGGGGAT AGCTGGCTTC CCGCCATCCT CCGCCACGTC
GCCCGGCAAA AGGCCGTGGA ATCGGGCCGG CAGGTGGCGG AGGAGGCGTT GTTCCAGAAA
GCGGAGCGTG CCCGTGTCAC CCTGGAATCC ATCGGCGACG CCGTGCTGGT GACGGACAGC
CTGGGCTATG TGACGTATCT CAACCCGGTG GCGGAAACCC TCACCGGCTG GTCGCGGGAC
GAGGCCTCCG CCCAGCCTCT GGCGACCGTC TTCAAGATCG TGGACGGGGC CGCCGGCGAG
TTCGCGCTGA ACCCTGCGGT TACTGCGATG AACGAAGACA GAACCGTCGG GCTGGTGGCC
AATTGCATCC TGCTGCAACG CGATGGCGGC AGCATCGGCA TCGAGGACTC CGCGGCACCG
ATCCATGATC GTAACGGCCG GGTCACCGGT GCGGTCATCG TCTTTCGCGA TGTCAGTCTG
TCGCGCTCGG TGACCCAGAA GATGGCCTAT CTGGCTCACC ATGACAGCCT GACCGGGTTG
CCCAACCGAG CGCTGCTCGC GGAACGGCTT GGCCGGGCCC TGGGGGCCGC CCGCCGCCAT
GACCGGCAAC TCGCCCTGCT GTTTCTGGAT CTGGATCACT TCAAGCGCAT CAACGACACC
ATGGGCCATG ATATCGGCGA CCACGCCCTC CGCGGGACCG CCTATCGGCT TTCGGACTGT
GTCCGCGAGA CCGATACCGT GAGCCGTCTC GGGGGTGACG AATTCGTGGT TCTGCTCGAG
GAGATCGATA GCCCGGACGA TGCGGCGCAC ATCGCGGAAA AAGTGCTGGC GGCCATCACG
GCGCCGCTGC ATGTGGGTGA CCACACCCTG CAGGTCTCCG CCAGCATCGG CATCAGTATC
TTTCCCCAAC ACGGTGCGGA TGCCGAGACC CTCCACCAAC GGGCCGATGC CGCCATGTAC
CAGGCAAAGG CAAAGGGCCG GGCGGGCTAC CAATTCTTCC AGGCCGATCC GGAGGGTGAA
GCGGGAGTGC CGCATTCCGG CAATTCGACA AAGCGCGTAG GATAA
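
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that, assuming the sequence is pasted as text in the 10-base blocks used above. The helper name `gc_content` is hypothetical, and only the first line of the sequence is used here as sample input, so the printed value describes that line alone and not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence above (60 bases).
    first_line = (
        "ATGAAGTTCA AGGATAGTGA GACCCCGCGG "
        "CCGCGCCAGC ACCCCGAGGC CATTCGGGAG"
    )
    print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1485-base sequence to the same function would give the figure to compare against the GC content field.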
 
Protein sequence
MKFKDSETPR PRQHPEAIRE QGTAPQVIDG AFPDREMVAR AATNGLYHGR GPLVSGRALT 
PVATPVAADP VSILLLGLDP ATAARVMAVV ATHFGQPFDM VSRAIDARAI RAADIVLFDV
AVPDYAISQA QLAAPEALIL PLDGGRLEGD SWLPAILRHV ARQKAVESGR QVAEEALFQK
AERARVTLES IGDAVLVTDS LGYVTYLNPV AETLTGWSRD EASAQPLATV FKIVDGAAGE
FALNPAVTAM NEDRTVGLVA NCILLQRDGG SIGIEDSAAP IHDRNGRVTG AVIVFRDVSL
SRSVTQKMAY LAHHDSLTGL PNRALLAERL GRALGAARRH DRQLALLFLD LDHFKRINDT
MGHDIGDHAL RGTAYRLSDC VRETDTVSRL GGDEFVVLLE EIDSPDDAAH IAEKVLAAIT
APLHVGDHTL QVSASIGISI FPQHGADAET LHQRADAAMY QAKAKGRAGY QFFQADPEGE
AGVPHSGNST KRVG
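
The protein sequence corresponds to the gene sequence translated with translation table 11, which assigns the same amino acids to codons as the standard genetic code. A minimal sketch of that translation follows, assuming the gene sequence is given on the coding strand and in frame. The function name `translate` and the table-building code are illustrative and not taken from the database; only the first 30 bases are used as sample input.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAAGTTCA AGGATAGTGA GACCCCGCGG"))
```

Whitespace is stripped before translation, so the sequence can be pasted in the block layout used in this record.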