Gene Mlg_1831 details


Gene Information

Locus tag: Mlg_1831
Symbol:
ID: 4268186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2089431
End bp: 2090381
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 72%
IMG OID: 638126587
Product: diguanylate cyclase with PAS/PAC sensor
Protein accession: YP_742665
Protein GI: 114320982
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR00254] diguanylate cyclase (GGDEF) domain
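
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; neither convention is stated in the record, although the listed lengths are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2089431
end_bp = 2090381
gene_length_bp = 951
protein_length_aa = 316


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```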


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTGA CCTTTCAGGA ACCCTTGCTG GAGGCCCTGC CCGAGGCGGT GGTCTGGGTG 
CGCCCGGACG GCCGGATCGG CTACCTGAAC CCCCGGGCGT CGCAACTGAC CGGCTGGCCC
GTGGCGAATG CCCGGGGCCA GCCCCTGGGC GCAGTCCTGC ACTTGGAGGA GCAGGGGCAG
CCGCTGCCAC CGGAGGCGCT GGTGGCGCAG TGCCGGCAGC TCGGGCAGGC GGGGGAGCGC
CACGCCCGCC TGCGGCGCCA GGATGGCGAG ACCCTGGAGG TGGCACTGAC CGGGGCCCCG
ATTCAGGACG GGGCCGGGCA GCCCCGCGGG GTCATCCTCT CGTTCCGCGA CATCGGCGAC
TACCTGAAGA TGGCCCGGCG GCTCACCTAC GAGGCCAGCC ACGATGGGCT GACCGGGTTG
GTGAACCGCC GTGAGGCCCT TCGGCGGCTG GAGCGCATGG TGGCGTCGGC CGGGGAGCAG
TGCTGTGAGC ACGCCCTGTG CTACCTGGAC CTGGATCGTT TCAAGGGCAT CAACGATCTG
GCCGGCCACG TGGCGGGTGA CCGGGCGCTG GCCGAGGTGG CGGGCCGGCT GCTCGACTGT
GTGCGCCAAC GGGACACCGT GGCCCGGCTG GGCGGTGATG AGTTTCTGGT GCTGCTGGAG
CATTGCCCGT TGCTCCAGGC CATCCGGGTG GCCCAGGTGA TCCGCGCGGC CGTCCGGGAC
TATCGCTTCC ACTGGCGGGG GCAGACGCTG GGGTTGGGGG TGAGCATCGG CCTGGTGCCG
GTTCTCGGTC ACGGCCCGGG TGCGGAGGCG CTGCTGGAGG TGGCCGACCA GGCCTGCTAC
GAGGCCAAGC GAAGCGGCGG CATAGGGATT CGGGTCCGGT CCGGGCGGGA GCGCTCAACG
TGCCAGGCGC AGGAGGACAT AGGCGGCCCC CGTGCCACCA TCACTGGGTA G
 
Protein sequence
MGLTFQEPLL EALPEAVVWV RPDGRIGYLN PRASQLTGWP VANARGQPLG AVLHLEEQGQ 
PLPPEALVAQ CRQLGQAGER HARLRRQDGE TLEVALTGAP IQDGAGQPRG VILSFRDIGD
YLKMARRLTY EASHDGLTGL VNRREALRRL ERMVASAGEQ CCEHALCYLD LDRFKGINDL
AGHVAGDRAL AEVAGRLLDC VRQRDTVARL GGDEFLVLLE HCPLLQAIRV AQVIRAAVRD
YRFHWRGQTL GLGVSIGLVP VLGHGPGAEA LLEVADQACY EAKRSGGIGI RVRSGRERST
CQAQEDIGGP RATITG
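
The gene and protein sequences above can be compared programmatically. The following is a minimal sketch, not the method used by the source database: it strips the spacing used in the listing, computes a GC fraction that can be compared against the GC content field, and translates codons with the amino acid assignments of translation table 11 (which are the same as those of the standard code; the tables differ in their permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first 20 residues of the protein above.
first_line = "ATGGGACTGA CCTTTCAGGA ACCCTTGCTG GAGGCCCTGC CCGAGGCGGT GGTCTGGGTG"
assert translate(first_line) == "MGLTFQEPLLEALPEAVVWV"
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be checked against the protein sequence and the GC content field of this record.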