Gene Mlg_1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1301 
Symbol 
ID4268638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1500873 
End bp1502558 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content68% 
IMG OID638126052 
Productdiguanylate cyclase 
Protein accessionYP_742140 
Protein GI114320457 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGG GAAACACCAC CGGAAACCGA GGATACCGAA GAGACCGGCC TTTGGCGGCG 
TGGCTCATCG CCCTGTTCCT GGCTCTGCCC GGCCTGGCCG GTGCCGAGCT ACAGCTGAAA
CGGCTCGACC ACCACGACCC CCACCCCCCG CCGGTCCATC TGCTGCCTGC CGGTCAGGTG
TTGGACGCCA GCCGGCCCAT TAACCTGGGG CTTTCATCGC GCCCGGTCTG GCTGCACCTG
GAGATCTCGG CACCGCCCCC CCGGGTCCTG CACCTGGACA ATCCACTGCT GCACGATGTG
CGCCTGTTGC GGGTGGATAG TGCGGGGGAT CTCCATTGGG CCCCGGTGCC CCGGCGGACC
ATGGCCCGGC TCGGCGGTGG CAGCAGCGAG GCCTGGCGCC CCATCTTCGA TGTCACCCCG
GGGCAGTACC TGTTGCGAAT CGAGAGCACC CAGGCACTGC GCTTCGATCT CCGCCTGCAG
TCCCCGGAGG CGCTGGTGGG CGACTGGCGT GCCTTTACCC TGGCGCAGGG CCTCTTCCTG
GGGCTGGTCC TGGCCCTGGC GGCCTACAAC ACCATCCTGC TATTCCGGCT CAGGGATGCG
AGCTACCTCT GGTACGTCGG CTTCATTCTC GGCCTCGCCG GCTACTTCGT GTTTCAGAAG
GGGCTCACCC ACGAGTTCTG GCCCGGCCTG GGGATAGCGC TCAACGAGGC GCTGATGTTC
ACCGCCCTTT CGCTGGGCTG CGCCAGCGCT ATGTGGTTCT GCCGCCGTTT CCTGATGACC
GAGCAACGCG ACCCGGCAAT GGATCGCCTG TTGCCGATAG CCGCCCTCTG CAGCCTCGCC
CTGGCCCCGC TGGCTTGGGG CCTTCCCGGC CATGCCACAC TGCTTTACGC CAGCGCCATC
GGACTGCTGG CCATGGCCAG CTACCTCTTC GCCACCGCGC GCGCAATCCG CGACCATGAC
TTCCCACCCG CACGCTGGCT GCTATTGGCC TGGCTCGTCA TGGTGGTGGG GGCCCTGCTC
TTCACACTGG CCGGCCTGGG GCTGGTCCCT CATCACTTCG TAACGTACTA CGGCTTCCAG
ATCGGCGTAA GCCTGCAGGC GGTGCTCCTG TCATTGGCCC TGGCCGACCG CATCGGCCTG
CTCCAGGCGG AGCGGGAGGC GCTGCTGAAT GAGCAGGCGC AGCTGCGCCT GACGGCCTAC
ACCGATGGTC TCACCGGGTT GTACAACCGA CGCTACCTGG ACGAATTCCT GGCCCGTGCC
GTGGAGGACG CCGAACGCCG GCAGCGGGAG CTGGCCGTAG TGATGCTGGA TCTGGACGAC
TTCAAGCCGT TCAACGACCG CTGGGGCCAC CAAGTGGGGG ATCGCGCCCT GCAACACCTG
GCCTACCTGA TGCAAGAGGT GGTACGCGGC ATCGATCCGG TGTGCCGCTA TGGGGGGGAG
GAGTTCCTGC TCATCCTGCC CGACCGCGGG CTCAAGGAGG CGGAGATTGT GGGGCGCCGG
CTGCTCACCA ACCTGGCCTC CCGGCCGATG ACCGGGCCGT GCCACCCGCC CCTGACCCTC
ACGGCTACCG CCGGCGCCAC CGCACATCGG CCAGGGGACA ATGCCGCGCG GCTGCTGGAA
CGGGCGGATG CCGCCCTTTA TGAGGGCAAA CGGGCCGGCA AGAACCGCCT GGTCACGGCC
GCGTGA
 
Protein sequence
MATGNTTGNR GYRRDRPLAA WLIALFLALP GLAGAELQLK RLDHHDPHPP PVHLLPAGQV 
LDASRPINLG LSSRPVWLHL EISAPPPRVL HLDNPLLHDV RLLRVDSAGD LHWAPVPRRT
MARLGGGSSE AWRPIFDVTP GQYLLRIEST QALRFDLRLQ SPEALVGDWR AFTLAQGLFL
GLVLALAAYN TILLFRLRDA SYLWYVGFIL GLAGYFVFQK GLTHEFWPGL GIALNEALMF
TALSLGCASA MWFCRRFLMT EQRDPAMDRL LPIAALCSLA LAPLAWGLPG HATLLYASAI
GLLAMASYLF ATARAIRDHD FPPARWLLLA WLVMVVGALL FTLAGLGLVP HHFVTYYGFQ
IGVSLQAVLL SLALADRIGL LQAEREALLN EQAQLRLTAY TDGLTGLYNR RYLDEFLARA
VEDAERRQRE LAVVMLDLDD FKPFNDRWGH QVGDRALQHL AYLMQEVVRG IDPVCRYGGE
EFLLILPDRG LKEAEIVGRR LLTNLASRPM TGPCHPPLTL TATAGATAHR PGDNAARLLE
RADAALYEGK RAGKNRLVTA A