Gene Mlg_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2066 
Symbol 
ID4270452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2342334 
End bp2344088 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content68% 
IMG OID638126822 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_742898 
Protein GI114321215 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.684851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.148917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAC GACCCAACCC CGAATCGTCG CCGGACCCCG AGCCTGGGGC GCCGATGGCC 
GACCAACCGG CCTTCCGACT GGCCCCGGTG GCGCAATTGA TCCTCGATGA AGGGGGCACC
ATCCATGCCC TGAACGAGCA GGCCCGGGCC TTTCTCGAGG CCTCGGCGGA GCCCCTCCGG
CAGCGGCCAT TCTGGCAATT GATGGTGGAC GGGGACGCCG TACGTCTGCA GCGTGCGCTG
GCGAACCAGC GTCCGGAGGC GGGCGTGCGC CGGTACTGCG GCCTGCAGCT TGCCGCGGGC
TGCCGTGTGG ATCTGTCGGT GCGCCATGTG GGCGCCGGGC GACTGCTCTG TGCCCTGGAG
CATGTGCCGC CCGGTACCGG GGATGACAAG GAGGTGGCGC GGCTGGCCCG TGAAGTCCGG
GAGCAGCGCA CCGCGCTGGA GCGCCTGGCC CACTACGATG CCCTGACAGG GCTCCCCAAC
CGATGGTTTT TTGAACGGTT TCTCGACAAC CAGTTGCAGC GGGCGGGGGA TAACGACGGG
CATCAGATCG CGGTGCTGGT GCTGGATCTG GATCACTTCA AGGCCGTTAA CGACCGGTTC
GGCCACAGTG AGGGGGACCA GGTGCTCAAA GAGGCCACCC GGCGTATCCT CGCCTGCGTG
CGCGATGTGG ACATGGTCTC TCGCATCGGT GGCGACGAGT TTGTGGTGGT CCTGGGCCGG
TTGCGCGGTC GTGGCGGTGC CAGCCGGGTG GCGCGGGCGC TGATCCGCTC GCTGAGCCGG
CCCTTTACCG TGGGCGCGCG CAGCCACCGG CTCACCGCCT CGGTGGGTGT GGCCTTCTAC
CCCAAGGACG GGGAAACGGT GGAGGACCTG ATCCGGCGGG CCGATCTGGC GATGTTTCAA
GCCAAGGACC GCGGCCGCAA TGGCTGGGCG GCCTTTGACT ACGAGCATGA GCACCACTTG
TTGGAGGAGG ATCACTGGAA GGGGCTACTC TGGCGCACCA TCGACGCCCC TGGTCGTTAT
CTCACCATGA ACTACCAGCC GATCCTGCGC CTGCGCGACG GCCGGCCCCG GCCCTGGGCG
CTGGAGGCCC TCCTCCGGGT CCAGGGCCAC GACCAGCAGA CCCTAGACAC CGGCACCCTG
GTCCGGGCGG CCGAGGAACA CCACATGATA CTGCCGCTGG GGGAGGCCAT CTTCGCGCGG
ATCTGCGCCG AGGTGGCGGC GATGCGCCAG GAGGGCATGA GGCTGCCGGT GACGGTCAAC
CTTTCCGCGG ATCAGTTCCT CGATCCGTGC CTGGTGGGGC GGATGGATAG GGTCTGCCAG
GCCCACGGCG TGCCCATGAA TGCCCTCTGC TTCGAGATCA CCGAGACGGC GCTGGTACGC
AATCTGTCGG GGGCCCGGGA GATGGTCCAG GCGCTGCAGG CGGCGGGGGC CCTGATCCTG
CTCGATGACT TCGGCAGTGG GTACGCCTCG CTCTCCCAGT TGCACAGCCT GCCGGTGGAT
GTGCTCAAGA TCGATGCCGG CTTCATCGCG GAGGTCGGGC GCAGCCCCCA GGCCGAGGCC
CTGATCCGCG CCATCCTGGC GATGGCGCGC GCCCTGGGTA TCGACGTCGT GGCCGAAGGG
GTGGAGACCA ACGCCCAGCG GGTCTGGTTG GAGCGCGAGG GGGTGCAGGG CCTGCAGGGC
TACTATTTCT CCCGGCCGCT GCCCCGTGAA GCGGTTCAGG ACTGGGTGCG CATCCAAGGG
CTCGCGGATG ACTGA
 
Protein sequence
MEQRPNPESS PDPEPGAPMA DQPAFRLAPV AQLILDEGGT IHALNEQARA FLEASAEPLR 
QRPFWQLMVD GDAVRLQRAL ANQRPEAGVR RYCGLQLAAG CRVDLSVRHV GAGRLLCALE
HVPPGTGDDK EVARLAREVR EQRTALERLA HYDALTGLPN RWFFERFLDN QLQRAGDNDG
HQIAVLVLDL DHFKAVNDRF GHSEGDQVLK EATRRILACV RDVDMVSRIG GDEFVVVLGR
LRGRGGASRV ARALIRSLSR PFTVGARSHR LTASVGVAFY PKDGETVEDL IRRADLAMFQ
AKDRGRNGWA AFDYEHEHHL LEEDHWKGLL WRTIDAPGRY LTMNYQPILR LRDGRPRPWA
LEALLRVQGH DQQTLDTGTL VRAAEEHHMI LPLGEAIFAR ICAEVAAMRQ EGMRLPVTVN
LSADQFLDPC LVGRMDRVCQ AHGVPMNALC FEITETALVR NLSGAREMVQ ALQAAGALIL
LDDFGSGYAS LSQLHSLPVD VLKIDAGFIA EVGRSPQAEA LIRAILAMAR ALGIDVVAEG
VETNAQRVWL EREGVQGLQG YYFSRPLPRE AVQDWVRIQG LADD