Gene Mlg_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2042 
Symbol 
ID4270176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2312667 
End bp2313725 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content66% 
IMG OID638126798 
Productdiguanylate phosphodiesterase 
Protein accessionYP_742874 
Protein GI114321191 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.281797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGTC CAGGTTGCGA GAAGGTCCCG AAGCAACCGC AGGGTGCGGG AACACTCTAC 
ATCCTGCCCG CCCAACCGCA CGCGGCCGCC ACCATCGTCG AGGCCCTGGT GGGCGACGGC
CTGACGCCAG AGCAGCACGG CGACTCGATC CTAGCCGTGC CGGTGGAGCC CGGGGCGTTG
AACCGGATCA TGGCGCTGCT CGGCGGTGCC CTGACGCCAC AGGAGCAGGC CGCCTGCCAG
GCGAATTTCT TCGCCGACGG CGGCGACTGG TCCCCCGAGA CGCTGCTGGC GACCCGCCCG
CTGGACGTGC TGGTGGCGCG CAGTCAGTTT GTCTGGCTGA ACGAACTGAT CGAGGACCTG
CGGCTGCAGA TGCACTTTCA GCCCATCGTC CACGCCGATG ATGGCCGCAC GATCTTCGCC
TACGAATCGC TGGCCCGCGG CCTGGACCAC GCGGGGCAAT TGATCTCCCC GGGCCGGCTC
TTCCCCGCAG CGCGGGCGGC CAATCTCCTG TTTCACCTGG ATCGGGCGGC ACGCATCAGC
GCCATCCGAC AATCCCACCA GCACCGTATC CAGCAGCCGG TGTTCATCAA CTTCAATCCC
ACCGCCATCT ACGACCCGGG CTTCTGCCTG CGTACCACCT TCAAGGAGGT CCGGCGGCTG
GGCATCGACC CGGCCAATTT CGTCTTCGAG GTGGTGGAGA CCGACTCGGT GACCGACGAG
ACCCACCTCA AGTCCATCCT CGAGGAGTAC CGCCGGCAGG GCTTTCGTAT CGCACTGGAT
GACCTGGGGG CGGGATTCGG CTCACTGACA CTGCTGAAAC AGATCCGCCC CGACTTCATC
AAACTGGACC GGGAACTGGT GGACGGGGTG CACTGGGACA ACTACAAGGC GTCCATCACC
GCGCACCTGA TCCGCATGGG CAAGGATCTG CAGGTCCGGA TCATCGCCGA GGGCATCGAA
CAGCCGGAGG ATTGGCACTG GCTGCGCGAG CGCGGCGTGG ACTACGTCCA GGGTTTCCAC
TTCGCCCGGC CCGCCTCACC GCCTCCCGTC CTGGGCTGA
 
Protein sequence
MSCPGCEKVP KQPQGAGTLY ILPAQPHAAA TIVEALVGDG LTPEQHGDSI LAVPVEPGAL 
NRIMALLGGA LTPQEQAACQ ANFFADGGDW SPETLLATRP LDVLVARSQF VWLNELIEDL
RLQMHFQPIV HADDGRTIFA YESLARGLDH AGQLISPGRL FPAARAANLL FHLDRAARIS
AIRQSHQHRI QQPVFINFNP TAIYDPGFCL RTTFKEVRRL GIDPANFVFE VVETDSVTDE
THLKSILEEY RRQGFRIALD DLGAGFGSLT LLKQIRPDFI KLDRELVDGV HWDNYKASIT
AHLIRMGKDL QVRIIAEGIE QPEDWHWLRE RGVDYVQGFH FARPASPPPV LG