Gene Mlg_1053 details


Gene Information

Locus tag: Mlg_1053
Symbol:
ID: 4270526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1225998
End bp: 1227557
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 66%
IMG OID: 638125805
Product: diguanylate cyclase
Protein accession: YP_741896
Protein GI: 114320213
COG category: [T] Signal transduction mechanisms
COG ID: [COG1639] Predicted signal transduction protein
        [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
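
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length (the gene sequence in this record ends in TAG). The function name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 1225998, End bp 1227557
print(cds_lengths(1225998, 1227557))  # (1560, 519)
```

Under these assumptions the result agrees with the Gene Length (1560 bp) and Protein Length (519 aa) fields.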


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.258962
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCCGG ATTTGAAACG GCGCCTGAAA GCCACCCCTG CTCTCCCTTC TCTCCCACCG 
GCAGCGGTCC GTATCCTGGA CATTGCGCGC AGCAACGAGC CCCAGCCGCG CGAACTGTAC
GAGGCCATCC AACTCGATCC GGCCCTTTCC GCGCGCCTGC TGCGGGCCAG CAATTCGCCG
GCCTTTCGCC GGGTGGGCGA GGTGCGCTCG CTCCGCGAGG CCCAGATGGT GCTGGGCTAC
GATGCCACCC TCACCCTCAG TCTGTCCTTC TGTCTGTTCA CCACCATCCG CCGTCTGGAA
GCCGGTCGCC TGGACTATGA ACACTTCTGG CGCCGCTCCG CCTGCGTCGC CAGTGTAGCC
GCACTGATTG CGAAATCGCG CGGGTTGGCC GGTGCAGAGG CAATGCTCGC CGGGCTGTTA
AAGGATATCG GTGTTCTGGT GCTGGACAGC ATCACCCCGG CCGACCACCC CGGCTGGACG
AGTGATGAGC AGGCGGCGGC ACAGCTTGCG GCGGAGGTCG GGGCCTGGCT GATCAATGAA
TGGCGGCTGC CGGACTACCT GGCGAGTGTC AGTTGGGCAA CCCGGGACCT GGACCGGCGG
CTGCCGCATA TCCCCGACGA AGACCAAGCG CTGGTAGAGG TCGTCGGGGT GGCGGACGAG
CTGGTGGGGA TCTGGCTTGC CGACGATACC GGCGGGCAGC TGGACAATAG CCGGGACCTG
GTCGCCCGGC GCTGGGCCTG GTCACCAGAG CAACTCTACC AACTGATCAC CGAGGCCGAC
GAGGCGATCA AGGGCAACCT TGCCCTGCTG GATCTCAGCC AATTCGACGA GCAGATCATG
GTCGGCGTAA TGGACCAGGC GCGGGAGGTG CTACTGTTCC AGAATCTGTC CCAATTCAAA
ACGACCCATG AGGCCAACAG CCAGGCCGAC GCCTTACGCG AACGCGCCCG TCAGCTGGAG
GAATCCAGCC TGCGCGATGA ACTGACCGGC GTCTGGAACC GCCGCAAGCT CTTCTCCTTC
CTGGAGGTCA CCCTGGAGGA TGCCCGCACC GCCGGCACGC CGGTCACCGT CGCCTTCGCC
GACCTGGATC GCTTCAAACC GGTAAACGAC GACTACGGCC ATGCAGCGGG CGATGTTGTC
CTCAACCATT TCGCCCAGAA ACTCTCCCGG CTGGTGCGCG GCGACGACCT AGTCGCCCGC
TACGGCGGCG AGGAGTTCGC TATTGTCATG CCCAACACCG GTGGCGAAAC CGCCAAGACG
GCGCTGACCC GGGTGCTGAA GCAAGCCGCC GAGACGCGCT ATGCCGTCCG CGAAGGCAAG
TCCCTGCAGG TGACAGCCTC TATAGGCATG CGCAGCGTCG ACCCCCGCAG CGAACCGGAG
GTGTCAGCCC GCGAACTGCT TGCCGAGGCG GATGAGGCGG CCTACCACGC CAAGGCGATC
GGCCGGGCAG CGCTCGTCGC CTCGGATCCG GACGGCCTCA GGGTGGTGCA CCGGCTGGGG
CAGCCATCGG TGATGGGCCG ACTGGGCCAG ACGCTGGTCA GCGCCTTCAG GCGGCGCTAG
 
Protein sequence
MDPDLKRRLK ATPALPSLPP AAVRILDIAR SNEPQPRELY EAIQLDPALS ARLLRASNSP 
AFRRVGEVRS LREAQMVLGY DATLTLSLSF CLFTTIRRLE AGRLDYEHFW RRSACVASVA
ALIAKSRGLA GAEAMLAGLL KDIGVLVLDS ITPADHPGWT SDEQAAAQLA AEVGAWLINE
WRLPDYLASV SWATRDLDRR LPHIPDEDQA LVEVVGVADE LVGIWLADDT GGQLDNSRDL
VARRWAWSPE QLYQLITEAD EAIKGNLALL DLSQFDEQIM VGVMDQAREV LLFQNLSQFK
TTHEANSQAD ALRERARQLE ESSLRDELTG VWNRRKLFSF LEVTLEDART AGTPVTVAFA
DLDRFKPVND DYGHAAGDVV LNHFAQKLSR LVRGDDLVAR YGGEEFAIVM PNTGGETAKT
ALTRVLKQAA ETRYAVREGK SLQVTASIGM RSVDPRSEPE VSARELLAEA DEAAYHAKAI
GRAALVASDP DGLRVVHRLG QPSVMGRLGQ TLVSAFRRR