Gene Mlg_2672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2672 
Symbol 
ID4268805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3024507 
End bp3025547 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID638127431 
Productsensory transduction protein kinase AlgZ 
Protein accessionYP_743502 
Protein GI114321819 
COG category[T] Signal transduction mechanisms 
COG ID[COG2972] Predicted signal transduction protein with a C-terminal ATPase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000156498 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGTGT TCGAACGAGG GGGTTCGTCC GTCAGGCGAT CGGCGATTCT GCCCGACTTC 
TGCCACCTGC AGACGGTGTT GGCCGTGGTG CTGGCCGGCC AGCTATTGGC CTTTGTGCTC
TTTCTGGCGC GGCCCGCGAC GCAGTGGGAC TGGGCGACCC TGGGGCTGAT CTCGCTCTAT
GTCCAATGGG TGGTGCTGCT GAGCACGGCG CTGCTCTGCC TGTCCCGCCG GCCACTGTCC
CGGGTCAGTC CCGAGGCGGC GGGGCTGCTG GCCTGGCTGG GTATCATCGC GGTGGCTGCG
GTGACCGCCG AGCTGGCCTG GCGCAGTACC GGGGGGGTGT TGGGCGCGGA GCGCTGGGCA
TTGGTTCTGC GGTCGGTGGC CATCAGCGGC ATTATTGCGG CCCTGGTGCT CCGCTACCTC
TATCTGCAGG GGGAGTGGCG CCGGGGGCTG CAGGCCGAGG CCCGGGCCTC CATGCAGGCC
CTGCAGTCGC GCATCCGCCC CCACTTCCTG TTCAACACCC TTAACACCAT CGCCGCGATG
CTCCGCCAGG CCCCGGAGCG CGCCGAGCAG GCGCTGCTGG ATCTTGCCGA TCTGTTTCGT
GCCGGCCTGC GCGAGGTCGG CGGTTGGTCC ACGCTGGACG AGGAGCGGGC CCTGACCGAG
CGCTACTTGC GTCTGGAACA GCTCCGGCTG CAGGAGCGGC TGCGGTTGGA CTGCGACTGG
GACGGGTTGC CGGGCCAGGC CCGGGTGCCC TCGCTGATCT TGCAGCCACT GGCGGAGAAC
GCCGTGGTGC ATGGCATCGA GCAACTGCCC GCAGGCGGTG AGCTGCGGCT GAGGGGGCGA
CGGGAGGGCG ATACCCTGGT GCTGGAGCTG GAAAACCCCG TCCCGGCCGG TGGCTCCCTT
CGCGGCGGCC ACGGCCTCGG GCTGGAGAGC GTGCGGCGCC GGATGCGCTA CGCCTTCGGT
GCCGCGGCCG ATCTGGAGGT GACAGAGCGT GCCGGCCGTT TCCACGTAGT GCTCCGTTGG
CCCTGGCAGG AGGCAGGATA G
 
Protein sequence
MAVFERGGSS VRRSAILPDF CHLQTVLAVV LAGQLLAFVL FLARPATQWD WATLGLISLY 
VQWVVLLSTA LLCLSRRPLS RVSPEAAGLL AWLGIIAVAA VTAELAWRST GGVLGAERWA
LVLRSVAISG IIAALVLRYL YLQGEWRRGL QAEARASMQA LQSRIRPHFL FNTLNTIAAM
LRQAPERAEQ ALLDLADLFR AGLREVGGWS TLDEERALTE RYLRLEQLRL QERLRLDCDW
DGLPGQARVP SLILQPLAEN AVVHGIEQLP AGGELRLRGR REGDTLVLEL ENPVPAGGSL
RGGHGLGLES VRRRMRYAFG AAADLEVTER AGRFHVVLRW PWQEAG