Gene Mlg_0306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0306 
Symbol 
ID4270766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp346195 
End bp347262 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content72% 
IMG OID638125032 
Productputative periplasmic ligand-binding sensor protein 
Protein accessionYP_741151 
Protein GI114319468 
COG category[T] Signal transduction mechanisms 
COG ID[COG3292] Predicted periplasmic ligand-binding sensor domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCTA AGGCCCTGCA ACTCGGTGTG CCCGCCCTGG TCGCCGGCCT GCTGTACCTC 
GCCGGCACCG CCTGGGTCGG CGACCGGGAA CACTGGGTGC CGATCAGCCC CTATCCCGGC
GAACACTTCA TGGCCCTCAC CACCGCCCCT GACGGGCGCC TGTTCGCCGG GGCCCAATCC
GGCGCCGTGC TGGAACGTGA CCCGGGCGGT CCGTGGCGCC TGCACAATAC CGGCCTGCCC
GCGATCACCT GGCTGCTGCC GGACGGTGAT GGGCTGCTGG CCGGAACCAT CCGAGGGGTG
TACGCCTCGC CGGACGGGCG CCAGTGGGCA CCGGTGGAGC GGGGTCTGCC GGAGGGGCTG
TGGGTGCTGC AGTTCGAGCC TCTGCCGGAC GGCCTGCGCC TGCTTAGCCC CGACCAGGGG
CTCTACCGGC GGGATGACCA GGGGCGTTGG CACGCCGACC ACAGCCGCGG GCTGCCGGCG
GGGGTCCACA TCTATCACTA CGCCCGGGAT ACCCAGGGCG GGGACCACGT GGGGACGGTG
GCGGAAGGCG CCTATTACCG GCCAGACCCG GGGGCCGACT GGCGTCCCAA CAGCGAGGGT
CTGCACCGCC ATGCCCGTGG ATTCTCCCTG CTCCGCCGGG AGGGTGGCAT CATCCTGGGC
AGCGACCGCG GCGCCTGGTG GCAGTCCCAA CCCGGGGAAC GCTGGCAGGC CCTGGGCACC
GGACGGCATG GCTTCCGGGT GCTCGATCTG GCCGCGGACG CCCGTGGCCG GGTCTGGGCG
GCCAGCGACG AGGGGATTTG GGTCGCCGAC GAGAGCAATC GCGACGGCCG GCCGACACCC
TGGCGCAGTG TCCCCATGCG CGACGAGGGC CCACAGGCGC CGGTCAGCCG TTTTCACATC
GACGGTGATC AGCACCTGGC CGCCGCGGGC GCCATCTACC AATTGGAGCG GGACCGCGGC
TGGCAGGTCC CTATCCTGGT GATGGCCATC CTCGCCGGGG TCATGACCTG GGCTATGATG
CACGTGCCGG CGGTGACCGG CCGGCGACCA CCGCCGAACC ACCCCTGA
 
Protein sequence
MSAKALQLGV PALVAGLLYL AGTAWVGDRE HWVPISPYPG EHFMALTTAP DGRLFAGAQS 
GAVLERDPGG PWRLHNTGLP AITWLLPDGD GLLAGTIRGV YASPDGRQWA PVERGLPEGL
WVLQFEPLPD GLRLLSPDQG LYRRDDQGRW HADHSRGLPA GVHIYHYARD TQGGDHVGTV
AEGAYYRPDP GADWRPNSEG LHRHARGFSL LRREGGIILG SDRGAWWQSQ PGERWQALGT
GRHGFRVLDL AADARGRVWA ASDEGIWVAD ESNRDGRPTP WRSVPMRDEG PQAPVSRFHI
DGDQHLAAAG AIYQLERDRG WQVPILVMAI LAGVMTWAMM HVPAVTGRRP PPNHP