Gene Mlg_2050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2050 
Symbol 
ID4270184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2323156 
End bp2324937 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content64% 
IMG OID638126806 
ProductNa+/solute symporter 
Protein accessionYP_742882 
Protein GI114321199 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR03648] probable sodium:solute symporter, VC_2705 subfamily 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.500291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGAT CTCAACAGAT ACTCAACCTG ACCGTGGTGG GTCTCACCTT CGCGCTCTAC 
ATCGGTATCG CGATCTGGGC CCGCACCGGT AGCACCAGCG AGTTCTACGT GGCCGGCAAG
GGGGTCAACC CCGTCGCCAA CGGCATGGCC ACGGCCGCTG ACTGGATGTC CGCCGCCAGC
TTCATCTCCA TGGCCGGCCT GATTGCCTTC CTGGGCTTTA CGGGTGGCGC CTTCCTGATG
GGCTGGACCG GTGGCTACGT GTTGATGGCC CTGCTGCTGG CCCCCTACCT GCGTAAGTTC
GGTAAGTTCA CGGTGCCCGA GTTCATCGGC GACCGGTTCT ACTCGAAGAC GGCGCGGGTG
ATTGCGGTCA TCTGCCTGCT GGCCATCTCC ATCACCTACG TTATCGGCCA GATGCGCGGC
GTCGGTATCG CCTTCTCCAA CATCCTGGAG GTGCCGCTGA CCGTGGGCCT GATCTCCGGC
ATGGTGGTGG TGTTCATCTA CGCTGTGTTC GGCGGTATGA AGGGTATCAC CTATACCCAG
ATCGCCCAGT ACTGCGTGAT GATCTTCGCC TACACCGTCC CGGCGGTCTT CATCGCCATC
GCCATCACCG GCGTGCCGAT CCCGCAGATC GGTCTCGGCA GCACCCTCGC CGGCTCCGAC
ACCTATCTGT TGGAGGCGCT GGACCAGACC CTGGTGGACT TGGGCTTTGC GGCCTACTCG
GCCACCGAGG GCGGCTTCAA CATGCTCAAC ATGTTCCTGC TGACCATCTC CCTGATGATC
GGTACCGCAG GTCTGCCGCA CGTGATCATC CGGTTCTTCA CCGTGCCGCG TATCCGCGAC
GCGCGTAAGT CCGCCGGCTG GTCCCTGGTC TTTATCGCCC TGCTCTACAC CACCGCCCCG
GCTGTGGGTG CCATGGCGAT GTGGAACCTG CTCGACACCG TGCTGGTGGA CCGCCACAGC
ATCGGTGAGG CTGAGGCCCA CACCCGGTAT GAGGACCTGC CCGACTGGAT GTACCGCTGG
GAGCAGACCG GTCTGCTGCA GTGGGAGGAC AAGAACAACG ACGGCCGTAT CCAGTACTAC
AACGACGGCA ATGCGGAGTT CGACCAGATG GCCCGCGAGC AGTGGGGCTG GGAAGGCTCG
GAGATCACCA ACCTGGACCG TGACATCATC GTGCTGGCCA ACCCGGAGAT CGCCGGTCTG
CCGACCTGGG TGATCGCCCT GGTGGCGGCG GGTGGTATCG CGGCGGCGCT GTCCACCGCG
GCCGGTCTGC TATTGGCCAT CTCCTCGGCG GTTTCGCACG ACTTGCTCAA AGGCGTGTTC
AAACCCGATA TCAGCGAGAA GAACGAGATG CTCGCGGCCC GTATCTCCAT GGCCGTCGCC
ATTATCTTCG CGGGGTATCT GGGCTTGAAC CCACCAGGCT TCGCGGCCGA GGTGGTGGCG
CTGGCCTTCG GTCTCGCCGC GGCCAGCCTC TTCCCGACCC TGATGATGGG CATCTTCTAC
CGGAAGATGA ACCGCGAAGG GGCCATCGCC GGCATGCTGG CGGGTCTGAT CGTGACCCTG
GGTTACGTCT TCACCTACAA GGGCTTCCTG TTCTTCCCGC AGCTGGCCCT GCTGCCGGAC
ACCGCCGAGT ACTGGCTGTT CGGTATCAAC CCGGCCGCAT TCGGTGTGAT CGGCGCCGTG
GCGAACGGCA TTGTCTCCTT CGCCGTGGCG AAGATGACCG CGCCGCCGCC GGCCGAGATC
CAGAAGCTGG TGGAGAGCGT GCGTGTGCCG CGCGCCGACT GA
 
Protein sequence
MEGSQQILNL TVVGLTFALY IGIAIWARTG STSEFYVAGK GVNPVANGMA TAADWMSAAS 
FISMAGLIAF LGFTGGAFLM GWTGGYVLMA LLLAPYLRKF GKFTVPEFIG DRFYSKTARV
IAVICLLAIS ITYVIGQMRG VGIAFSNILE VPLTVGLISG MVVVFIYAVF GGMKGITYTQ
IAQYCVMIFA YTVPAVFIAI AITGVPIPQI GLGSTLAGSD TYLLEALDQT LVDLGFAAYS
ATEGGFNMLN MFLLTISLMI GTAGLPHVII RFFTVPRIRD ARKSAGWSLV FIALLYTTAP
AVGAMAMWNL LDTVLVDRHS IGEAEAHTRY EDLPDWMYRW EQTGLLQWED KNNDGRIQYY
NDGNAEFDQM AREQWGWEGS EITNLDRDII VLANPEIAGL PTWVIALVAA GGIAAALSTA
AGLLLAISSA VSHDLLKGVF KPDISEKNEM LAARISMAVA IIFAGYLGLN PPGFAAEVVA
LAFGLAAASL FPTLMMGIFY RKMNREGAIA GMLAGLIVTL GYVFTYKGFL FFPQLALLPD
TAEYWLFGIN PAAFGVIGAV ANGIVSFAVA KMTAPPPAEI QKLVESVRVP RAD