Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2050 |
Symbol | |
ID | 4270184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2323156 |
End bp | 2324937 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638126806 |
Product | Na+/solute symporter |
Protein accession | YP_742882 |
Protein GI | 114321199 |
COG category | [R] General function prediction only |
COG ID | [COG4147] Predicted symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.500291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGAT CTCAACAGAT ACTCAACCTG ACCGTGGTGG GTCTCACCTT CGCGCTCTAC ATCGGTATCG CGATCTGGGC CCGCACCGGT AGCACCAGCG AGTTCTACGT GGCCGGCAAG GGGGTCAACC CCGTCGCCAA CGGCATGGCC ACGGCCGCTG ACTGGATGTC CGCCGCCAGC TTCATCTCCA TGGCCGGCCT GATTGCCTTC CTGGGCTTTA CGGGTGGCGC CTTCCTGATG GGCTGGACCG GTGGCTACGT GTTGATGGCC CTGCTGCTGG CCCCCTACCT GCGTAAGTTC GGTAAGTTCA CGGTGCCCGA GTTCATCGGC GACCGGTTCT ACTCGAAGAC GGCGCGGGTG ATTGCGGTCA TCTGCCTGCT GGCCATCTCC ATCACCTACG TTATCGGCCA GATGCGCGGC GTCGGTATCG CCTTCTCCAA CATCCTGGAG GTGCCGCTGA CCGTGGGCCT GATCTCCGGC ATGGTGGTGG TGTTCATCTA CGCTGTGTTC GGCGGTATGA AGGGTATCAC CTATACCCAG ATCGCCCAGT ACTGCGTGAT GATCTTCGCC TACACCGTCC CGGCGGTCTT CATCGCCATC GCCATCACCG GCGTGCCGAT CCCGCAGATC GGTCTCGGCA GCACCCTCGC CGGCTCCGAC ACCTATCTGT TGGAGGCGCT GGACCAGACC CTGGTGGACT TGGGCTTTGC GGCCTACTCG GCCACCGAGG GCGGCTTCAA CATGCTCAAC ATGTTCCTGC TGACCATCTC CCTGATGATC GGTACCGCAG GTCTGCCGCA CGTGATCATC CGGTTCTTCA CCGTGCCGCG TATCCGCGAC GCGCGTAAGT CCGCCGGCTG GTCCCTGGTC TTTATCGCCC TGCTCTACAC CACCGCCCCG GCTGTGGGTG CCATGGCGAT GTGGAACCTG CTCGACACCG TGCTGGTGGA CCGCCACAGC ATCGGTGAGG CTGAGGCCCA CACCCGGTAT GAGGACCTGC CCGACTGGAT GTACCGCTGG GAGCAGACCG GTCTGCTGCA GTGGGAGGAC AAGAACAACG ACGGCCGTAT CCAGTACTAC AACGACGGCA ATGCGGAGTT CGACCAGATG GCCCGCGAGC AGTGGGGCTG GGAAGGCTCG GAGATCACCA ACCTGGACCG TGACATCATC GTGCTGGCCA ACCCGGAGAT CGCCGGTCTG CCGACCTGGG TGATCGCCCT GGTGGCGGCG GGTGGTATCG CGGCGGCGCT GTCCACCGCG GCCGGTCTGC TATTGGCCAT CTCCTCGGCG GTTTCGCACG ACTTGCTCAA AGGCGTGTTC AAACCCGATA TCAGCGAGAA GAACGAGATG CTCGCGGCCC GTATCTCCAT GGCCGTCGCC ATTATCTTCG CGGGGTATCT GGGCTTGAAC CCACCAGGCT TCGCGGCCGA GGTGGTGGCG CTGGCCTTCG GTCTCGCCGC GGCCAGCCTC TTCCCGACCC TGATGATGGG CATCTTCTAC CGGAAGATGA ACCGCGAAGG GGCCATCGCC GGCATGCTGG CGGGTCTGAT CGTGACCCTG GGTTACGTCT TCACCTACAA GGGCTTCCTG TTCTTCCCGC AGCTGGCCCT GCTGCCGGAC ACCGCCGAGT ACTGGCTGTT CGGTATCAAC CCGGCCGCAT TCGGTGTGAT CGGCGCCGTG GCGAACGGCA TTGTCTCCTT CGCCGTGGCG AAGATGACCG CGCCGCCGCC GGCCGAGATC CAGAAGCTGG TGGAGAGCGT GCGTGTGCCG CGCGCCGACT GA
|
Protein sequence | MEGSQQILNL TVVGLTFALY IGIAIWARTG STSEFYVAGK GVNPVANGMA TAADWMSAAS FISMAGLIAF LGFTGGAFLM GWTGGYVLMA LLLAPYLRKF GKFTVPEFIG DRFYSKTARV IAVICLLAIS ITYVIGQMRG VGIAFSNILE VPLTVGLISG MVVVFIYAVF GGMKGITYTQ IAQYCVMIFA YTVPAVFIAI AITGVPIPQI GLGSTLAGSD TYLLEALDQT LVDLGFAAYS ATEGGFNMLN MFLLTISLMI GTAGLPHVII RFFTVPRIRD ARKSAGWSLV FIALLYTTAP AVGAMAMWNL LDTVLVDRHS IGEAEAHTRY EDLPDWMYRW EQTGLLQWED KNNDGRIQYY NDGNAEFDQM AREQWGWEGS EITNLDRDII VLANPEIAGL PTWVIALVAA GGIAAALSTA AGLLLAISSA VSHDLLKGVF KPDISEKNEM LAARISMAVA IIFAGYLGLN PPGFAAEVVA LAFGLAAASL FPTLMMGIFY RKMNREGAIA GMLAGLIVTL GYVFTYKGFL FFPQLALLPD TAEYWLFGIN PAAFGVIGAV ANGIVSFAVA KMTAPPPAEI QKLVESVRVP RAD
|
| |