Gene Information |
Locus tag | Mlg_1623 |
Symbol | |
ID | 4269355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1854338 |
End bp | 1855738 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638126380 |
Product | Na+/solute symporter |
Protein accession | YP_742459 |
Protein GI | 114320776 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
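The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record, but both reproduce the reported values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1854338, 1855738)
print(length)                  # 1401
print(protein_length(length))  # 466
```

Both results match the Gene Length (1401 bp) and Protein Length (466 aa) fields of this record.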
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATG GCTTCACCCA CCTCCCCATG ACCGGCCTGG TGATCGCCCT GGCGGCGGTG GCGCTGGTTA CCTTGCTGAT CGCCCCCCGG GTCAGTACCG CCAACGGGTT CTTCCGCGGT TTCGCCGACG CGGGCCAGCC GCCCGGCGTG CTCACGCTCA CCCTCTCCCA GGTCACCACC TGGATCTTTG CCCGCTCATT GCTCAATGCG GCCATCCTGG GTTATTTCTT TGGCATCGCC GGTGTGCTCG CCTACACCGC CTACTACGGC TCCTTTGTCA CCGGCTGGCT GATCATCGAT CGCCTTCGCT TCCGCTACGG CGCCTCCAAT CTGCAGGGTT TCATGAGGGC CCAGTATGGC GGGGTGGGGA CCTTCTGCCT GAACGGACTC CTGGCGCTGC GCCTGGTCAG CGAGGTGTTC GCCAACCTGC TGGTCGTCGG GATCGTCTTC GGCGCCGCCG GCAGCGCCGG CAACACCACC GCCATGCTGT CGGTGGCCGC CCTGACCCTG GTCTATTCCC TCACCGGCGG GCTCCGGGCG TCGCTGCGAA CCGATGTCCT GCAGATGGTG CTTTTGATCG GTCTGCTCGC CGCCCTCGTG GTGGTCATGC TCCTGCATCC GCTCTTTGAC CCCGGGGCGC TGGCGGTCAG CAGCCCCGGG ATCGACAACC CGGGCTGGAT CCTGCTGGCG GTCGCGGGTC TCCAGGTGCT CAGCTACCCC ATGCATGACC CCGTCATGAT GGACCGCGGG TTCCTCGCCG ACCGCCGCAC CACACGGCGC AGTTTTCTGC ACGCCTTCTG GCTCTCGGCG GTGTGTATCC TGGCCTTCGG GCTGCTGGGG GTTTTCGCGG GCCTGCATCG CCTGGAGGGC GAAACCGTGC TGCTGACGCT GGACCGGCTG ATGGGTACCC CCGTGATGTT TCTCCTGGCG GTGGCGCTGG TCCTGTCGGC CGCCTCGACC CTGGACTCGA CCTTCTCCAG CGCGGCCAAA CTCAGCATCG CCGACATGGG ACTGGCCCGG ACCACACCCC GCAACGGCCG GATCGCAATG CTCCTGTTCT GCCTGGGCGG GCTGCTACTG GTGCTGTTCG GCACCGACGA CCTGTTCGCC GCCGTGGCCG TCAGTGGAAC CGCGTCGCTG GCGCTCACAC CGGTCATCGC CTTCTCCCTA CTGGCCGGCT GGCGGCTTAG CCGGGCCAGT CTGCTGACCT CCTTTACGCT GGCCTTCACG GGCGCTGTCG TCTACTTCCT GGAGACCTCC GGGCATATCG ACCTGCTGAC CCCCCTGACC GGGCTGGAGC ACGACTACAG TAAGCTGCTG GCGATCACGG TCACCATCCT GGTTCTTGCC ACCGCGGCGG CCTGGCTTGG CCGCCAGGGC CGCATGGAGG TCGTGAAATG A
|
Protein sequence | MSDGFTHLPM TGLVIALAAV ALVTLLIAPR VSTANGFFRG FADAGQPPGV LTLTLSQVTT WIFARSLLNA AILGYFFGIA GVLAYTAYYG SFVTGWLIID RLRFRYGASN LQGFMRAQYG GVGTFCLNGL LALRLVSEVF ANLLVVGIVF GAAGSAGNTT AMLSVAALTL VYSLTGGLRA SLRTDVLQMV LLIGLLAALV VVMLLHPLFD PGALAVSSPG IDNPGWILLA VAGLQVLSYP MHDPVMMDRG FLADRRTTRR SFLHAFWLSA VCILAFGLLG VFAGLHRLEG ETVLLTLDRL MGTPVMFLLA VALVLSAAST LDSTFSSAAK LSIADMGLAR TTPRNGRIAM LLFCLGGLLL VLFGTDDLFA AVAVSGTASL ALTPVIAFSL LAGWRLSRAS LLTSFTLAFT GAVVYFLETS GHIDLLTPLT GLEHDYSKLL AITVTILVLA TAAAWLGRQG RMEVVK
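Both sequences above are printed in blocks of 10 letters separated by spaces, so they need to be joined before most tools will accept them. The following is a minimal sketch of that cleanup, plus two simple sanity checks: GC fraction and whether a DNA string looks like a complete CDS. The helper names are hypothetical, and the stop-codon set assumes the standard stops TAA, TAG and TGA, which apply under translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def ungroup(seq: str) -> str:
    """Join the whitespace-separated 10-letter blocks into one uppercase string."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (grouping spaces are ignored)."""
    dna = ungroup(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    dna = ungroup(dna)
    return len(dna) >= 3 and len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record, used as a small demo.
print(ungroup("ATGAGCGATG GCTTCACCCA"))  # ATGAGCGATGGCTTCACCCA
```

Passing the full Gene sequence field to `ungroup` should give a string whose length equals the Gene Length field, and `gc_fraction` on it can be compared against the GC content field.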