Gene Information |
Locus tag | Mlg_0671 |
Symbol | |
ID | 4268466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 738074 |
End bp | 740179 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638125420 |
Product | Na+/solute symporter |
Protein accession | YP_741515 |
Protein GI | 114319832 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
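The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(738074, 740179)
print(length_bp)                  # 2106
print(protein_length(length_bp))  # 701
```

Both results agree with the Gene Length (2106 bp) and Protein Length (701 aa) fields of the record.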
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCCA AGGCGGTATG GCTGTTCATC TTTGTGGCCA TCTACTGGGG CTACTGCGTG TTCTGGGGGG TGCGGGGCGC CATGGAGGCG CGGACCGCCA CCGACTACTT CGTGGCCGGC CGCAGTGTCT CCATGTGGGT GTTTATCCTG GCCGCCACCG CCACCTCCTT CTCCGGCTGG ACCTTTATTG GCCACCCGGG GCTGATCTAC CAGGATGGCC TGCAGTACGC CTACGCCTCC TTCTACGCCA TCACCATTCC GTTCACCGGG CTGCTGTTCC TGAAGCGCCA GTGGATGATC GGCAAGCGCT GGGGCTTCGT CACCCCGGGG GAGATGTTTG CCACCTACTA CCGCTCCGAT ACCCTGCGCA TCCTCATCGT GCTGGTGGCG CTGATCTTCG CCGTGCCCTA TCTCGGGGTG CAGTTGCGGG CCTCGGGCTT TCTGTTCAAC GTGCTCACCG ACGGCATGCT GGGGGTGGAG GTGGGCATGT GGTTGCTGTC CGCCATCATC GTCTTCTATG TGGCCTCCGG GGGGCTGCGG GCAGTGGCCT ACGTGGATGC GCTGCAGGCG ATATTGCTGG CGGCCGGCAT CGTCATCGTG GGTGGGGTCG CGGTCTACTT CATGGGCAGC TGGGGGGTGT TCGTGGAGGG CATCGCGCGG CTGGCGGAGC AGTCCGTGGC CAGCGGCGAG TACGTTACCC CCGCCGGACT GAGCGGCTAT GTCGCCGTGC CGGGTGTGAT CCAGTTCGTC AGTGACGGGC CCTCGGCCAC CGGCGGCGCC TGGACCGGGC TGATGATCCT GACCTACATG TTCGCCCTGA TGGGCATTCA GTCCAGTCCG GCCTTCAGCA TGTGGGCCTT CTCCAACGCC AACCCGCGCC CCTTTGCCCC GCAGCAGGTC TGGGCCTCGG CGGTGGGTAT CGGCCTCATC CTGTTCACCT TCACCGCCAT CCAGGGCATG GGTGGCCATC TGCTGGGTGC CAACCTGGGC TTCACCGCCG ACAACCCTGA CCTGTTCGAC AGCCGTGATC GGGTGGTGTT GCAGCGCGGA CTGATCGCCA CCGCTGAGCC GGCGCTCTCC CGCGACGAGG TGGAGGCCCG GATCGAGTCC GGATTGGCGG CGCTCGCCGC CGCGGGGCCG GACGAGACGG TGGACCTGGC GGGCGACCAC GGGTGGGTGG ACCTGGAGGC CCACGGCGGC GGCGACCCGG CGCTGGTGCC CCAGATGATC AGCCTGCTGG AGGTGGGGGC CCCCTGGCTG GTAGGCCTGC TGGCGGTCTG TGCCCTGGCC GCCATGCAGT CCACCGGCGC TGCTTATATG TCTACCTCGG GGGGCATGAT CACCCGCGAT ATTCTGCGCC GGTACCTGAT ACCGGAGGCG GATCATGCCA CCCAGAAACT CTGGGGGCGG GTGTTCGTGC TGCTGATTGT CGCCGCCGCC TTGGTCACCG CCACCGTCGC CACCGATGCC CTGGTGTTGC TGGGCGGCTT GGCTGTGGCC TACGGCTTCC AGATGTGGCC GGCGCTGATC GGCTTGTGCT TTTGGCCCTG GCTTACTCGC CAGGGGGTGG TCGCCGGCCT GATCGTTGGG CTGGTGGTGG TCACGCTGAC AGAGAACATC GGTGTGCAGT TGCTGGCCGC CCTCGGCCTG GAGTGGTGGG GCCGGTGGCC CTGGACGCTG CACTCGGCGG GATGGGGCAT TCTGTTCAAT ATCACCACCG CGATCCTGGT CTCCGCCATC ACCCAGGATC GCGGTGAATT GGAACACCGG ATGGTCTATC ACCACTTCCT GCAGGAGCAT 
GCTGGTCTGC CGGTGGCCAA GCGCCATCTG ATCCCGGTGG CCTGGATCCT CACCATTGGC TGGTTCTTCT TCGCTGTGGG ACCGGGCACG GTCATCGGCA ACACCCTGTT CGGCTACCCC CAGTCCTTCC TCTTCGGGCT CCCTTCCATC TGGGTTTGGC AGCTCCTTTT CTGGGCCGCC GGGTGTCTGC TGCTCTACTT CCTGGCCTAC CGCATGGAAC TGAGCACCGT GCCACGCAAA GAGGTGGAGG TGCTGGCCGA GGATGAGCCC GGGGACATTC GTCTCGACAT CCACCGCCCG TCGTAG
|
Protein sequence | MEAKAVWLFI FVAIYWGYCV FWGVRGAMEA RTATDYFVAG RSVSMWVFIL AATATSFSGW TFIGHPGLIY QDGLQYAYAS FYAITIPFTG LLFLKRQWMI GKRWGFVTPG EMFATYYRSD TLRILIVLVA LIFAVPYLGV QLRASGFLFN VLTDGMLGVE VGMWLLSAII VFYVASGGLR AVAYVDALQA ILLAAGIVIV GGVAVYFMGS WGVFVEGIAR LAEQSVASGE YVTPAGLSGY VAVPGVIQFV SDGPSATGGA WTGLMILTYM FALMGIQSSP AFSMWAFSNA NPRPFAPQQV WASAVGIGLI LFTFTAIQGM GGHLLGANLG FTADNPDLFD SRDRVVLQRG LIATAEPALS RDEVEARIES GLAALAAAGP DETVDLAGDH GWVDLEAHGG GDPALVPQMI SLLEVGAPWL VGLLAVCALA AMQSTGAAYM STSGGMITRD ILRRYLIPEA DHATQKLWGR VFVLLIVAAA LVTATVATDA LVLLGGLAVA YGFQMWPALI GLCFWPWLTR QGVVAGLIVG LVVVTLTENI GVQLLAALGL EWWGRWPWTL HSAGWGILFN ITTAILVSAI TQDRGELEHR MVYHHFLQEH AGLPVAKRHL IPVAWILTIG WFFFAVGPGT VIGNTLFGYP QSFLFGLPSI WVWQLLFWAA GCLLLYFLAY RMELSTVPRK EVEVLAEDEP GDIRLDIHRP S
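The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. It assumes the sequence contains only the letters A, C, G and T plus whitespace, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence given in block format."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input (not the full gene): 6 of 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Applying `gc_percent` to the full Gene sequence field would give a value to compare against the GC content reported in the record.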