Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2709 |
Symbol | |
ID | 4269500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3074422 |
End bp | 3075537 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638127470 |
Product | bile acid:sodium symporter |
Protein accession | YP_743539 |
Protein GI | 114321856 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATT CGCAGCCGAC CGCCGCCAAT GACACCCCCT CCACCAAGGA GGGTATGGGC TTGTTCGAGC GCTACCTCAC CCTTTGGGTG GCGCTGGGCA TGATCGCCGG CGTGCTGCTG GGGCAGTTTC TGCCGGTGGT GCCGGATACC CTTGCACGCT TCGAGTACGC CCAGGTCTCC ATCCCCGTCG CCATCCTGAT CTGGGCCATG ATCTACCCGA TGATGGTGCA GATCGACTTC GGTGCCATCC TCGGTGTGCG CCGGCAGCCC AAGGGGCTGG TGATCACCAC CACTGTGAAC TGGCTGATCA AGCCTTTCAC CATGTTCGCC ATCGCCTGGT TCTTCCTGAT GGTGGTCTTC CAGCCCTTCA TCCCGGAGGA CCTGGCCCGG GAGTACCTGG CCGGGGCCAT CCTGCTGGGC GCGGCCCCCT GCACCGCCAT GGTCTTCGTC TGGAGCTACC TCACCCGGGG CGATGCCGCC TACACCCTGG TGCAGGTCTC GGTGAACGAC CTGATCATGC TGTTCGCCTT TGCCCCCATC GTGGTGCTGC TGCTGGGCAT CTCTGACATC CAGGTGCCGT GGGATACCGT GGCGTTGTCG GTGTTCCTGT ACATCGTCAT CCCGCTGGCT GCCGGCTACC TGACCCGCGT GATGCTCATC AAACACAAGG GCGTGGAGTG GTTCGACAAC GTCTTCATGA AGCGCCTGGC GCCGGTGACG CCCATCGGGC TGATCCTCAC CCTGATCCTG CTGTTCGCCT TCCAAGGCGA GGTGATCCTG AACAACCCGC TGCACATTCT GTTGATCGCG ATCCCGCTGA TCATCCAGAC CTTCCTGATC TTCTTCATTG CCTATCGTTG GGCGAAGGCG TGGAAGGTGC AGCACTCGGT GGCGGCACCC GGGGCCATGA TTGGCGCCAG CAACTTCTTC GAGCTGGCCG TGGCCGCGGC CATCGCCCTG TTCGGCCTGC AATCGGGGGC GGCGCTGGCC ACCGTGGTGG GCGTGCTGGT GGAGGTACCG CTGATGCTGG CGCTGGTCCG CATCGCCAAT AAAACCAAGC ACTGGTTCCC GGAAGAGACG CAGCCGGGGC TGGCCCCCGC CTCGGGCAAG GCATGA
|
Protein sequence | MSDSQPTAAN DTPSTKEGMG LFERYLTLWV ALGMIAGVLL GQFLPVVPDT LARFEYAQVS IPVAILIWAM IYPMMVQIDF GAILGVRRQP KGLVITTTVN WLIKPFTMFA IAWFFLMVVF QPFIPEDLAR EYLAGAILLG AAPCTAMVFV WSYLTRGDAA YTLVQVSVND LIMLFAFAPI VVLLLGISDI QVPWDTVALS VFLYIVIPLA AGYLTRVMLI KHKGVEWFDN VFMKRLAPVT PIGLILTLIL LFAFQGEVIL NNPLHILLIA IPLIIQTFLI FFIAYRWAKA WKVQHSVAAP GAMIGASNFF ELAVAAAIAL FGLQSGAALA TVVGVLVEVP LMLALVRIAN KTKHWFPEET QPGLAPASGK A
|
| |