Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0627 |
Symbol | |
ID | 4270609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 676109 |
End bp | 677110 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638125374 |
Product | bile acid:sodium symporter |
Protein accession | YP_741471 |
Protein GI | 114319788 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0125709 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCT TTCAGATCCT GTCCAAGCTT ACCGGTAAGC TGACTATCGC CATCCCCGCG ATGATGGCGA TGGGGTTCGT TTTCGGGGCC GTGGCGCCGA GCGACTGGCT GCAGGCGCTT ATTCTGCCTC TCACCTTCCT GATGGTTTAC CCGATGATGG TCAATCTGAA GGTGCGTTCG GTCCTGGAAG GGGTGGACGG TCGCGCCCAG GGCCTGGCGC TGCTGGTGAA TTTTGGCGTC ATTCCCTTCA TCGCCTTTGC CATCGGGCTG CTGTTCCTGG CCGATCACCC CTATTTCGCC CTGGGGCTGC TGCTGGCGGC GCTGCTGCCG ACCAGCGGCA TGACCATTGC CTGGACCGGC TTTGCCAAAG GCAACGTGCC CGCCGCCATC AAGATGACGG TGATTGGCCT GCTTGTGGGC TCGGTGGCGA CGCCCTTTTA CGTGCAATGG CTGATGGGCG CCGAGGTGCC GGTGGACCTG CTGACGGTCT TCCGCCAGAT CGCCATTATC GTGCTGCTGC CACTGGTGGC CGGGCAGATC ACCCAGCACT ACCTGCGCCG TCGCTACGGC CAGGAGGGCT ACCAGAGGCG GTGGGCCCCG CGTTTCCCGC CGCTCTCCTC CCTGGGGGTG CTGGGTATCG TCTTCGTGGC CATGGCCCTG AAGGCCGGGG ACCTGATCGC CTCTCCCGGC GACCTGCTGA TTATCGCCGT GCCGGTGGTC CTGCTTTACC TGATCAACTA CACCCTCAGC ACCGGCATTG CCCGCGCCGT GTTGGGGCGG GGCGAGGGGA TTGCGCTGGT CTACGGCACG GTGATGCGCA ATCTCTCCAT TGCCCTGGCA TTGGCCATGA ACGCCTTTGG CGAGGCAGGT GCCGATGCGG CGCTGGTGGT TGCGCTGGCC TTCATCATCC AGGTGCAGTC GGCGGCCTGG TACGTCAAAC TGACGGACCG GGTGTTTGGT AGCGCACCGG AAACGGACGG GGTCGGCGAA CAGGCGCGAT GA
|
Protein sequence | MNAFQILSKL TGKLTIAIPA MMAMGFVFGA VAPSDWLQAL ILPLTFLMVY PMMVNLKVRS VLEGVDGRAQ GLALLVNFGV IPFIAFAIGL LFLADHPYFA LGLLLAALLP TSGMTIAWTG FAKGNVPAAI KMTVIGLLVG SVATPFYVQW LMGAEVPVDL LTVFRQIAII VLLPLVAGQI TQHYLRRRYG QEGYQRRWAP RFPPLSSLGV LGIVFVAMAL KAGDLIASPG DLLIIAVPVV LLYLINYTLS TGIARAVLGR GEGIALVYGT VMRNLSIALA LAMNAFGEAG ADAALVVALA FIIQVQSAAW YVKLTDRVFG SAPETDGVGE QAR
|
| |