Gene Mlg_2709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2709 
Symbol 
ID4269500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3074422 
End bp3075537 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content64% 
IMG OID638127470 
Productbile acid:sodium symporter 
Protein accessionYP_743539 
Protein GI114321856 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATT CGCAGCCGAC CGCCGCCAAT GACACCCCCT CCACCAAGGA GGGTATGGGC 
TTGTTCGAGC GCTACCTCAC CCTTTGGGTG GCGCTGGGCA TGATCGCCGG CGTGCTGCTG
GGGCAGTTTC TGCCGGTGGT GCCGGATACC CTTGCACGCT TCGAGTACGC CCAGGTCTCC
ATCCCCGTCG CCATCCTGAT CTGGGCCATG ATCTACCCGA TGATGGTGCA GATCGACTTC
GGTGCCATCC TCGGTGTGCG CCGGCAGCCC AAGGGGCTGG TGATCACCAC CACTGTGAAC
TGGCTGATCA AGCCTTTCAC CATGTTCGCC ATCGCCTGGT TCTTCCTGAT GGTGGTCTTC
CAGCCCTTCA TCCCGGAGGA CCTGGCCCGG GAGTACCTGG CCGGGGCCAT CCTGCTGGGC
GCGGCCCCCT GCACCGCCAT GGTCTTCGTC TGGAGCTACC TCACCCGGGG CGATGCCGCC
TACACCCTGG TGCAGGTCTC GGTGAACGAC CTGATCATGC TGTTCGCCTT TGCCCCCATC
GTGGTGCTGC TGCTGGGCAT CTCTGACATC CAGGTGCCGT GGGATACCGT GGCGTTGTCG
GTGTTCCTGT ACATCGTCAT CCCGCTGGCT GCCGGCTACC TGACCCGCGT GATGCTCATC
AAACACAAGG GCGTGGAGTG GTTCGACAAC GTCTTCATGA AGCGCCTGGC GCCGGTGACG
CCCATCGGGC TGATCCTCAC CCTGATCCTG CTGTTCGCCT TCCAAGGCGA GGTGATCCTG
AACAACCCGC TGCACATTCT GTTGATCGCG ATCCCGCTGA TCATCCAGAC CTTCCTGATC
TTCTTCATTG CCTATCGTTG GGCGAAGGCG TGGAAGGTGC AGCACTCGGT GGCGGCACCC
GGGGCCATGA TTGGCGCCAG CAACTTCTTC GAGCTGGCCG TGGCCGCGGC CATCGCCCTG
TTCGGCCTGC AATCGGGGGC GGCGCTGGCC ACCGTGGTGG GCGTGCTGGT GGAGGTACCG
CTGATGCTGG CGCTGGTCCG CATCGCCAAT AAAACCAAGC ACTGGTTCCC GGAAGAGACG
CAGCCGGGGC TGGCCCCCGC CTCGGGCAAG GCATGA
 
Protein sequence
MSDSQPTAAN DTPSTKEGMG LFERYLTLWV ALGMIAGVLL GQFLPVVPDT LARFEYAQVS 
IPVAILIWAM IYPMMVQIDF GAILGVRRQP KGLVITTTVN WLIKPFTMFA IAWFFLMVVF
QPFIPEDLAR EYLAGAILLG AAPCTAMVFV WSYLTRGDAA YTLVQVSVND LIMLFAFAPI
VVLLLGISDI QVPWDTVALS VFLYIVIPLA AGYLTRVMLI KHKGVEWFDN VFMKRLAPVT
PIGLILTLIL LFAFQGEVIL NNPLHILLIA IPLIIQTFLI FFIAYRWAKA WKVQHSVAAP
GAMIGASNFF ELAVAAAIAL FGLQSGAALA TVVGVLVEVP LMLALVRIAN KTKHWFPEET
QPGLAPASGK A