Gene Mlg_0627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0627 
Symbol 
ID4270609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp676109 
End bp677110 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content64% 
IMG OID638125374 
Productbile acid:sodium symporter 
Protein accessionYP_741471 
Protein GI114319788 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0125709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCT TTCAGATCCT GTCCAAGCTT ACCGGTAAGC TGACTATCGC CATCCCCGCG 
ATGATGGCGA TGGGGTTCGT TTTCGGGGCC GTGGCGCCGA GCGACTGGCT GCAGGCGCTT
ATTCTGCCTC TCACCTTCCT GATGGTTTAC CCGATGATGG TCAATCTGAA GGTGCGTTCG
GTCCTGGAAG GGGTGGACGG TCGCGCCCAG GGCCTGGCGC TGCTGGTGAA TTTTGGCGTC
ATTCCCTTCA TCGCCTTTGC CATCGGGCTG CTGTTCCTGG CCGATCACCC CTATTTCGCC
CTGGGGCTGC TGCTGGCGGC GCTGCTGCCG ACCAGCGGCA TGACCATTGC CTGGACCGGC
TTTGCCAAAG GCAACGTGCC CGCCGCCATC AAGATGACGG TGATTGGCCT GCTTGTGGGC
TCGGTGGCGA CGCCCTTTTA CGTGCAATGG CTGATGGGCG CCGAGGTGCC GGTGGACCTG
CTGACGGTCT TCCGCCAGAT CGCCATTATC GTGCTGCTGC CACTGGTGGC CGGGCAGATC
ACCCAGCACT ACCTGCGCCG TCGCTACGGC CAGGAGGGCT ACCAGAGGCG GTGGGCCCCG
CGTTTCCCGC CGCTCTCCTC CCTGGGGGTG CTGGGTATCG TCTTCGTGGC CATGGCCCTG
AAGGCCGGGG ACCTGATCGC CTCTCCCGGC GACCTGCTGA TTATCGCCGT GCCGGTGGTC
CTGCTTTACC TGATCAACTA CACCCTCAGC ACCGGCATTG CCCGCGCCGT GTTGGGGCGG
GGCGAGGGGA TTGCGCTGGT CTACGGCACG GTGATGCGCA ATCTCTCCAT TGCCCTGGCA
TTGGCCATGA ACGCCTTTGG CGAGGCAGGT GCCGATGCGG CGCTGGTGGT TGCGCTGGCC
TTCATCATCC AGGTGCAGTC GGCGGCCTGG TACGTCAAAC TGACGGACCG GGTGTTTGGT
AGCGCACCGG AAACGGACGG GGTCGGCGAA CAGGCGCGAT GA
 
Protein sequence
MNAFQILSKL TGKLTIAIPA MMAMGFVFGA VAPSDWLQAL ILPLTFLMVY PMMVNLKVRS 
VLEGVDGRAQ GLALLVNFGV IPFIAFAIGL LFLADHPYFA LGLLLAALLP TSGMTIAWTG
FAKGNVPAAI KMTVIGLLVG SVATPFYVQW LMGAEVPVDL LTVFRQIAII VLLPLVAGQI
TQHYLRRRYG QEGYQRRWAP RFPPLSSLGV LGIVFVAMAL KAGDLIASPG DLLIIAVPVV
LLYLINYTLS TGIARAVLGR GEGIALVYGT VMRNLSIALA LAMNAFGEAG ADAALVVALA
FIIQVQSAAW YVKLTDRVFG SAPETDGVGE QAR