Gene Mlg_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1668 
Symbol 
ID4268900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1908624 
End bp1910378 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content68% 
IMG OID638126426 
Productsulfate transporter 
Protein accessionYP_742504 
Protein GI114320821 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00377] anti-anti-sigma factor
[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.933775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAC GGCTCCTACC CTTCCTGGCC TGGCCCCGGC CCACCGCCGA CACGCTCAAG 
GCCGACCTGT TGGCCGGCGT CACGGTGGCG CTGGTCCTGG TACCACAGTC CATGGCCTAC
GCCACCCTGG CCGGCATGCC GCCCTACTAC GGGCTGTATG CCGCCTTCCT GCCGGTGATC
GTGGCCGCCC TCTGGGGCTC GTCACCGCAA CTGGCCACCG GGCCGGTGGC CGTGGTGGCG
CTGCTGACGG CCGCGGCGCT GGCCCCGTTG GCGGAAGCGG GCAGCGGCGA GTTCATCACC
CTGGCCATCG CCCTGGCCTT CATGGTCGGG GTGATCCAAC TCCTGCTCGG CGCCTTCCGA
CTCGGCACCC TGGTGAACTT CATCTCCCAC CCGGTCATCA TCGGGTTCAC CAACGCCGCC
GCCATCGTGA TTGTGCTCTC CCAACTGGGT AGCCTGCTGG GGCTGTCGAT GGACCGCAGC
GGCAGTTTCC TGCTGGGGGT GCTCGACCTG TTGCAACGGG TGCCCCAGGC CCACGGGCCA
ACCGTACTGA TGGGACTGGC GGCCATCGCC ATGATGGTGG GGTGCAAGCG CTGGCTGCCG
CGGATCCCCG GGGTGTTGCT GGCCGTCGCC GTACTCACGC CGGTCAGCCT ATGGCTGGAT
TTCGAGGGCA TGGGCGGCGC GGTGGTGGGC GGCATCCCGG AGGGCTTACC CACCCTGGGG
ATCCCGGAAC TCGGCGTCAC CACGGTCACC ACGCTGATGA CCACGGCACT GGTCATCGCC
CTGGTCGCCT TCATGGAGGC GATCTCCATC GCCAAGGCCA TCGCCACCCG GACCCGCGAC
CGTATCGATC CCAACCAGGA GTTGATCGGG CAGGGGCTGG GCAATCTGGT GGGCAGCTTC
TCCAGCGCCT TTCCGGTCAG CGGTTCCTTC TCCCGCTCAG CCGTCAACTA CAACGCCGGT
GCCCGGACCG GGCTGTCCTC GGTCATCACC GGGTTGCTGG TGGCGCTCAC CCTGCTCTTC
CTCACCCCCC TGCTCTACCA CCTGCCGCTG GCGGTGCTCG CGGCCATCAT CATGATGGCG
GTACTGGGCC TGGTGAACGT GAAGGCGGTG CGCCACGCCT GGCAGGCCAA ACGCGACGAT
GGGATCGCGG CCGTGGTCAC CTTCAGCGCC ACCCTGATCT TTGCCCCCCA CCTGGATTAC
GGCATCCTGC TCGGCGCGGG GCTGGCCATC GTGCTCTATC TGTTACGGAC CATGAAGCCA
CGGGTGGTGC TGCTCGCACG CCACCCCGAC GGCACCCTGC GCGATGCCGA GTACTTCGAC
CTGCCCCGCA GCCCCTACAT CGCCGCCGTA CGCTTCGACG GGGATCTCTA TTTCGCCAAC
GTGGGCTACT TCGAGGACGC CATCCTCGAT GCCCGGGCCC GACACCCGGA AGCGCGCTTC
GTCCTCGTGG TGGCCAACGG CATCAACCAG ATCGACGCCT CGGGCGAGGA GACCCTGCAC
AAACTGGCGG AGAACCTCCA CGCCAGCGGC AGCACCCTGG TCCTGGCCGG TCTGAAGCTG
CCTCTCCAGG AACTGCTGGA ACGGACGGGG CTGAAGGAGG TAATTGGCGA CGAGAATATC
TACCGCAACG AACGCCACGC CCTGGCGGCG ATTTATCAAC GGATGGATGT ACCCGGGTTT
GATCCCGCGC GCTGCCCCCT GAACCCGGAG CCCGCGGGTG AAGCCGGCAT CGCGGACAGC
GCCGCCGAAC GCTGA
 
Protein sequence
MLKRLLPFLA WPRPTADTLK ADLLAGVTVA LVLVPQSMAY ATLAGMPPYY GLYAAFLPVI 
VAALWGSSPQ LATGPVAVVA LLTAAALAPL AEAGSGEFIT LAIALAFMVG VIQLLLGAFR
LGTLVNFISH PVIIGFTNAA AIVIVLSQLG SLLGLSMDRS GSFLLGVLDL LQRVPQAHGP
TVLMGLAAIA MMVGCKRWLP RIPGVLLAVA VLTPVSLWLD FEGMGGAVVG GIPEGLPTLG
IPELGVTTVT TLMTTALVIA LVAFMEAISI AKAIATRTRD RIDPNQELIG QGLGNLVGSF
SSAFPVSGSF SRSAVNYNAG ARTGLSSVIT GLLVALTLLF LTPLLYHLPL AVLAAIIMMA
VLGLVNVKAV RHAWQAKRDD GIAAVVTFSA TLIFAPHLDY GILLGAGLAI VLYLLRTMKP
RVVLLARHPD GTLRDAEYFD LPRSPYIAAV RFDGDLYFAN VGYFEDAILD ARARHPEARF
VLVVANGINQ IDASGEETLH KLAENLHASG STLVLAGLKL PLQELLERTG LKEVIGDENI
YRNERHALAA IYQRMDVPGF DPARCPLNPE PAGEAGIADS AAER