Gene Mlg_0091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0091 
Symbol 
ID4268829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp100152 
End bp101567 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content70% 
IMG OID638124817 
Productmajor facilitator transporter 
Protein accessionYP_740938 
Protein GI114319255 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCAGG TTTTTCTCTC GATCCTGGCG CTGCTGGGCT CAGTAGGGCT GTTCACGCTG 
GGCAGCGGGC TGCTGGGCAC CCTGCTCGGC GTGCGTATGA CCCTGGACGG CTTCGACCCG
CAGGTCACCG GCCTGGTCAT GGCCGCCTTC TTCGTGGGGT TAATGGTGGG GGCCATGGAG
GCCGGCCGGG TGATCCGCCG GGCCGGGCAC ATCCGAGCCT TTGCCGTCTT CGCCGCCTGC
GCCACCGCCG CCGTGCTGCT GCACGGGCTG TTCGTCTCGG TATGGGTTTG GGCCCTGCTG
CGGGTGATCA CCGGCTTCGC CGCCGCTGGC ATCTACATGG TCATCGAGAG CTGGCTCAAC
GAGCGCTCCT CGGCGGCCAA CCGGGGCCGG GTGTTCTCGG TCTACCAGGT GGTCAGCTAC
CTGGGCCTCG GCACCGGGCA ATTCCTGCTC TTCGCCGCCG ACCCCGCCAC CACCGAGCTG
TTCATGATCA CCGCCGGCCT GTTCGCCCTC TGCCTGATCC CGGTGGCCAT GACCCGGGGC
CTCCACCCCT CGCCGCCGGA AAGCCACGGC ATGGCGCTGA GGCCGGCGCT CACCGAGTCA
CCGTTGGGGG TGGTGGCCTG CATCGGTGCG GGCATGGTCA ACGGGGCGGT GTTTGCCCTG
ACCCCGGTCT TCGCCCTGGA GGCCGGACTG GGCCTGGCCG GGGTCTCGCT GCTGATGGGG
GCGATCATCT TTGGCGGTTT TCTGCTGCAA TGGCCCATCG GCCACCTCTC CGACAACTTC
GGCCGGCGCG GGGTGATGGC CATCGTCAAC CTCTCGGTGG CGGTGGCGGC GGTGGCCCTG
GTCTTCTCCG CCGAGCTCAC CCTGCCGGTG CTGATGGGGG TGGGGGCGCT GTTCGGGGGG
CTCTCCTTCA CCCTCTACCC GTTGGCGGTG GCCCATACCA ACGACCAGAT CAAGGTACGT
GACTTTGTCA CCATCAGCGC GGCACTGCTG TTCCTGTGGG GGTTGGGATC GGCGGTGGGG
CCGGTGCTGG CCGGTCAGGT GATGGGCCGG GTCGGCAATA CCGGGCTGTT CCTGTTCGTT
GCGGTCATCG CCCTGGGCGT GGCCCTGGCC GCCTGGCAAA TGCGTCGCGA GTCAGTGGCC
CCGGAGGACC AGGAGCCCTT CGTGGTGATG GCCCGGACCA CGCCGGTGGC CTCCGAGCTG
GACCCGCGCT ACGACGAAGA GGCCGCCCGG GAAGCGGCCG AGCAGCAGGC GCGGGCCGAT
GCCGAGACCG CCGCCGCCGA ACTCTGGGAC GAGGTGGTGG TTGCCGAGGC GGAGGCGGAA
GCGGAGGCCG ACGCCGCCAC CAAGCCGACC GCCACCGGGG ACGCCGGCCC CCAGGACGGC
GACGAGGACA CGACCCCGCC GCCACGCCGC GACTGA
 
Protein sequence
MAQVFLSILA LLGSVGLFTL GSGLLGTLLG VRMTLDGFDP QVTGLVMAAF FVGLMVGAME 
AGRVIRRAGH IRAFAVFAAC ATAAVLLHGL FVSVWVWALL RVITGFAAAG IYMVIESWLN
ERSSAANRGR VFSVYQVVSY LGLGTGQFLL FAADPATTEL FMITAGLFAL CLIPVAMTRG
LHPSPPESHG MALRPALTES PLGVVACIGA GMVNGAVFAL TPVFALEAGL GLAGVSLLMG
AIIFGGFLLQ WPIGHLSDNF GRRGVMAIVN LSVAVAAVAL VFSAELTLPV LMGVGALFGG
LSFTLYPLAV AHTNDQIKVR DFVTISAALL FLWGLGSAVG PVLAGQVMGR VGNTGLFLFV
AVIALGVALA AWQMRRESVA PEDQEPFVVM ARTTPVASEL DPRYDEEAAR EAAEQQARAD
AETAAAELWD EVVVAEAEAE AEADAATKPT ATGDAGPQDG DEDTTPPPRR D