Gene Mlg_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2183 
Symbol 
ID4270962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2482184 
End bp2483407 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content74% 
IMG OID638126939 
Productmajor facilitator transporter 
Protein accessionYP_743015 
Protein GI114321332 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.362785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.176029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCCCT TGCCTCGATT CCTCCAGCCA CGGCCTGAAC TGGCCGCCTT TGCGCTATTG 
GCCACGGCCA CCTCCGGCTT TGGCCAAACC TTCCTGATGT CGGTTTTCGG CGGCGAGATT
CGCGCGGCCT TCGACCTCAG CCACAGCGCC TACGGCACGC TCTACGGCGC CGCTACCCTG
GTCAGCGCCC TGCTGCTGCT ACGGGCCGGG GCCTGGGTGG ACCACTGGCC GTTGCGCCGG
GCGGTGGCGG TGACCCTGCT GTTGCTGGCA CTGGGCTGCC TGACCGTGGG ACTGGCCCCC
ACGGCGGCGT TACTGCTCCC AGGTTTCCTT CTCATCCGCT TCGCCGGCCA GGGGCTCAGC
GCCCACCTGG GCCTGACCGC CGCCGGGCGC TATTTCTCCA CCCACCGCGG CAAGGTCATG
GCCCTGGCGG CCAGCGGCTT TCCCCTCGCC GAGGCCCTGT TGCCGGCAGC GGCAGTGGCC
ATTATGGGGC TCGGGGGCTG GCGCATGCCC TGGCTCCTGG GCGCGGGTTT CCTGCTGCTC
TGCATGCTGC CGCTGCTGCT GCGGCTCACC TGGCACGCCC CCAGTGCGGC GGAGGCCGCT
CGGGAGGCGG GCGGCACCGA CAACGGCCAT CGCCGGCGAG ACGCTCTGCG GGATCCGGGC
TTCTACCTGC TGCTGCCGGC GGTGCTCGCC GCCCCGTTCA TCGTCACCGC CATGCTCTTC
CACCAGGCTG CCATCGCCGA GGCCCGGGAA TGGCCGCTGC CCCTCATGGG CGCCGCCTTC
ACCGGCTTCG CCGCCGGCCA CCTGGCCAGC CTGCTGCTGG CCGGCCCACT GGTGGACCGC
ATCGGCGCCC ACCGGGCCCT GCCCCTGGCC CTGGGGCCCA TCGGCCTCGG CCTGTTGATC
CTGGCCTTCG GCGGCGGTGG CGGGTGGGTG CCCTTCGCCT ACCTGACACT CACCGGCGCA
ACGCTGGGCT GGGGCGCCAC CGCCGGCGGC GCCATCTGGG CCGAGCGCTA TGGCGTGCGC
CACCTGGGCG CCATCCGGGC CATGGCCCAC GGGGTAATGG TGGCCAGCAC CGCGATCGCC
CCGGTGGTGG CCGGGGTGCT GCTGGACCGG GGCTGGTCGG TGACCGCCCT GGCGGGTGCG
ATGGTGGGTT ACGTGCTGGT GGCCGGCCTC TGCGCCCGGG CGGCCCCGGC ACCGCCAGCG
ATGCGAGCCC CCGCCGGCGG CTGA
 
Protein sequence
MLPLPRFLQP RPELAAFALL ATATSGFGQT FLMSVFGGEI RAAFDLSHSA YGTLYGAATL 
VSALLLLRAG AWVDHWPLRR AVAVTLLLLA LGCLTVGLAP TAALLLPGFL LIRFAGQGLS
AHLGLTAAGR YFSTHRGKVM ALAASGFPLA EALLPAAAVA IMGLGGWRMP WLLGAGFLLL
CMLPLLLRLT WHAPSAAEAA REAGGTDNGH RRRDALRDPG FYLLLPAVLA APFIVTAMLF
HQAAIAEARE WPLPLMGAAF TGFAAGHLAS LLLAGPLVDR IGAHRALPLA LGPIGLGLLI
LAFGGGGGWV PFAYLTLTGA TLGWGATAGG AIWAERYGVR HLGAIRAMAH GVMVASTAIA
PVVAGVLLDR GWSVTALAGA MVGYVLVAGL CARAAPAPPA MRAPAGG