Gene Msil_1009 details


Gene Information

Locus tag: Msil_1009
Symbol:
ID: 7091837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1095568
End bp: 1096950
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 68%
IMG OID: 643464348
Product: major facilitator superfamily MFS_1
Protein accession: YP_002361340
Protein GI: 217977193
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
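
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent. A minimal sketch of that check follows; it assumes the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1095568, 1096950)
print(length)                     # 1383
print(protein_length_aa(length))  # 460
```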


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.00165429
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGTCGGGC TGGTTTGCAT CGGCGCCTTC CTCGGCCAGC TGGACGCCAC CATCGTCCAG 
CTCGCCTTGC CGACTCTCGG CAAAACGTTC GCTGCGCCGC TGCAACAGGT CAGCTGGGTC
GCACTGGCCT ATCTCGTCGC CTTCGCGTCA TTTTTGCCGA TCTTCGGGCG GCTGTGCGAG
ATCTTCGGCC GGAAGTCGCT TTATCTCGCG GGATACGCTC TGTTCGTCAT CGCAAGCGCT
TTATGCGGCC TCTCCACCAG TCTCAGCGAA CTGATCGTCT TCCGCGTTCT GCAGGGGATC
GGCGGCTCGC TGCTGGGCGC CAACAGCATC TCCATTCTCG TGGCCGCGAT AGGCCCGAAA
CAGCGCGGCA AGGCGCTTGG AATCTTCGCA GCGGCGCAGG CGGTCGGCAT GAGCGCGGGG
CCTGCCATCG GCGGCCTGAT CCTCGCCGCG CTCGACTGGC GATGGCTGTT CTGGCTCGCC
GCGCCGTTCG GGGCCGCCGC CATCATCGTT GGATGGCTGG CGCTGCCGCA AACGGAGACG
ACCGAGCAGG ACATGCGGTT CGATTGGCGC GGCGCCGTCC TGATCGGGCC GGCGCTCATC
TTCCTCATCG TGGCGCTCAA CCATCTCTCG GCCCTGACCT CGCCGCTCAC AATGGCTTTC
CTTGCTGGGT CGTCGGCGCT GGGCTGGCTC CTCATCCGGC ACGAACGCGC ATCGCCCCGG
CCGCTGGTTG ATCTGCTTCT ATTCCGCAGC AGAGCCTTCT GCTGCGGGGC GATCGCCGTC
GCGCTGGCTT ACGCCCTGCT CTACAGCATG TTCTTCCTGA TGTCGTTTGC GCTGGAACAT
GGCTATGGCG ACAGTCCCGC AGCTGCGGGA CTTCGTCTGG CGATCATTCC GGTCGCGCTC
GGGGCGACGG CGCCGTTCAG CGGCGCGCTC AGCGACCGGC TTGGCGCGCC GCTGCTCAGC
GCCGCGGGAA TGGCCTGCTG TCTCGTCGCT CTGCTGATGC TGGTCATCAG CGGGCAAGAC
TCGGGCGGCG GGCATCTCGT CGCAGCGGCG GCCTTCGCGC TGTTCGGCGT GGGACTGGGC
GCATTTATCG CCCCCAACAA TCACAGCACC GTCGAGGCCG CGCCCGCGCG CCTCTCGGGC
GAAGCCGGCT CGATGCTAAA CCTGATGCGC GTTCTCGGCG CCAGCCTTGG CGTCGCCGCC
GCCACAGCCA GCCTCTCGTG GCGTCTCGAG GACGCAGGCG CCGGCCAGAG CTGGCTTGGC
GCCACAGGGC CCTCGCTGCT CCGCGCCGTC GAGGCCAGTC TGATGATACC GGCGGGCCTT
GCCGCGCTCG CGGCGGCCGC AGCGCTCTGC GCCGCCTCTC CCCGAGCGCC GAAAACGGCC
TAA
 
Protein sequence
MVGLVCIGAF LGQLDATIVQ LALPTLGKTF AAPLQQVSWV ALAYLVAFAS FLPIFGRLCE 
IFGRKSLYLA GYALFVIASA LCGLSTSLSE LIVFRVLQGI GGSLLGANSI SILVAAIGPK
QRGKALGIFA AAQAVGMSAG PAIGGLILAA LDWRWLFWLA APFGAAAIIV GWLALPQTET
TEQDMRFDWR GAVLIGPALI FLIVALNHLS ALTSPLTMAF LAGSSALGWL LIRHERASPR
PLVDLLLFRS RAFCCGAIAV ALAYALLYSM FFLMSFALEH GYGDSPAAAG LRLAIIPVAL
GATAPFSGAL SDRLGAPLLS AAGMACCLVA LLMLVISGQD SGGGHLVAAA AFALFGVGLG
AFIAPNNHST VEAAPARLSG EAGSMLNLMR VLGASLGVAA ATASLSWRLE DAGAGQSWLG
ATGPSLLRAV EASLMIPAGL AALAAAAALC AASPRAPKTA
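
The gene starts with GTG, yet the protein starts with M. This is expected under translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates this with a hand-built codon table (the sense codons of table 11 match the standard code) and also shows how a GC fraction can be computed. The helper names `clean`, `translate_cds` and `gc_fraction` are hypothetical, and the example translates only the first 60 nucleotides of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI ordering).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS (or an in-frame 5' fragment of one) with table 11 rules."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # an initiation codon is always read as methionine
    return "".join(residues).rstrip("*")  # drop the terminal stop, if present


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_60 = "GTGGTCGGGC TGGTTTGCAT CGGCGCCTTC CTCGGCCAGC TGGACGCCAC CATCGTCCAG"
print(translate_cds(first_60))  # MVGLVCIGAFLGQLDATIVQ
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way allows a comparison with the protein sequence and the GC content reported in the record.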