Gene Msil_0857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0857 
Symbol 
ID7093290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp946195 
End bp947406 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content63% 
IMG OID643464195 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002361189 
Protein GI217977042 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATCG CTGCGCGCTC TCACGCGAAG ATCGAAGCGC GCCGTGCGCG CGTTGGCGCC 
GCTGCTTGGC GCTCCATTCT CGTCGCGTTG ACCGCCTTTC TGACTGTCGT CGACCTGTTC
GCGACACAGG CTATCTTGCC GCCGCTGGCT CATGCCTACG CTGTGACTCC GGCCGCGATG
GGCCTCGCCG TCAATGCGAG CACTTTCGGC ATGGCCGCTG CGAGCTTTGC GGTCGCCGCG
TTCAGCCACC GGATCGATCG CCGTCGCGGC GTGATCATGA GTCTCGTCGC ATTGTCGGTC
CCGACGCTGC TGCTCGCCAT AGCGCCCAAT CTTGCCGTGT TTGCCCTGTT GCGAATCATG
CAAGGATTGC TGATGGCCTC CGCCTTCACG CTGACGCTCG CCTATTTGAG CGAGCGATGC
AGCGCATCGG ATACTGCAAG CGCCTTCGCC GCCTATATCG CAGGCAATGT CGCGTCCAAT
TTATTCGGTC GCCTTCTCGC CGCAGCGACG ACGGATCATT TCGGGTTGGC CACGAATTTT
GTGCTGTTTG CCTGCCTCAA TCTCGCAGGC ACCGCGCTTG TCTATTTCAC GGTTCGACGC
CAATCCGCGC CGCCCCAGGA CTACGCGCCG ACGGATCCAG CTTCGGCCAT AAGGGGGCAT
CTGAACAACC CGGCGCTGCG CGCCAGCTTT GGCGTCGGCT TTTGCATTCT CTTCGCCTTC
ATCGGCGTTT TCACTTTTGT GAATTTTGTC CTCGTTCGTC CCCCGATCAG CGCGGGCATG
ATGACTGTCG GATTCGTCTA TCTCGTCTTT CTGCCCTCGA TCGCGACAAC CTTGTGGGCG
GGGCGGGCTG TCGCGCGGTT AGGGCAGAGG CGCGGCCTTA TCGGCGGGCT CGTCGTCGCG
GCAGCTGGAT TGCCGTTCCT GCTGACGTCG TCTCTCATGC TCGTGACCGC GGGTCTCGGC
TTTTTCGCGA TTGGAACATT CTTCGCGCAG GCGGTGGCGA CGGGATTCGT CGGCCCCGCA
GCGACGGGCG ACCGAGGCGC CGCGAGCGGC CTCTACCTTG CATGCTACTT TCTCGGCGGG
ATCGCCGGCA CGGCGACGCT TGGCTGGATA TTCGACAGTT TCGGCTGGGC CGCCTGCATC
GGCGGCGTCG CGTTTTCGCT GAGCGTCGCG GCGCTGCTCG GGACGCGGTT TTTCCTGCCC
GCGCATCACT GA
 
Protein sequence
MSIAARSHAK IEARRARVGA AAWRSILVAL TAFLTVVDLF ATQAILPPLA HAYAVTPAAM 
GLAVNASTFG MAAASFAVAA FSHRIDRRRG VIMSLVALSV PTLLLAIAPN LAVFALLRIM
QGLLMASAFT LTLAYLSERC SASDTASAFA AYIAGNVASN LFGRLLAAAT TDHFGLATNF
VLFACLNLAG TALVYFTVRR QSAPPQDYAP TDPASAIRGH LNNPALRASF GVGFCILFAF
IGVFTFVNFV LVRPPISAGM MTVGFVYLVF LPSIATTLWA GRAVARLGQR RGLIGGLVVA
AAGLPFLLTS SLMLVTAGLG FFAIGTFFAQ AVATGFVGPA ATGDRGAASG LYLACYFLGG
IAGTATLGWI FDSFGWAACI GGVAFSLSVA ALLGTRFFLP AHH