Gene Msil_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1303 
Symbol 
ID7091313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1400948 
End bp1402654 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content62% 
IMG OID643464641 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002361630 
Protein GI217977483 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCG CCTCGGTATC GGAAGCGGAC AGCGCCGGAC TGCGGCGCAT CACCGCCGTC 
GACAAGAAGC TCATCTTCGC CTCCTGTCTC GGCACGATCT TCGAATGGTA TGATTTTTAT
CTCTATGTTT CGCTGACCGG CATCATAGGC GCCCAGTTCT TCGGCCAGTT CGACGAGGCG
ACGGCGGATC TTTTCGTATT GATCGCCTTT GCGGCCGGGT TCCTCGTGCG GCCGTTGGGA
GGCCTGGTGT TCGGGCGCCT TGGCGATCTC GTCGGCCGCA AATACACTTT TCTGCTGACC
ATCCTGATCA TGGGCTTTTC GACCTTCGTG ATCGGGCTCG TGCCCGGCTA TGGCGTAATT
GGCGCCGCCG CGCCGGTCCT GCTGATTGTG ACGCGGCTGT TGCAGGGATT GGCGATCGGC
GGCGAATATG GCGGAGCGGC GATTTTCGTC GCCGAACATG CGCCACAGCG GCGGCGGGGC
CTGTTCACCT CCTTCATTCA GGCGACCGCG ACGCTCGGCC TCATGTCGTC GCTCATCGTG
ATCCTGCTCA CCAAGGCGAT CCTCGGCGAG GCGAATTTCA TCAACTTCGG CTGGCGGATT
CCGTTCCTGC TGTCGGTGGG GCTGCTTGGC GTTTCCTTGT GGATACGCCT GCAGCTCAAT
GAATCTCCGG CTTTCGCGAA GATCAAGGCC GATGGAACCG CCTCGCGCGC GCCGCTGAAG
GACGCCTTCG GCAATTGGGC CAATGGGCGG ATCGTGCTGC TCGCGCTGTT CGGCCTCACC
ATGGGGCAGG GCGTCATCTG GTACACGGGA CAGTTCTACG CACTGTCTTT CCTGCGGCAG
ACGCTGCGGA TCGACAATTT TACGGCGAGC GTGCTGCTCG TCTGGGCGAT GGCGCTTGCC
GCCGGATTCT TCGTTTTTTT CGGGTGGCTC TCGGACAGGA TTGGACGCAA GCCGATCATC
ATGTTTGGCT GCCTCATCGC GGCGGCGACC TACATGCCGA TCTTCGATCG CATCACGGCG
ATCGCCAATC CGGCTCTCAA CACCGCCTAT CAAAAAGTTT CGGTCAAAGT CATCGCCGAC
CCCGCCGATT GCGGCAATCT GTTTAACCCG GTCGACACGC GCAAAGCGGC GACCTCCTGC
GATATCGCGC GCGACTTGCT GGCGAGCCAT TCCGTGAAAT ATGCGCGCGA GGCCGCGCCT
GCGGGGACGC AGGCGCAGAT CGTGATCGGA TCGGAGGGGA TCGACGTTTT TGCCGCGGAG
GATGTCGCCG ACCCGCAGGC CGCGCGCGTG GAACTCGATC AAAAGCTCGT CGGCGCGCTG
GAGCAGGCCG GCTATCCGCG GACCGACGAC ACAACGCGCG TTCGCATGTC GAACGCCTTC
GACGTGTTTC GACCGCAGGT CGCGATGCTC ATCGGCCTGT TGTTTCTTTT GGTTTTATAT
GTCGCAATGG TCTATGGGCC GATCGCGGCC GCGCTGGTCG AATTGTTTCC AACGCGGATC
CGCTACACGT CGCTGTCGCT GCCCTATCAC ATCGGCAACG GCTGGTTCGG CGGCCTGCTG
CCCGCCACAG CCTTCGCGAT GGTCGCGGAA ACGGGCGATC GTCTTTACGG GCTGTGGTAT
CCGATGGCGA TCGCGCTCGC CTGTTTTGCG ATCGGAGTCG CCTTCGTGCC GGAGACCAAG
GACCGCGACA TTACCGCCGG CGACTGA
 
Protein sequence
MAIASVSEAD SAGLRRITAV DKKLIFASCL GTIFEWYDFY LYVSLTGIIG AQFFGQFDEA 
TADLFVLIAF AAGFLVRPLG GLVFGRLGDL VGRKYTFLLT ILIMGFSTFV IGLVPGYGVI
GAAAPVLLIV TRLLQGLAIG GEYGGAAIFV AEHAPQRRRG LFTSFIQATA TLGLMSSLIV
ILLTKAILGE ANFINFGWRI PFLLSVGLLG VSLWIRLQLN ESPAFAKIKA DGTASRAPLK
DAFGNWANGR IVLLALFGLT MGQGVIWYTG QFYALSFLRQ TLRIDNFTAS VLLVWAMALA
AGFFVFFGWL SDRIGRKPII MFGCLIAAAT YMPIFDRITA IANPALNTAY QKVSVKVIAD
PADCGNLFNP VDTRKAATSC DIARDLLASH SVKYAREAAP AGTQAQIVIG SEGIDVFAAE
DVADPQAARV ELDQKLVGAL EQAGYPRTDD TTRVRMSNAF DVFRPQVAML IGLLFLLVLY
VAMVYGPIAA ALVELFPTRI RYTSLSLPYH IGNGWFGGLL PATAFAMVAE TGDRLYGLWY
PMAIALACFA IGVAFVPETK DRDITAGD