Gene Msil_0395 details

Gene Information

Locus tag: Msil_0395
Symbol:
ID: 7093554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 431472
End bp: 432890
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 64%
IMG OID: 643463725
Product: major facilitator superfamily MFS_1
Protein accession: YP_002360731
Protein GI: 217976584
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
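
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(431472, 432890)
print(length, protein_length_aa(length))  # 1419 472
```

Under these assumptions the result agrees with the listed Gene Length (1419 bp) and Protein Length (472 aa).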


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.90687
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCGG TCGAGACCGT GATTCCCGCG CGGCTCGATC GGCTCGCCTG GAGCTGGTTT 
CATACGCTTG TGGTCCTCGC GCTCGGCGTC ACCTGGATTC TTGACGGGCT CGAAGTGACG
CTGGCGGGCT CTGTCGCCGG CGCCCTCAAG GCGAGCCCAA GATTGCAATT CTCCGACGCC
GACGTTGGAC TTGCCTCGAC CGCTTATCTC GTCGGCGCGG TGGCCGGCGC GCTGCTGTTT
GGCTGGATGA CCGATCGCCT CGGCCGCCGC CTTATGTTTT TCGCGACTCT CGGCCTTTAT
CTCGTCGGCG GCGCGGCCAC GGCCCTGTCC TGGGATCTGC CGAGCTTCTG CCTCTTCCGC
CTCCTCACCG GGGCCGGCAT CGGCGGCGAA TATAGCGCGA TCAATTCGAC GATTCAGGAG
ATGATCCCGG CCCGTTATCG CGGCCGGACG GACCTTGCCA TTAATGGTTC GTTCTGGATC
GGCGGCGCCC TTGGCGCGGC GAGTTCCCTG GCGCTGCTGG ATCCGGCGAT CATTGACCCG
GAAACCGGCT GGCGCCTCGC CTTTATGATC GGCTCGCTGA TCGGCCTCGC CGTGCTGGTT
ATGCGCCGCT TCATTCCCGA AAGCCCGCGC TGGCTCGTCA TCCACGGCCG GCTCGACGAC
GCGGACCGCA TCATGGATGA GATCGAGGCG AGTGCCGGGG CCGGCAAGGC GCCGGAAACG
CTCAAACCCC TGCGCATCCG TCCGCGCCGC TTTACGCCGC TGCGCGAAGT GTTTCACACG
CTCTTCGTGG TCTACCGCCA GCGCTCGCTC GTCGGCCTCG CGCTGATGGC CGCGCAGGCG
TTCTTCTATA ATGCAATTTT TTTTACTTAT GCGCTTGTGC TGACGCGGTT TTATGGCGTT
CCGGCCAATC ATATCGGCTG GTTCATCCTG CCCTTTGCGC TGGCCAATTT TATGGGGCCG
CTGGCGCTCG GGCCGCTCTT CGACAGCCTC GGCCGCAAGC CGATGATCAG TTTCACCTAT
GCGATTTCCG GCGCGCTTCT CGCCTTGAGC GGAGTGCTGT TCGCGTTCGA ACTGGTTAGC
GCGGCGCAAT TGACCATCGC CTGGATGGTC ATTTTCTTCT TCGCATCCGC CGCGGCGGGA
GCCGCCTATC TGACGGTCAG CGAGACCTTT CCGGTCGAGA TCAGGGCGCT CGCCATCGCC
GTTTTCTACG CCGCCGGCAC GGCGATCGGC GGCGCCGGAG CTCCGTATCT GCTTGGCGTC
CTGATCGCGA CCGGCTCGCG AATGAGCGTG CTGATTGGAT ATCTCATCGG CGCGGCGCTG
ATGCTGGCCG CCGCGCTGGC GCAATGGCTT TACGGCATTG CGGCCGAAGG CAGGCCTCTC
GAGGCTGTCG CTCAGCCCTT GTCTTCGCTC ACAGAATGA
 
Protein sequence
MAPVETVIPA RLDRLAWSWF HTLVVLALGV TWILDGLEVT LAGSVAGALK ASPRLQFSDA 
DVGLASTAYL VGAVAGALLF GWMTDRLGRR LMFFATLGLY LVGGAATALS WDLPSFCLFR
LLTGAGIGGE YSAINSTIQE MIPARYRGRT DLAINGSFWI GGALGAASSL ALLDPAIIDP
ETGWRLAFMI GSLIGLAVLV MRRFIPESPR WLVIHGRLDD ADRIMDEIEA SAGAGKAPET
LKPLRIRPRR FTPLREVFHT LFVVYRQRSL VGLALMAAQA FFYNAIFFTY ALVLTRFYGV
PANHIGWFIL PFALANFMGP LALGPLFDSL GRKPMISFTY AISGALLALS GVLFAFELVS
AAQLTIAWMV IFFFASAAAG AAYLTVSETF PVEIRALAIA VFYAAGTAIG GAGAPYLLGV
LIATGSRMSV LIGYLIGAAL MLAAALAQWL YGIAAEGRPL EAVAQPLSSL TE
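
The two sequences above are printed in blocks of ten characters, and the protein should be the translation of the gene. The Python sketch below shows one way to strip the block spacing, translate a coding sequence, and compute a GC fraction. It is an illustration rather than the tool used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons are permitted, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCGCCGG TCGAGACCGT GATTCCCGCG CGGCTCGATC GGCTCGCCTG GAGCTGGTTT"
print(translate(first_line))  # MAPVETVIPARLDRLAWSWF
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.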