Gene M446_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0517 
Symbol 
ID6129216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp615084 
End bp616298 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content78% 
IMG OID641640839 
Productmajor facilitator transporter 
Protein accessionYP_001767514 
Protein GI170738859 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.159005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00108561 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGCGCCC GCCCCCGGCT GGCGGTGGTC AGCGCCCTCG GGGTGGTCCA GATCCTGACC 
TGGGGCACGT CGTACTACCT GCTCACGGTG CTGGCGCCGC CGATCGCCGC CGAGACGGGC
TGGCCGCTGC CCTGGATCGT GGGCAGCCTG TCCGCCGGCC TGCTCGTCTC CGGCCTGGTC
TCGCCGCGGG TCGGCCGGGC GATCGGCCGC TCGGGCGGCC GCCCGGTCCT GGCGCTCGCC
ATCCTGGTGC TGGCGCTCGG CCTCACGGTG ATCGGCCTCG CGCCGAGCCT GCCGGTCTTC
TTCGCCGGCT GGCTGATCCT GGGCGCCGGC ATGGGCGGCG GCCTCTACGA CGCCGCCTTC
GCCACCCTGG GGCGCCTCTA CGGCGCGGGC GCCCGGCCGG CGATCTCGGC CCTGACCCTG
TGGGGCGGCT TCGCCAGCAC GGTCTGCTGG CCGCTCTCGG CCGTCCTGGT GGCGCAGCTC
GGCTGGCGGG GCACCTGCCT GGCCTATGCG GGCCTCCTCC TCGCCCTTGC CCTGCCCCTG
GTCCTCCTCG TCATCCCGGC CCCGCCGCCG CTCCCGGAGA CGGCCGCGGC CCCGGCGGGG
AGCGCCGCCC TGGCGCCCGG CGAGCGGCGG GCCTTCCTGT GCCTCGCCGG CATCCTGACG
CTCGGCGGGG CGACGACCGC GATCGTCTCG GTGCACCTCC TCACCCTGCT CCAGGCGCGC
GGCGCGTCCC TCGCGGCGGC GGTCGCCCTC GGGGCCGTGG TCGGCCCCGC CCAGGTCGGC
GCGCGGGTGA TCGAGATGGC GAACCGGGGC CGGCACCACC CGATCTGGAC CCTCACCGCC
GCCATGGGCC TGATCGCGGC CGGGCTCGTC CTGCTGGCGT TCGGCGTCGC GTGGCCGGCC
CTCGCGCTCG TCCTCTACGG CGCCGGCAAC GGCGTCTACT CGATCGCGCG GGGCACGCTG
CCGCTCGCCC TGTTCGGTCC GCTGCGCTAC GCGCCGCTCG TGGGCCGGCT CGCCCGGCCG
AACCTCGTCG CCCAGGCTCT GGCCCCCTCC GCCGGCGCCT TCGTGCTCGC GGCGGCGGGC
GCGCAGGCGG CCCTCGCGCT CCTGACCGGG CTCGCCCTCG CCAACCTCGT TCTGGTGGCG
GCGCTCTGGC GGGAGCGCCG CGCCGCAGGG GAGGGCGCGC CCGCCGGCGA ACCGGGATCC
CGGCCGGATC GTTGA
 
Protein sequence
MSARPRLAVV SALGVVQILT WGTSYYLLTV LAPPIAAETG WPLPWIVGSL SAGLLVSGLV 
SPRVGRAIGR SGGRPVLALA ILVLALGLTV IGLAPSLPVF FAGWLILGAG MGGGLYDAAF
ATLGRLYGAG ARPAISALTL WGGFASTVCW PLSAVLVAQL GWRGTCLAYA GLLLALALPL
VLLVIPAPPP LPETAAAPAG SAALAPGERR AFLCLAGILT LGGATTAIVS VHLLTLLQAR
GASLAAAVAL GAVVGPAQVG ARVIEMANRG RHHPIWTLTA AMGLIAAGLV LLAFGVAWPA
LALVLYGAGN GVYSIARGTL PLALFGPLRY APLVGRLARP NLVAQALAPS AGAFVLAAAG
AQAALALLTG LALANLVLVA ALWRERRAAG EGAPAGEPGS RPDR