Gene Mext_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1916 
Symbol 
ID5835220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2132893 
End bp2134056 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content74% 
IMG OID641367716 
Productmajor facilitator transporter 
Protein accessionYP_001639386 
Protein GI163851343 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAC GGGGCAGGCT GACCGTCATC TCGGCCCTGG GCGTGGTGGA GATACTCGCC 
TGGGGCTCAT CCTTCTACCT GCCGGCGGTG CTCGCCGGCC CCATCGCGGC CGACACGGGC
TGGCCGCTGG CGTGGGTGGT CGGCGGCTTG TCCATCGGAC TTCTTGTGGC GGCGGTCGCC
TCGCCCCGCG TGGGCATCGC CATACAACGT CACGGGGGTC GGCCGGTCCT GGCGCTCGCC
GCCGTCCTGC TAGCGGTCGG CCTCGCCGCG CTCGGCCTGG CGCCGAACCT GCCGGCCTTC
CTCGCCGGCT GGCTGGTCGT CGGCCTCGGC ATGGGCTGCG GACTGTACGA TCCGGCCTTC
GCCACGCTTG GGCGCCTCTA CGGTTCCGAG GCGCGACCGG CCATCACGAC GCTGACCCTG
TGGGGCGGCT TCGCCAGCAC GGTCTGCTGG CCGCTCTCGG CCTTCCTCGT GGAGCAGGTC
GGCTGGCGCC ATGCCTGCCT CGCGTATGCC GGCCTCCACC TCCTGGTCAC CCTGCCGCTC
GTGCTCGGAC TTATCCCGAG GGCGCCGGCG GCGGAAGCCG CGCGGGGAGA GGTGCACCAC
CGCGGCGGGA TGCTCACAGC CAGGGAGCGG CGCGCCTTCC TGTTGATGGC GGGCGTGCTG
GTCCTCGGCG GCGCGGTCAT GACCTTGGTC TCGGTGCACC TCATCACGCT GCTGCAGGCC
CGGGGCGTGG CGCTCGCCGC GGCCGTGTCC TACGGCGCGC TGATCGGCCC CGCGCAGGTC
GGCGCCCGGA TCGTCGAGAT GGCTGGCAAG GGCAGGCATC ACCCGCTCTG GACCCTGACC
GCGGCCATGG TCCTCGTGGC GGCCGGCCTG GCCGTCCTGG CGGCGGGGAT ACCCGCGGTC
GGGCTCGCCC TCGTGCTCTA CGGGGCGGGC AACGGCATCT ACTCCATCGC CCGGGGAACG
GTACCGCTCT CGCTGTTCGG GCCTGAGCGC TACGCGACGC TGGTCGGGCG GCTCGCCCGT
CCGGGCCTGG CGGCGCAGGC GCTCGCCCCG TCGCTCGGGG CCGCAGCGCT GGCCTATGGT
GGTGCGGACA CGGCCTACGT CCTGCTCCTG GCGCTCGCAC TGGCGAACGT TGTGCTCGTG
GCAGCCCTTT GGGGCGCCCG GTGA
 
Protein sequence
MSGRGRLTVI SALGVVEILA WGSSFYLPAV LAGPIAADTG WPLAWVVGGL SIGLLVAAVA 
SPRVGIAIQR HGGRPVLALA AVLLAVGLAA LGLAPNLPAF LAGWLVVGLG MGCGLYDPAF
ATLGRLYGSE ARPAITTLTL WGGFASTVCW PLSAFLVEQV GWRHACLAYA GLHLLVTLPL
VLGLIPRAPA AEAARGEVHH RGGMLTARER RAFLLMAGVL VLGGAVMTLV SVHLITLLQA
RGVALAAAVS YGALIGPAQV GARIVEMAGK GRHHPLWTLT AAMVLVAAGL AVLAAGIPAV
GLALVLYGAG NGIYSIARGT VPLSLFGPER YATLVGRLAR PGLAAQALAP SLGAAALAYG
GADTAYVLLL ALALANVVLV AALWGAR