Gene Mext_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1043 
Symbol 
ID5833664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1136119 
End bp1137366 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID641366838 
Productmajor facilitator transporter 
Protein accessionYP_001638519 
Protein GI163850476 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0605275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCCG AAGCGATTGA GAATCCGGGG TTGCCCGGCC CGCCCGAGGT CGCCGTGACG 
GCACCGTCCC AAGCGCGCGG ACAAGCTTCC CCAGCTCTGG CGGTGCTCCT GGGACTCAGC
CTCTCGCACC TCCTCAACGA TCTCGTGCAG TCGCTTCTGC CGGCGCTCTA TCCCCTGCTC
AAGGCAGGCT TCCACCTCGA TTTCGGGCAG ATCGGCCTCA TCACCTTCGT GTTCCAGGGG
ACGGCCTCGC TGCTCCAGCC GGCGGTCGGC CTCTACACCG ACCGGCGCCC GCTGCCCTAC
TCGCTGGCCA TTGGCATGGT GCTGTCGCTG GCCGGGCTCG CCCTGTTGTC GGTGGCGTCG
GCCTACGGGG CCCTGCTCGC CGCCGCCGCG CTGATCGGGC TCGGCTCCGC CATCTTCCAC
CCCGAGGCGA GCCGAGTGGC GCGGCTCGCC TCCGGTGGCC GTTACGGTCT GGCGCAATCG
GTGTTCCAGG TCGGCGGCAA TGCCGGCACG GCACTCGGGC CGTTGCTCGC CGCCTTCGTC
GTGGTGCCGC ATGGCCAGGG CAGCGTCGCG TGGTTCTGCC TCGCCGCGCT CGCCGGCATC
CTCGTGCTCG GCACGGTCGG CCGCTGGTAC GCGCAGCGGC TCGCCACGAC GCCGCGAACC
GCAGGGAAGA GCACGGGCGC CGCCTCCGGC CGCCTCAGCC GGGTGCGGAT CGTGGCGACG
ATCGCGATCC TGCTGGGGCT GATCTTCTCC AAGTACTTCT ACATGGCGAG CTTTTCGTCT
TACTACACCT TCTACTTGAT TCACCGCTTC GGCGTACCGG TGGCGCTCGC GCAGGTCTAC
CTGTTCGTCT TCCTCGGAGC GGTGGCGGCG GGGACGATTC TCGGCGGCCC CATCGGTGAC
CGATTCGGGC GCAAGCTCGT GATCTGGATC TCGATTCTCG GCGTACTGCC GTTCTCGCTC
GCCTTGCCGC ACGTGAATCT GTTCTGGACG GTGATTCTCT CGGTGCCGAT CGGACTGATT
CTGGCGTCCG CCATGCCGGC GATCCTGGTC TATGCACAGG AATTGTTGCC GGGCCGGATC
GGGCTCGTCG GCGGCCTGTT CTTCGGCTTC GCCTTCGGCA TGGGCGGCCT CGGCGCCGCG
CTGCTCGGGG AGATGGCCGA CCATGTCGGC ATCGAGCGGG TCTACGATCT CTGCGCCTTC
CTCCCCGCGC TGGGATTGAT GGCGGTGTTC CTGCCGCGGC TGCGGTGA
 
Protein sequence
MRAEAIENPG LPGPPEVAVT APSQARGQAS PALAVLLGLS LSHLLNDLVQ SLLPALYPLL 
KAGFHLDFGQ IGLITFVFQG TASLLQPAVG LYTDRRPLPY SLAIGMVLSL AGLALLSVAS
AYGALLAAAA LIGLGSAIFH PEASRVARLA SGGRYGLAQS VFQVGGNAGT ALGPLLAAFV
VVPHGQGSVA WFCLAALAGI LVLGTVGRWY AQRLATTPRT AGKSTGAASG RLSRVRIVAT
IAILLGLIFS KYFYMASFSS YYTFYLIHRF GVPVALAQVY LFVFLGAVAA GTILGGPIGD
RFGRKLVIWI SILGVLPFSL ALPHVNLFWT VILSVPIGLI LASAMPAILV YAQELLPGRI
GLVGGLFFGF AFGMGGLGAA LLGEMADHVG IERVYDLCAF LPALGLMAVF LPRLR