Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4750 |
Symbol | |
ID | 5834554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5305706 |
End bp | 5306920 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370547 |
Product | major facilitator transporter |
Protein accession | YP_001642189 |
Protein GI | 163854146 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGACG ACAGCGACCG GTTCATCGAG GCCGGGACGC CGACCTTTCG CCGGGCGACG CTCGCCCTGT TCGCAGCGGG GTTCTCCACC TTCGCGGTGC TCTACGGCGT CCAGCCGCTG CTGCCGATCT TTCACGACAC CTTTGCGGTC TCCCCGGCCG AGAGCAGCCT CGCCCTGTCG CTGCCCTGTG CCACGCTCGC GATCGCGCTG CTGATCGTCA GCCCGCTCTC CGAGGTGTGG GGGCGCAAGC GGGTGATGGC GGTCTCGCTG TTCGCCTCGG CCCTGCTCAC GATCGGCGCC GTCCTGATGC CGAGCTGGCA TGGATTCCTG GTGCTGCGGG CGCTCACCGG CATCGCCGCG AGCGGCTTGC CCGCCGTCGC CATGGCCTAT CTCAGTGAGG AGATGCACGG CCGCGCCATC GGCCTGTCGA TGGGGCTGCT GATCGGCGGC AACGCGCTCG GCGGCATGGT CGGGCGGCTG CTCGCGGGGG TGATCGCCGA TCACGCCTCT TGGCGCTGGG GCCTCGGCAT CATCGGGGTG CTGGCTCTGT TCGCGGCTCT GGCCTTCCAG TTCGCCCTGC CACCCTCGCG GCACTTTCTC GCCCACCGGA TGCGCTGGCG CGAGGTGCCG GGCACCTTCA CCCACCATTT CCGGGATGCG GGCCTGCCCT GGCTGTTCTT CGAGCCGTTC CTGCTGATGG GCGGATTCGT CTGCGTCTAC AACTACATCG GCTTCCGCCT GCTCGATCCG CCGTTCTCGT TGAGCCAGAC GGTGATCGGC CTGATCTTCA TCGTTTACCT CGCCGGCACC GCGAGTTCGG CCATCACCGG CCACATCGCC GGGGTGCTGG GGCGGCGCAA GGTGCTGTGG CTGGCGATCC TGCTCGGAAT CGGCGGCATC GCCCTGACGC TGGCGGACAA CCTCGTTCTC ATCATCGGTG GCATCGTCGT CGTCACGGTC AGCTTCTTCG GCGCGCATTC GGTGGCGTCG AGCTGGGTCG GGCGCCGGGC CTTGAGCGAC CGGGCACAGG CCTCCGCGAT CTATCTCTGC ATGTACTATC TCGGCTCGTC CCTGCTCGGC ACGGCGGGCG GCTGGTTCTT CCTGCATTAC GGCTGGCCCG GCGTCGCGGG CTTCTTCGGC AGCCTCTACG TGGCAGCTTT GCTCATCGCG CTGCGCCTGA CGCGGCTCGC GCCACTCCCG CCTACGGGCG GCTGA
|
Protein sequence | MADDSDRFIE AGTPTFRRAT LALFAAGFST FAVLYGVQPL LPIFHDTFAV SPAESSLALS LPCATLAIAL LIVSPLSEVW GRKRVMAVSL FASALLTIGA VLMPSWHGFL VLRALTGIAA SGLPAVAMAY LSEEMHGRAI GLSMGLLIGG NALGGMVGRL LAGVIADHAS WRWGLGIIGV LALFAALAFQ FALPPSRHFL AHRMRWREVP GTFTHHFRDA GLPWLFFEPF LLMGGFVCVY NYIGFRLLDP PFSLSQTVIG LIFIVYLAGT ASSAITGHIA GVLGRRKVLW LAILLGIGGI ALTLADNLVL IIGGIVVVTV SFFGAHSVAS SWVGRRALSD RAQASAIYLC MYYLGSSLLG TAGGWFFLHY GWPGVAGFFG SLYVAALLIA LRLTRLAPLP PTGG
|
| |