Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4101 |
Symbol | |
ID | 5831410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4561604 |
End bp | 4563181 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641369892 |
Product | EmrB/QacA family drug resistance transporter |
Protein accession | YP_001641542 |
Protein GI | 163853499 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.420872 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCCA GCGCGACCCT CGCCGCAAGC CCTGCCGCCG ACCCGCCCCT CGACCGCCGC CGCATGGTGG CGTTCCTGTG CATGGTGTTC GGGATGTTCA TGGCGATCCT CGACATCCAG ATCGTCTCGG CCTCGCTCAA CGAGATCCAG GCCGGCCTCT CGGCCTCCGG CGACGAGATC CCCTGGGTGC AGACGAGCTA CCTCATCGCC GAGGTCATCT CGATCCCGCT CTCGGGCACC CTGTCGCGGG TGCTCTCGAC GCGCTGGATG TTCTCGATCT CGGCCGCCGG CTTCACGCTG ATGAGCCTGA TGTGCGCCAC CTCCTCGTCG ATCGGCGAGA TGATCGTCTG GCGCGCGCTC CAGGGCTTCA TCGGCGGCGG CATGATCCCG ACCGTGTTCG CCGCGGCGTT CACGATCTTT CCGCCGTCCA AGCGCTCGAT CGTCTCGCCG ATGATCGGCC TCGTCGCGAC GCTGGCCCCC ACCATCGGCC CAACCATCGG CGGCTACCTC ACCGATCTGT TCTCCTGGCA CTGGCTGTTC CTCATCAACA TCGTGCCGGG CATCTTCGTC ACGATCTCGA CCTTCCTCCT GATCGATTTC GACCGGCCGA ACTTCGACCT GCTCAAGTCC TTCGACTGGG CCGGGCTCGC CTTCATGGCG GGCTTCCTCG GCTGCCTCGA ATACGTGCTG GAGGAGGGCC CGAACCACGA CTGGCTGCAG GACGAGGCGG TGTTCGTCTG CGCCCTCGTC TGCGTCGTGT CGGGCTTGGC CTTCTTCGCC CGCGTCTTCA CCGCGCGCCA GCCGATCGTC GACCTGCGCG CCTTCTCGGA CCGCAACTTT GCCGCCGGCT GCGTCTTCAG CTTCGTGATG GGCATCGGCC TCTACGGCCT GACCTACCTC TACCCGGTCT ATCTCGCCCG CGTGCGCGGC TACTCGGCGC TGCAGATCGG CGAGACGATG TTCGTCTCAG GTCTGTGCAT GTTCGCCACC GCCCCGATCG CGGGAAAGCT CTCCGCCAAG CTCGATCCGC GCATCATGAT GGCGATGGGC TTCTCCGGTT TTGCCGTCGG CACCTGGATC GTCACCGGGC TGACCAAGGA CTGGGACTTC TGGGAGCTGC TGTGGCCGCA GGTGCTGCGC GGCTGCTCCC TGATGCTGTG CATGATCCCG ATCAACAACA TCGCGCTCGG CACCCTGCCG CCGGAGCGGA TGAAGAACGC GTCCGGCCTG TTCAACCTCA CGCGCAACCT CGGCGGCGCG GTCGGCCTCG CCCTCATCAA CACGGTGCTG AACGCCCGCT GGGACCTCCA TCTCGCGCGC CTGCACGAGC GCTTCACCTG GGCCAACAGC GCCGCGCTGG AACGCCTCGA CGCCATGCGG CGCCAGTTCG AGGTGTTCGG GGGCGATGCC AACGGCATGG CGCTGAAGGC GCTCAACAAC ACCGTGCGGA TTCAAGGCTT GGTGATGAGC TTCGAGGACG TGTTCCTCGT CCTCACCGTG CTGTTCCTGG CCATGGCCTG CGGCACGCCG TTGATCCGAC GTCCGCGCGC GGCGGCGCCG GCCGGCGCGG GGCATTGA
|
Protein sequence | MAASATLAAS PAADPPLDRR RMVAFLCMVF GMFMAILDIQ IVSASLNEIQ AGLSASGDEI PWVQTSYLIA EVISIPLSGT LSRVLSTRWM FSISAAGFTL MSLMCATSSS IGEMIVWRAL QGFIGGGMIP TVFAAAFTIF PPSKRSIVSP MIGLVATLAP TIGPTIGGYL TDLFSWHWLF LINIVPGIFV TISTFLLIDF DRPNFDLLKS FDWAGLAFMA GFLGCLEYVL EEGPNHDWLQ DEAVFVCALV CVVSGLAFFA RVFTARQPIV DLRAFSDRNF AAGCVFSFVM GIGLYGLTYL YPVYLARVRG YSALQIGETM FVSGLCMFAT APIAGKLSAK LDPRIMMAMG FSGFAVGTWI VTGLTKDWDF WELLWPQVLR GCSLMLCMIP INNIALGTLP PERMKNASGL FNLTRNLGGA VGLALINTVL NARWDLHLAR LHERFTWANS AALERLDAMR RQFEVFGGDA NGMALKALNN TVRIQGLVMS FEDVFLVLTV LFLAMACGTP LIRRPRAAAP AGAGH
|
| |