Gene Rmet_5244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5244 
Symbol 
ID4042105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp1942311 
End bp1943852 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content68% 
IMG OID637980662 
Productmajor facilitator superfamily transporter 
Protein accessionYP_587372 
Protein GI94314163 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.249195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTC CGCAAGCGCC CTCCGCCCCG CATTCACCCC CAGACGTGCC CGTGATGCAG 
GTTATCTTCG GCCTGATGCT GGCCATCATG CTGGGTGCCC TCGAACAATC GATCGTGGCC
GTGGTCCTGC CCGAGATCGC CTTGCAGCTC AACGGTTTCG AGGACATGGC GTGGGTGATC
TCCGCCTATC TGGTGGCGTC CACCGTGGTT ACGCCGATCT ACGGCAAGCT GTCCGACGTG
CTCGGGCGCC GCTCGGTGCT GACGTTCTCG ATCGTGCTGT TCGTGCTGGC GTCGATCGCC
TGCGCGATGG CTACGACGAT GCCGATGCTG ATCGTTGCGC GCATTCTGCA GGGCCTTGGC
GGTGGCGGCC TGATCTCGGT ATCGCAAGCA ACGATCGCCG ACGTGGTGCC ACTACGTGAA
CGCGGCAAGT ACCAGGGCTA CGTCAGCGGC GTCTGGGCCG TGGCCAGCAT GGCCGGGCCC
GTGATTGGCG GCTACCTTGC GCACTTCCTC TCATGGCGCT GGATTTTCTG GATCAACATT
CCGCTGGGGC TGATCGCGCT GATCGTGGTG CGCCGGGCGC TGCGGCATCT GCCGGTCAGC
GGACGCAAGC ATCGCATCGA CTACCTTGGC GCGCTGCTGT TTGGTGGCGG CCTGTCCGGC
GTGCTGGTGT TCCTGACGCG GATGGGGCAG GGCCACTCGC CGCTGGAGCC GCAGACGATC
GGCCTGCTGG CTGCCGGGCT GATCGGTCTG GTGCTTTTCA TCTGGCAGGA GCGGCGTGCC
GTCGACCCGG TGATTCCGCT GAAGATGCTC GCCGTGCCGA CCGTCGCGAT CTGCTGCCTG
ACGCTGTTCC TGTGCTTCTT CCAGTTGATC GCGATGTCGG TGCTGCTGCC GCTGCGCTTC
CAGGTCGTGG GCGGCGCGGG GGCCGACACC GCCGCGCTGC GGCTGGTGCC GCTGACGCTG
GCCATTCCGT TCGGCGCCTA TGCGTCGGGC CGGCTGATGT CGTGGAGCGG GCGTTACAAG
CCGCTGCAAC TGGCTGGCTG TCTGGTGGCG CCGATTGCCA TCGTCGGGCT GGCCTTTGTG
CCGCCGCAGG CGGTGGTGCC GGCCGCGCTG GTGATGATCG TCCTGGGCCT GTCAATCGGC
CTGCAACTGC CGAGCGGGCT GGTGGCCACG CAGAACTCGG TGCCGCCGCA GCAGGTTGGC
ATCGCCACCG CGCTGACCGC CTTCTCGCGC CTGCTTGGTG GCGCAGTGGG CGTGGCGGTG
CTGACCACGG TGCTGATCGC GCTGCTACGC CATAGCGGCA TGGCGGTCAG TGACCTGCAT
GGCGGCGAGG ATGTGCTGAT GAGCATGTTC CGGCGGGCCA TGGATGCGGG GGATCAGGGC
GATGCTGCCG CGGTGCGCGA GGCGGCGGAG CATGCCTTCC GCCTGCTGTT CCTGATGAGC
GCGGGCGTGT CGTTGATCGC GCCGTTCTTC GTCATGCGGC TCAAGGAGAA GACGCTGCGC
GGCAGCCCGG CCGGGGCAGC GGCGTCCGCC GCTGCCGAGT GA
 
Protein sequence
MSAPQAPSAP HSPPDVPVMQ VIFGLMLAIM LGALEQSIVA VVLPEIALQL NGFEDMAWVI 
SAYLVASTVV TPIYGKLSDV LGRRSVLTFS IVLFVLASIA CAMATTMPML IVARILQGLG
GGGLISVSQA TIADVVPLRE RGKYQGYVSG VWAVASMAGP VIGGYLAHFL SWRWIFWINI
PLGLIALIVV RRALRHLPVS GRKHRIDYLG ALLFGGGLSG VLVFLTRMGQ GHSPLEPQTI
GLLAAGLIGL VLFIWQERRA VDPVIPLKML AVPTVAICCL TLFLCFFQLI AMSVLLPLRF
QVVGGAGADT AALRLVPLTL AIPFGAYASG RLMSWSGRYK PLQLAGCLVA PIAIVGLAFV
PPQAVVPAAL VMIVLGLSIG LQLPSGLVAT QNSVPPQQVG IATALTAFSR LLGGAVGVAV
LTTVLIALLR HSGMAVSDLH GGEDVLMSMF RRAMDAGDQG DAAAVREAAE HAFRLLFLMS
AGVSLIAPFF VMRLKEKTLR GSPAGAAASA AAE