Gene Rmet_3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_3950 
Symbol 
ID4040808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp507040 
End bp508350 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content65% 
IMG OID637979374 
Productmajor facilitator protein family permease 
Protein accessionYP_586087 
Protein GI94312878 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCT CCAGCAGTAC TGCCGGGGCT TCGGGTGCGC AAGGCGGCCA TGCCGCTCGC 
CCGCTGACCG GTCAGGATTA CAAGACGCTC GCGCTTGCCG CACTTGGCGG CGCGCTCGAG
TTCTACGATT TCATTATTTT TGTTTTCTTC GCCAAGGTCA TTGGCCAGTT ATTCTTCCCG
GCATCCGTGC CGGACTGGCT GCGCATGCTG CAGACCTTCG GCATCTTCGC GGCCGGCTAC
CTGGCACGCC CGCTGGGCGG CATCATCATG GCGCACTTCG GTGACCTGCT CGGCCGCAAG
AAAATGTTTA CGTTGTCGAT TCTGCTGATG TCGGTACCGA CGCTGCTGAT GGGCCTTCTG
CCGACCTACG CTTCGGTGGG CCTGCTGGCG CCGATGGCGC TGCTGGTGCT GCGCATCCTG
CAGGGTGCCG CCGTGGGCGG CGAAGTCCCC GGCGCATGGG TGTTCGTGTC CGAACACGTG
CCCCGGCGTC ACGTCGGATA TGCCTGCGGC ACGCTGACTG CAGGTCTGAC GGCCGGCATC
CTGCTCGGCT CGCTGGTAGC CACAGGCATC AACGCCGTAT TCGATCCGCA GGAACTCGTT
GACCATGGCT GGCGCGTGCC GTTCGTGCTG GGCGGCGTGT TCGGTGTCGG CTCGATGTAC
CTGCGCCGCT GGCTCCATGA AACCCCGGTC TTCGCGGAAC TGCAGCAACG CAAGGCGCTG
GCCGCCGAAC TGCCGCTCAA GACGGTCGTG CGCGACCACC GCGGCGCCGT TGTCATCTCG
ATGCTGCTCA CGTGGATGCT GTCCGCTGGC ATCGTGGTGG TGATCCTGCT GACGCCGACC
TACCTCCAGA CGCTGTACGG CTTCGACGCC CGCACCGCGC TGGTGGCCAA CAGCGCCGCA
ACGCTGTGCC TGTCGATCGG CTGCGTGGTG GCCGGCATCC TGGCTGACCG CATTGGCGCG
CGCCTGACAC TTTCGATCGG TGGCGCACTG CTGGCTGCCA CGGCCTGCGT GCTGTACACG
ACGATCGGCA CCCGTCCCGA CCTGCTGCTG CCGCTGTACG CCCTGGCCGG CTTCTTCGTC
GGCACGATTG GCGCCGTGCC TTACGTGCTG GTGCACGCGT TCCCAGCGCA AGTGCGGTTC
TCCGGGCTGT CCTTCTCGTA CAACGTCTCG TACGCGATCT TCGGCGGACT GACGCCGGTG
ATCGTCTCGC TGATGCTCAA GAATGATTCG CTGGCCCCGG CCCACTATGT GGTTGGCGTC
TGCATCATGG GTATCGTGAC GGCGCTGTTC GTACGCAAGC GGCACGCCTG A
 
Protein sequence
MSTSSSTAGA SGAQGGHAAR PLTGQDYKTL ALAALGGALE FYDFIIFVFF AKVIGQLFFP 
ASVPDWLRML QTFGIFAAGY LARPLGGIIM AHFGDLLGRK KMFTLSILLM SVPTLLMGLL
PTYASVGLLA PMALLVLRIL QGAAVGGEVP GAWVFVSEHV PRRHVGYACG TLTAGLTAGI
LLGSLVATGI NAVFDPQELV DHGWRVPFVL GGVFGVGSMY LRRWLHETPV FAELQQRKAL
AAELPLKTVV RDHRGAVVIS MLLTWMLSAG IVVVILLTPT YLQTLYGFDA RTALVANSAA
TLCLSIGCVV AGILADRIGA RLTLSIGGAL LAATACVLYT TIGTRPDLLL PLYALAGFFV
GTIGAVPYVL VHAFPAQVRF SGLSFSYNVS YAIFGGLTPV IVSLMLKNDS LAPAHYVVGV
CIMGIVTALF VRKRHA