Gene Rmet_5039 details

Gene Information

Locus tag: Rmet_5039
Symbol:
ID: 4041901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 1725278
End bp: 1727113
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 66%
IMG OID: 637980460
Product: acetolactate synthase large subunit
Protein accession: YP_587170
Protein GI: 94313961
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
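
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1725278
END_BP = 1727113


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1836 611"
```

Under these assumptions the computed values agree with the Gene Length (1836 bp) and Protein Length (611 aa) fields.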


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.818274
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.707988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTTGT CTGCACCCGG CATCGGCAAT TGCTGTCCGG ATTCTTGCAT ACGATTAAGT 
ATACTTAAAA GTATTCTTTC GAGTATGCTC CATCGGACGC CGGCGCTTCC TTCACCCGGC
CGATCAATGG AGATCAAGCG GATGACCAAG ATGAACGGCG CCGAAGCGAT GGTGCGTATG
CTCCAGCTCA ACGGTGTGAA GCACATTTTT GGCTTGTGTG GCGATACCAG CCTGCCTTTT
TACGATGCTC TGGCCCGTCT GGACCATGGC ATGGACCACG TGCTGGCGCG CGACGAGCGC
AGCGCCGCGT ACATGGCGGA CGCCTACGCC CGCGTCACGG GCAAGGTCGG CGTGTGTGAG
GGTCCGAGTG GAGGAGGCGC CACATATCTG TTGCCGGGGC TGGTGGAGGC CAATGAATCG
TCGGTGCCGG TGCTGGGCAT TACGTCCGAT GTATCCGTCA ACTCGCGCGG CAAATACCCC
CTGACCGAGC TGGACCAGGA ATCCCTGTAT CGGCCACTGA CCAAGTGGAA CACCACGATC
GACCGTGCCG ATCAGATTCC CGACGCCGTG CGCGCTGCCT TCCGCGCCAT GACGACCGGC
AAGCCGGGTT CGGCGCATCT GTGCCTCCCC TATGACGTTC AGAAGCATGA CGTGGACCCG
GCCGGGATCT GGGCTCAGGC CGGTCACGAT CGTTTCCCGG CGCTGCGCTA CGCGCCGGAC
CCCGACGAGG TCGATCGCGC CGCGCGCCGT CTGACCGAAG CGCGCGCGCC GCTGATCATC
TGCGGCGGTG GCGTGGTGAT TTCGGGCGCC TGCGCTGAGC TGGACACGCT CGCGACCTCG
CTCAACGCGC CGGTGTGCAC CACGGTCAGC GGCCAGGGCA GCCTGGCCGA TACGCACCCG
CTCAACGCCG GCGTGGTAGG CGCCAATGGC GGCATCCCGG CCACCCGCGA TCTGGTGGCC
AATGCCGATG TCGTGCTGTT TATCGGTTGC CGCGCCGGCT CCACCACCAC CGAGCACTGG
CGCTTCCCCG GCCGCAACGT GCCGATCCTG CATATCGACA TCGATCCAAT GGTGATCGCG
GCCAACTACA ACACCGACGT TGGTATGGTG GGCGATGCCT TGCTGGCCCT TCGCATGTTG
AACGCCGCAG TGCGTGACCG ACTGCCGATG CGCCGTGCCG ATACCGCCGA CGGGCGCGCG
CTGGTGGAAG TCGCGCGCGC GGCCAAGCGT GCGAGCTTCG CACCGCTGGC GGCCTCGCTG
GAGCGGCCGA TCAAGCCCGA GCGCGTGGTC GATACGCTCA ACCGCCTCTT GCCCGAGGAC
GCCATCGTGG TGGCGGACCC TGGCACACCT TGCCCGTATT TCTCCGCCTA TCACGAGAGC
CGACGCGCCG GCCGGCAGTT CATCACCAAT CGCGCGCATG GTGCGCTGGG TTTCTCGCTG
GCGGCCGGTA TCGGCGCTTC GCTGGGTCGC CCTGGCACCA CGGTTGTCTC CGTGATGGGC
GATGGCAGTT TCGGCTTTAC CTGCGGCGAG ATGGAAACGC TGGTACGCCG CCGCATCCCG
CTGAAGATGA TCGTGTTCTC GAACTCGGTG TTCGGCTGGA TCAAGGCGAG CCAGAAGGCC
GGCTACGACC GCCGCTACTT CTCCGTGGAT TTCAGCCGCA CCGATCACGC GCGCGTGGCC
GAGGCCTTTG GCGTGCGCGC GTGGCGCGTG GAAGATCCCG CGATGCTCGA CGCAGCCATT
CGCGCCGCGC TGGAGCATGA CGGTCCCGCG CTGGTGGACG TCATCACGCA GGAGTTGCAG
GATGCCGCGG CACCGGTCAG CCAGTGGATG GGCTGA
 
Protein sequence
MPLSAPGIGN CCPDSCIRLS ILKSILSSML HRTPALPSPG RSMEIKRMTK MNGAEAMVRM 
LQLNGVKHIF GLCGDTSLPF YDALARLDHG MDHVLARDER SAAYMADAYA RVTGKVGVCE
GPSGGGATYL LPGLVEANES SVPVLGITSD VSVNSRGKYP LTELDQESLY RPLTKWNTTI
DRADQIPDAV RAAFRAMTTG KPGSAHLCLP YDVQKHDVDP AGIWAQAGHD RFPALRYAPD
PDEVDRAARR LTEARAPLII CGGGVVISGA CAELDTLATS LNAPVCTTVS GQGSLADTHP
LNAGVVGANG GIPATRDLVA NADVVLFIGC RAGSTTTEHW RFPGRNVPIL HIDIDPMVIA
ANYNTDVGMV GDALLALRML NAAVRDRLPM RRADTADGRA LVEVARAAKR ASFAPLAASL
ERPIKPERVV DTLNRLLPED AIVVADPGTP CPYFSAYHES RRAGRQFITN RAHGALGFSL
AAGIGASLGR PGTTVVSVMG DGSFGFTCGE METLVRRRIP LKMIVFSNSV FGWIKASQKA
GYDRRYFSVD FSRTDHARVA EAFGVRAWRV EDPAMLDAAI RAALEHDGPA LVDVITQELQ
DAAAPVSQWM G
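
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the amino acid assignments of NCBI translation table 11 (the same assignments as the standard code), strips the spaces that group the sequence into blocks of ten, and stops at the first stop codon. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and only the first and last lines of the gene sequence are used here to keep the example short.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 until the first stop codon or the end."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the final line of the gene sequence from the record.
first_bases = "ATGCCCTTGT CTGCACCCGG CATCGGCAAT"
last_line = "GATGCCGCGG CACCGGTCAG CCAGTGGATG GGCTGA"

print(translate(first_bases))  # MPLSAPGIGN
print(translate(last_line))    # DAAAPVSQWMG
```

Both fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the fraction to compare against the GC content field.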