Gene Rmet_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_1966 
Symbol 
ID4038771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007973 
Strand
Start bp2141794 
End bp2143104 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content64% 
IMG OID637977349 
Producthomoserine dehydrogenase 
Protein accessionYP_584114 
Protein GI94310904 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000284917 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.216212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCCA TCAAAGTCGG CCTTCTCGGC ATCGGTACCG TCGGTAGCGG CACGTTCAAC 
GTGCTCAAGC GCAATCAGGA GGAAATTCGT CGCCGTGCAG GCCGCGGCAT CGAGATTGCG
GTGGTGGCTG ACCTGAACAC CGAGCGCGCC CGCGAGCTGA CCGGTGGGAC GGTGGACGTC
GTCAGCGATG CGAACGACGT GGTGACGCGT CCGGACATCG ACATCGTCAT CGAGCTGATC
GGCGGCTATG GCATCGCCCG CGAGTTGGTG CTCAAGGCGA TCGAGAATGG CAAGCACGTG
GTCACCGCCA ACAAGGCGCT GCTGGCCGTG CATGGCAACG AGATTTTCGA GGCCGCGCGC
AAGAAGGGCG TGATCGTCGC CTTCGAGGCG GCAGTGGCGG GTGGCATCCC CATCATCAAG
GCGCTGCGCG AAGGCCTGAC CGCGAACCGC ATCCAGTGGA TCGCCGGCAT CATCAACGGC
ACGACGAACT TCATCCTGTC CGAGATGCGC GACAAGGGTC TGGATTTCGA TACCGTGCTC
AAGGAAGCGC AGCAACTGGG CTATGCCGAG GCCGATCCGA CCTTCGACAT CGAAGGCGTC
GACGCCGCGC ACAAGGTCAC GCTGATGAGC GCGATCGCAT TCGGTATGCC GGTGCAGTTC
GACCGCGCCC ACGTGGAAGG CATCACCAAG CTGTCGGCCA TCGATATCAA ATACGCCGAG
GAACTGGGTT ATCGCATCAA GCTGCTCGGC ATCACCCGCC GCCGCGAGGA AGGCGTGGAA
CTGCGCGTGC ACCCGACGCT GGTGCCGGCC TCGCGCCTGA TCGCCAACGT GGAAGGCGCG
ATGAACGCCG TGCTGGTGCA GGGCGATGCC GTGGGCGCCA CTCTGTACTA CGGCAAGGGC
GCCGGCGCCG AGCCGACCGC CTCGGCCGTG ATTGCCGATC TGGTCGACGT GACCCGCCTG
CACACCGCCG ATCCGAACCA CCGCGTACCG CACCTGGCAT TCCAGCCGGA CGAGCTGTCG
AACGTGCCCG TGCTGCCGAT CGACGAAGTC ACTAGCTCGT ACTACCTGCG TATGCGTGTG
TCGGACGAAA CTGGCGTGCT GGCAGAGATC ACGCGCATCC TGGCGGAAGC CGGCATCAGC
ATCGACGCGA TGCTGCAGAA GGAATCGCGC GAAGGCGAGC CGCAGACCGA CATCATCATC
CTGACGCACC TGACGCGCGA GAAGCACGTC AATGCCGCGA TTCGCAGCAT CGAAGCGCTC
CAGACCGTGC TGTCGCCGGT CACGCGCCTG CGCATGGAAG AACTGAACTG A
 
Protein sequence
MNPIKVGLLG IGTVGSGTFN VLKRNQEEIR RRAGRGIEIA VVADLNTERA RELTGGTVDV 
VSDANDVVTR PDIDIVIELI GGYGIARELV LKAIENGKHV VTANKALLAV HGNEIFEAAR
KKGVIVAFEA AVAGGIPIIK ALREGLTANR IQWIAGIING TTNFILSEMR DKGLDFDTVL
KEAQQLGYAE ADPTFDIEGV DAAHKVTLMS AIAFGMPVQF DRAHVEGITK LSAIDIKYAE
ELGYRIKLLG ITRRREEGVE LRVHPTLVPA SRLIANVEGA MNAVLVQGDA VGATLYYGKG
AGAEPTASAV IADLVDVTRL HTADPNHRVP HLAFQPDELS NVPVLPIDEV TSSYYLRMRV
SDETGVLAEI TRILAEAGIS IDAMLQKESR EGEPQTDIII LTHLTREKHV NAAIRSIEAL
QTVLSPVTRL RMEELN