Gene Rmet_1932 details

Gene Information

Locus tag: Rmet_1932
Symbol:
ID: 4038737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007973
Strand:
Start bp: 2100202
End bp: 2101962
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 63%
IMG OID: 637977315
Product: extracellular solute-binding protein
Protein accession: YP_584080
Protein GI: 94310870
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
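
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2100202
end_bp = 2101962

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1761 bp
print(protein_length, "aa")  # 586 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.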


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.142751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.615683
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCATG GCCACTGGCA CATCAATTGC TTGACGGAGC GATGTCCATC CACAGCCCAT 
TCCGTTTCAG GAGCCCCGCC CGTGACCTAC TCCACGCCCT CCCATTCGAA AGGCCTTTCC
GCCGCGATTC GCACTGTCTC CGCCGCCGTC ACGTTGTCGC TGCTGGCCCT GCCCGGCGCC
GCGATGGCCG AGAAGGTGCT CCGCATCGGC ATGACGGCCG CTGACATCCC GCGCACACTC
GGCCAGCCCG ACCAGGGTTT CGAAGGCAAC CGGTTCACCG GCATCCCGAT GTATGACGCA
CTCACCCAGT GGGACCTGTC GAAGAGCAAC GGCCCAAGCC TGCTGATTCC CGACCTGGCA
ACCGAATGGA AGGTCGACGA CAAGAACAAG ACCGAATGGA CGTTCAAGCT GCGCCCAGGC
GTGAAGTTTC ACGACGGCTC GCCGTTCAAT GCCGATGCCG TGGTCTGGAA CGTCAACAAG
GTGCTCGACA AGACCGCGAA GCAGTTCGAC CCCAGCCAGG TTGGCGTCAC CGCTTCGCGC
ATGCCCACGT TGCGTAGCGC GAAGAAGATC GATGACATGA CCGTGCAACT CGTCACGTCA
GAGCCGGATT CGTTCCTGCC GTACAACGTG TCGAACCTGT TCATGGCGTC GCCGACCCAG
TGGGAGAAGA AGTACGCGGC GGTCCCTGCC TCCGTGACCG AGCCGGCGGA GCGCAGCAAG
CAGGCATGGG TGGCCTTTGC GGCAGATGCA TCGGGTACGG GCCCATTCAA GATGACCCGA
TTCGTCCCGC GCGAACGCCT GGAGCTTGCC AAGAACCCCG CCTACTGGGA CAGCAAGCGG
GTGCCAAAGA TCGACAAGGT CGTGATGCTG CCGATGCCCG AGGCCAATGC ACGCACCGCG
GCACTGCTGT CAGGCCAGGT GGACTGGATC GAGGCCCCGG CACCTGATGC CATCGCGCAG
ATCAAGAGCC GTGGCTTCGA TGTGTACGCC AACGCGCAGC CGCACATGTG GCCATGGCAA
TTGTCATTCG CGCCGAATTC CCCGTGGCTC GACAAGCGCG TGCGCCAGGC CGCGAACCTT
TGCGTGAATC GCGCCGGCCT GAAGACACTG CTCGGCGGCT ACATGGCCGA GGCCAAGGGC
ATCGTCGAAG CCGGAAACCC GTGGTGGGGC AATCCGGCAT TCACCATCAA GTACGACCCC
GCGGCCGCGC GCAAGCTGAT GACGGAGGCC GGATACTCCG CCGCCAAGCC GGTGAAGGTA
AAGGTCCAGG TATCCGCATC GGGCTCCGGC CAGATGCAGC CGCTGCCCAT GAACGAGTAC
GTACAGCAAA ACCTCAAGGA GTGCTTCTTC GACGTGGACT TTGACGTCGT CGAATGGAAC
ACGCTGTTCA CGAACTGGCG CATTGGCGCG AAGGACGCGA GCGCGCATGG CGCCAACGCG
ATCAACGTGA GCTTTGCGGC GATGGACCCT TTCTTTGCCA TGGTCCGCTT CGTCAGCACG
AAGACGCAGC CGCCGGTTTC GAACAACTGG GGCTACTTCG GCAACGCGGA GTTTGACAAG
CTGATCGAGA CCGCGCGCAC CTCGTTTGGC GAAAAGGAAC GAGATGCCGC GCTGGCAAAA
CTCCACGCGC GCATCGTTGA AGAAGCGCCA TTCGTGCTGA TCGCCCACGA TGTGGGCCCG
CGAGCGATCT CCCGGAAGAT CAAGGGGGTG GTGCAGCCGC AGAGCTGGTT CATCGATATC
GCGACGATGT CTATCGATTG A
 
Protein sequence
MVHGHWHINC LTERCPSTAH SVSGAPPVTY STPSHSKGLS AAIRTVSAAV TLSLLALPGA 
AMAEKVLRIG MTAADIPRTL GQPDQGFEGN RFTGIPMYDA LTQWDLSKSN GPSLLIPDLA
TEWKVDDKNK TEWTFKLRPG VKFHDGSPFN ADAVVWNVNK VLDKTAKQFD PSQVGVTASR
MPTLRSAKKI DDMTVQLVTS EPDSFLPYNV SNLFMASPTQ WEKKYAAVPA SVTEPAERSK
QAWVAFAADA SGTGPFKMTR FVPRERLELA KNPAYWDSKR VPKIDKVVML PMPEANARTA
ALLSGQVDWI EAPAPDAIAQ IKSRGFDVYA NAQPHMWPWQ LSFAPNSPWL DKRVRQAANL
CVNRAGLKTL LGGYMAEAKG IVEAGNPWWG NPAFTIKYDP AAARKLMTEA GYSAAKPVKV
KVQVSASGSG QMQPLPMNEY VQQNLKECFF DVDFDVVEWN TLFTNWRIGA KDASAHGANA
INVSFAAMDP FFAMVRFVST KTQPPVSNNW GYFGNAEFDK LIETARTSFG EKERDAALAK
LHARIVEEAP FVLIAHDVGP RAISRKIKGV VQPQSWFIDI ATMSID
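
The sequences above are laid out in space-separated blocks of ten characters. The following Python sketch shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical and chosen for illustration. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The example uses only the first 60 bases of the gene sequence listed above.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
# Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence listed in this record.
fragment = clean_sequence(
    "ATGGTGCATG GCCACTGGCA CATCAATTGC TTGACGGAGC GATGTCCATC CACAGCCCAT"
)
print(translate(fragment))  # MVHGHWHINCLTERCPSTAH
print(gc_content(fragment))
```

The translated fragment matches the first 20 residues of the protein sequence above. Applying the same helpers to the full gene sequence would allow a comparison against the reported GC content and protein length.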