Gene Information |
Locus tag | Rmet_4047 |
Symbol | |
ID | 4040905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 614798 |
End bp | 615787 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637979471 |
Product | extra-cytoplasmic solute receptor |
Protein accession | YP_586184 |
Protein GI | 94312975 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
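The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name `feature_lengths` is hypothetical and is not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


# Start bp and End bp as listed for Rmet_4047.
gene_len, protein_len = feature_lengths(614798, 615787)
print(gene_len, protein_len)  # 990 329
```

The result matches the listed Gene Length (990 bp) and Protein Length (329 aa).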
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0364236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.64466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCCG AGATGAAGTC CGTATTGAGA AGCCTGCTGT GCGGTGCCGC CGCATTGACG ATGTCGTTTG CGAACCTGGC CAGCGCGGCC GATGCCTGGC CGACCAAGCC GATCACGCTC ATCGTGCCGT GGGCGGCGGG TGGTTCGACG GATATCCTGG CGCGCACGCT GTCCGAGCAG TTGACCAAGT CGCTCGGCCA GCCGGTCATC GTCGACAACC GGCCGGGTGC TTCCGGCAAC ATCGGTTCGG CAATGGTGGC ACGCGCGAAG CCGGACGGCT ACACGCTGCT GATCGGCTCG ATGAGCACGC ACGCGATGAA CCCGGCGTTG ATGCCGAACA TGCCGTTCAA GGGTGTGGAT GATTTCACGC CGCTGGGATT GCTGGCCTAT GTGACCAACA CGATGGTCGT GAACGCGTCC GTGCCGGTGC ACAACGTCAA GGAACTGATC GCGTACGCGA AGGCCAATCC GGGCAAGGTC GCCTACGCCA GCGCGGGCCC CGGCTCCACC AATCATCTGA GCGCGGTGTT GTTCGAGAAG ATGGCCGGCG TGCAGTTGCT GCATGTGCCG TACAAGGGTG GCGCGCCGGC CGTGGTCGAT ACGGTGGCGG GCCAGACGCA ATTGCTGTTC TCGGCGGGTA CCCAGACGCT GACGCACGTG AAGTCCGGCA AGCTGCGCCT GCTGGCCGTG ACCGAAGCCA AGCGCTCCCC GCTGCTGCCC AATGTGCCAA CGGTGGGCGA GACCCTGCCC GGCTACGAGC TTTCGGTCTG GTACGGCGCC TTTGGCCCGA AGAACATGCC TGCGGAACTG GTGACGCGCT TGAACAACGA GATCAACCGC GCGATGGCGC TGCCCGACGT CAAGGCCAAG ATGGAGTCGA TCGGCGTCGA GACGGCGACG TCCACCCCGC AGGAATTCGG CAAGATCCTG CGCCGCGATG CGGACCGCTA CGGCAAGCTG ATCCGCGAAC TCGGCATCCA GGGCGAATGA
|
Protein sequence | MKSEMKSVLR SLLCGAAALT MSFANLASAA DAWPTKPITL IVPWAAGGST DILARTLSEQ LTKSLGQPVI VDNRPGASGN IGSAMVARAK PDGYTLLIGS MSTHAMNPAL MPNMPFKGVD DFTPLGLLAY VTNTMVVNAS VPVHNVKELI AYAKANPGKV AYASAGPGST NHLSAVLFEK MAGVQLLHVP YKGGAPAVVD TVAGQTQLLF SAGTQTLTHV KSGKLRLLAV TEAKRSPLLP NVPTVGETLP GYELSVWYGA FGPKNMPAEL VTRLNNEINR AMALPDVKAK MESIGVETAT STPQEFGKIL RRDADRYGKL IRELGIQGE
|
| |
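As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below strips the 10-base block spacing and translates the first three blocks of the gene sequence. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which start codons are permitted), and it builds the lookup from the conventional TCAG ordering. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string, ignoring whitespace between 10-base blocks.

    Trailing bases that do not fill a complete codon are dropped.
    Stop codons are rendered as '*'.
    """
    dna = "".join(blocked_dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence shown above.
leading = "ATGAAGTCCG AGATGAAGTC CGTATTGAGA"
print(translate(leading))  # MKSEMKSVLR
```

The output agrees with the first block of the listed protein sequence (MKSEMKSVLR).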