Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_3900 |
Symbol | |
ID | 4040753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 445274 |
End bp | 446470 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637979319 |
Product | extra-cytoplasmic solute receptor |
Protein accession | YP_586037 |
Protein GI | 94312828 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.264332 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.374616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGGGG GCGCGGACTG GCGTGGATGC GGGTTATCCC GCGACTTTTT CATGCCCTCG CCCCCCCTTC GGCAAAGCCG AATTGTGCCG CCGGGTCAGT GGATTCACGC TGCGCGCTAC TCGAATCCCA GAAACCAGAC GCGTATGACG CGCGCATTCA GGAGACAGCG TATGACCACA TTCCCGCAGT TGCCAGCCAT CGGTACCGGC AAGTGGCGAG GCATGACGTG GGCGGCACTC GCGCTAGCAG CCGTGGCGGT GGCTGTTCCC GGCCAGGGCC GAGCGAATGA TGCCTACCCC TCCAAGCCCA TCGTGATGGT CGTGCCATTC GCGGCGGGTG GCCCGACGGA CGTCGTGGCG CGCTCGGTGG CGGCGGCGAT GTCCAAGACG CTGGGCCAGA GCGTGGTGGT GGAGAACCGT CTGGGTGCCG GCGGCACGGT GTCCGCTGCG TACGTGGCCA AGGCCGCGCC GGACGGCTAC ACGATCCTGA TTCACCATAA CGGCATGGCC ACCGCGCCGG CGCTCTACTC GAAGCTGCCG TACAAGCCGC TGACCGACTT CAGCTTCGTG GGCCAGGTGG CGGATGTGCC GATGACGCTG CTGGGACGCC ACGATCTGCC GCCCAACAAC CTGCCCGAGC TGGTGACCTA CATCCAGAAG AACCAGAACA AGGTGAACCT GGCCAACGCG GGGCTGGGCG CGGTGTCGCA GTTGTGCGGG ATGCTGTTCC AGAAGGCGAT CGGCGTGGAT GTGCAGACGA TCCCATACCA GGGCACCGCG CCGGCGATGA CTGCGCTGCT GGGTGGTCAG GTGGATGTGC TGTGCGACCA GACCACGCAG ACGCTTACCC ATATCAAGGC TGATAAGGTC AAGCTGTACG GCGTGACCAC GGCTGAGCGC ATTCCCGCGC TACCGAACGC GCCGACGCTG CGCGAAGGCG GCCTGAAGGG CTTCGAAGTC AAGGTCTGGC ACGGGATCTA CGCGCCCAAG GGCACGCCGC CTGCCGTGAT CAACAAGCTC AACGGCGCGT TGCGCGCGGC GCTGAAGGAC CCGGCCGTTG CGGCGCGTAT GCAGGATCTG GGCGCGGTGA TCGTGCCCGA GGACAAGCAG ACGCCGGAAG GGCTGCGCAC GTGGCTGGCC TCGGAGATCG ACAAGTGGTC GCCGATTATC AAGGCGGCTG GCGTGAAGGC GGACTAA
|
Protein sequence | MTGGADWRGC GLSRDFFMPS PPLRQSRIVP PGQWIHAARY SNPRNQTRMT RAFRRQRMTT FPQLPAIGTG KWRGMTWAAL ALAAVAVAVP GQGRANDAYP SKPIVMVVPF AAGGPTDVVA RSVAAAMSKT LGQSVVVENR LGAGGTVSAA YVAKAAPDGY TILIHHNGMA TAPALYSKLP YKPLTDFSFV GQVADVPMTL LGRHDLPPNN LPELVTYIQK NQNKVNLANA GLGAVSQLCG MLFQKAIGVD VQTIPYQGTA PAMTALLGGQ VDVLCDQTTQ TLTHIKADKV KLYGVTTAER IPALPNAPTL REGGLKGFEV KVWHGIYAPK GTPPAVINKL NGALRAALKD PAVAARMQDL GAVIVPEDKQ TPEGLRTWLA SEIDKWSPII KAAGVKAD
|
| |