Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_5058 |
Symbol | bugT |
ID | 4041920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 1748467 |
End bp | 1749453 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637980479 |
Product | extra-cytoplasmic solute receptor |
Protein accession | YP_587189 |
Protein GI | 94313980 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0827045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCA AGATGGGCGG GGCGCGTCTG GCCGCCGCAT GGGTTGCGGC ATCCGCATTG TCATTCGGCA TGGCCGGTAC CGCGCAGGCC GCCTGGCCTG AGCGGCCGAT CCGGTTGATC GTGCCGGCCG CCGCGGGCGG CACCACCGAT ATCGCGGCCC GCCTGGTAGG CAAGCGTATG AGCGAGATCC TGGGCCAGCC GGTGGTGGTG GACAACCGTG CCGGCGGCGC CGGCATCATC GGCTCGCAAG CGCTGCTCCT GGCGCCGGCC GACGGCTACA CGCTGATGAT GGGTAACATC GGGCCGAACG CGATCAACTA CGCGCTGTAT CGCCAGTTGC CCTACAAACC CCAGGATTTC GCGCCGATCA CGATGGTGGT CTCCGTGCCA AACGTGCTGG TGGTCAATGC CAAGGTGCCC GCGCGCAACG TCGCGGAACT GGTGGCGCTG GCGAAGTCCG AGCCGGGCAA GCTGTCGTTC GGGTCGTCGG GCTCGGGCCA GTCGGTGCAC CTGTCGGGCG AGCTATTCAA GAAGCGCGCA GGCATCGACA TCATCCACGT GCCCTACAAA GGCGCGGCGC CTGCCGTGGC CGATCTGGTG GCGGGCCAGG TGACCATGAT GGTCGACAAC CTGCCCAGTT CGTTGCCGCA GATCCAGGCT GGCAAGCTGC GCGCGCTGGC CGTGACCAGC GGGACGCGGG TGGCCGAGCT ACCTGACGTG CCGACGATGA AGGAAGCCGG CTTCGACGAT TTCCAGGTGA CGGCGTGGTT CGGCCTGGTG GCACGCGCCG GCACACCACC GGCGGTGATC GCGCAACTGT ACAAGGCTGC GGCAACGGCG CTGGCCGAAC CGCAGATCAA GTCGCGACTG GCTGAACTTG GCGGACAGGC GGGCGGCGAC ACGCCCGAGC ACTTCGGCCA GTTCATCGAA CAGGAGCGCC AGCGCTGGGC GCGGGTCGTC AAGGACACGG GTATTCCGCA GCAGTAA
|
Protein sequence | MTIKMGGARL AAAWVAASAL SFGMAGTAQA AWPERPIRLI VPAAAGGTTD IAARLVGKRM SEILGQPVVV DNRAGGAGII GSQALLLAPA DGYTLMMGNI GPNAINYALY RQLPYKPQDF APITMVVSVP NVLVVNAKVP ARNVAELVAL AKSEPGKLSF GSSGSGQSVH LSGELFKKRA GIDIIHVPYK GAAPAVADLV AGQVTMMVDN LPSSLPQIQA GKLRALAVTS GTRVAELPDV PTMKEAGFDD FQVTAWFGLV ARAGTPPAVI AQLYKAAATA LAEPQIKSRL AELGGQAGGD TPEHFGQFIE QERQRWARVV KDTGIPQQ
|
| |