Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_1598 |
Symbol | |
ID | 4038401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 1722932 |
End bp | 1724482 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637976982 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_583750 |
Protein GI | 94310540 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1174] ABC-type proline/glycine betaine transport systems, permease component [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.144217 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCTCATT CATTCACGCG GTTCGCCCTC CGGCAATGCC GCTTCCTTGG CGCGCTGTTA GTGCTGTTGG TCATGGCCTG TGGTCCGGCT GCCGGCGCCG ACACGCTGCG CATCGGTTCC AAGCGCTTTA CCGAGTCGTA CATCCTGGGC GAGATCCTGA GGCAGGTTGC CGAGCCGCAC GGCCCGGTGC GATATCTGCC GGGGCTGGGC AACACCGCCA TCGTGTTCCA GGCGCTCCAG GCCGGAAGCA TCGACCTATA CCCGGACTAC ACCGGCACGC TCGCCAGCGA GATCCTGAGA CTACCCGGTA CCGCCACGCT GGAGCAGATC AACGCCGCGC TGGCGCCAAT GGGATTGGGC GCGGCCATCG CGCTCGGCTT CGACAATACC TATGCGATCG CGGTCTCCGA TGCCCAGCCA GTCAGCCTGC GCGCGCTTGG CGACCTTGCG GGGCAGCCGG ACCTGCGGAT CGCCCTGTCG CACGAGTTTC TTGGCCGTGC CGATGGCTGG CCCGCGCTCA AGCGCGCGTA TGGACTGCCT CAGCGACCAT CCGGGATCGA TCATGGCCTC GCCTACGAGG CATTGGCGCA CGGACAGATT GACGCGACCG ACATCTATTC GACCGATGCC AAGATCAGCA AGTACCATCT GCGCGTGCTC GAGGACAACC AGCGCGTGTT TCCGCGCTAC GAGGCGGTGA TCGTCTACCG GCTCGACGTG CCGACGCGTC ATGCGGCGGC GTGGCAGGCA CTGCGGCGTC TGGCTGGCAC TATCGGAACG CAGGACATGA TCGCCATGAA TGCGGCAGCG GAAGTCGATG GGAAGACGTT CTCTGCTGTG GCCCGCAATT TCCTGTCCGG GCATCCAGGG GTATCGGCGG ACGGTGTGGC GAGATCGGAT CAGCGCGACA ACCTGGCGGG GATGCTGACG AACGCGGATA CCGGGCGCCT GACTGTCCGG CATCTGGCGC TGGTCGGCGG CTCGGTGGGG GCCGCCACGC TGGTCGGCGT GCCGCTGGGT GTGGTGGCTG CGCGCCGGCG CCGGTTCGGT CAGGTGCTGC TGGCCTTGGT CGGTATGTTG CAGACCATCC CGTCATTGGC GCTGCTGGCG ATGCTGATAC CGGCGCTGGG CCGCATCGGC ATCTGGCCGG CACTGGTCGC GCTGTTCCTC TATGCTTTGC TGCCGATCGT GCGCAACGCG TGCACCGGGT TGCAGGAAGT GCCGGCGGGG ATGCGCGATG CCGCCCTGGC ACTGGGGATG CGGCCGTTGC AAGTGCTCTG GTACGTGGAG TTGCCGCTCT CCCTGCCGGT GTTGCTGGCC GGTATCAAGA CCGCAGCCAT CATCAGCGTA GGCACGGCGA CCATCGCTGC GTTTGTCGGC GCGGGCGGGT ACGGCGAGCG GATCGTGACC GGACTGGCGC TGAACGACGC CACCCAGCTT CTCGGCGGCG CGATTCCGGC CGCGCTGCTG GCATTGGTGG TGCAGGGTGG CTTTGGTCTG GCCGAATACT TGCGGGATCG AAAGCGTGCA CGCATGGCGA AATTTGGGTA A
|
Protein sequence | MSHSFTRFAL RQCRFLGALL VLLVMACGPA AGADTLRIGS KRFTESYILG EILRQVAEPH GPVRYLPGLG NTAIVFQALQ AGSIDLYPDY TGTLASEILR LPGTATLEQI NAALAPMGLG AAIALGFDNT YAIAVSDAQP VSLRALGDLA GQPDLRIALS HEFLGRADGW PALKRAYGLP QRPSGIDHGL AYEALAHGQI DATDIYSTDA KISKYHLRVL EDNQRVFPRY EAVIVYRLDV PTRHAAAWQA LRRLAGTIGT QDMIAMNAAA EVDGKTFSAV ARNFLSGHPG VSADGVARSD QRDNLAGMLT NADTGRLTVR HLALVGGSVG AATLVGVPLG VVAARRRRFG QVLLALVGML QTIPSLALLA MLIPALGRIG IWPALVALFL YALLPIVRNA CTGLQEVPAG MRDAALALGM RPLQVLWYVE LPLSLPVLLA GIKTAAIISV GTATIAAFVG AGGYGERIVT GLALNDATQL LGGAIPAALL ALVVQGGFGL AEYLRDRKRA RMAKFG
|
| |