Gene Information |
Locus tag | Rmet_1878 |
Symbol | |
ID | 4038680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 2039178 |
End bp | 2040785 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637977258 |
Product | hypothetical protein |
Protein accession | YP_584026 |
Protein GI | 94310816 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
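
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2039178, 2040785)
print(length, protein_length_aa(length))
```

Under those assumptions the sketch gives 1608 bp and 535 aa, which agrees with the Gene Length and Protein Length fields.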
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00309539 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGA CTGTCTCGCC TTCGACATCC CCGGACCCCG ACGCGACGCC GGACCGCATC GCGCTGGATC CCGCCGGCAC GGACGTTACG CCGCTTTACG ACCTGACGAC GATCCGTCGC GTGGAGCACG CCGGGCTGGC CGCGACCCCG CCTTTCACGT TGATGTCCCG CGCGGGCGCT GCGGCAGCGG ACTGGCTGCA CGCATGGGTG CCGAATGGCC GCATCCTCTG CCTGGCTGGC CCTGGCAACA ACGGCGGCGA CGCGCTGGTC GCCGCCCTGC GCCTGCATCA GCGCGGACGG CTCGTGGAAA CCTGGCTGAT CGGCGAAGCC GACCTGCTGC CCGCCGATGC GGCCCGCGCC TGGCTCGAGG CCCGCGCCGC CGGCGTGCCG CTCCTGGCGC TGCCCAACGA TGCCGATACC GGGATTCCTC CCTGGCCGAA TGGATGCGCG GCAATCGTTG ATGGATTGCT GGGGATCGGA CTCAACCGTG CGGCCGACGG CAATATGGCG CGTTGGATTG ATCATCTCAA TAACTCGCAC CTGCCGGTAT TCTCGCTCGA TATCCCGAGC GGATTGTTCG CCGACACCGG CGCCGGCAAC CCGGCGGTCC GGGCACAACG CACGCTGACG TTCCTGGCTG CCAAGCCGGG CCTGCTGACC CTGGATGGCC GCGATTGCGC TGGCGAGGTC GATATCGCAC CACTCGGGCT CGACTATCCG CCAGCCGAAC ATCCCGTGGC GGTGGTCAAT CAGCCGCCGG GTTTTTCCCA TTCGCTGCCG CGACGTGAGC ACGCGGATAA CAAAGGCAGC TTCGGCAGCC TGGCAGTGAT CGGTGGCAGC CATGGAATGA CCGGGGCCCC ATTGCTTGGC GCGCGCGCCG CCCTATATCT GGGCGCCGGT CGTGTCCATG TGGGATTCCT CGCGCAACCG GCACCCGCCA TCGACCCGGT CCATCCGGAG TTGATGCTGC ACGCCGTGAC GGAGCTTTCG CCCGGCGCGA TGTCCGCCTA CGTGGTCGGC CCCGGGATGG GCACGGGTGC CTCCGCGCGC AAGCAACTGA CGCAATTGAT CGATATCTGC GCCAACGCGG CGACAGCCGC ACCGCTGGTG CTCGATGCGG ATGCGCTCAA CCTGCTGGCT ACCGATGGCA CGCTGGCCCA GCAACTTGTC GAAAGCGGCG TGGCACACGT GATGACGCCC CACCCGCTTG AGGCCGCGCG ACTGCTGGGC AGCACCGTGG CCGATATCCA GCGCAACCGG CTGGCGGCCG CCACCGCGCT GGCCACGCGG TGGCAGGCCA CGATCGTACT CAAGGGCTCC GGGAGCGTGA TCGCGTCCCC AGACGGCGCG CCGCCCGCGA TCAATCCAAC CGGCAATGCC GCGCTCTCCA CCGCGGGCAC CGGTGATGTC CTGGCCGGAA TGATCGGCGC CCTGATCGCC CAGGGCATGC CGATCGTCGC AGCCGCGCGC GCGGCGGTCT GGATTCACGG CCGCGCCGCC GACAGGCTAG TAGCGTCGGG CACGGGCCCG GCTGGCATGA CTGCCAGCGA GCTCTATCTT CCCGCCCGGG ATATTTTCAA CGCGTTGCTG CGCGGTGGCG GCGCGTGA
|
Protein sequence | MSTTVSPSTS PDPDATPDRI ALDPAGTDVT PLYDLTTIRR VEHAGLAATP PFTLMSRAGA AAADWLHAWV PNGRILCLAG PGNNGGDALV AALRLHQRGR LVETWLIGEA DLLPADAARA WLEARAAGVP LLALPNDADT GIPPWPNGCA AIVDGLLGIG LNRAADGNMA RWIDHLNNSH LPVFSLDIPS GLFADTGAGN PAVRAQRTLT FLAAKPGLLT LDGRDCAGEV DIAPLGLDYP PAEHPVAVVN QPPGFSHSLP RREHADNKGS FGSLAVIGGS HGMTGAPLLG ARAALYLGAG RVHVGFLAQP APAIDPVHPE LMLHAVTELS PGAMSAYVVG PGMGTGASAR KQLTQLIDIC ANAATAAPLV LDADALNLLA TDGTLAQQLV ESGVAHVMTP HPLEAARLLG STVADIQRNR LAAATALATR WQATIVLKGS GSVIASPDGA PPAINPTGNA ALSTAGTGDV LAGMIGALIA QGMPIVAAAR AAVWIHGRAA DRLVASGTGP AGMTASELYL PARDIFNALL RGGGA
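
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short Python sketch. It is an assumption-laden example rather than the method used to build this record: the helper names are hypothetical, whitespace between the 10-character blocks is simply stripped, and the codon assignments are those shared by translation tables 1 and 11. Alternative start codons permitted by table 11 (such as GTG or TTG) are not given special treatment, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGAGCACGA CTGTCTCGCC TTCGACATCC"
print(translate(prefix))  # MSTTVSPSTS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.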
|
| |