Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2172 |
Symbol | |
ID | 4897367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2303147 |
End bp | 2304253 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640112766 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001044047 |
Protein GI | 126462933 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0235632 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGGA TACTCATCAC TGGCGGCTGC GGGTTCATCG GCCGGCATGT GGCCGAGGAA CTGCTGGCGC ACGGCTATGA GGTGCGTCTC TACGATGCGC TGATCGATCA GGTGCATGGC GGCACGTCGG CCGAGCTGCC CGAGGGCGCC GAGGTCGTGC GCGGCGACAT GCGCGACGCC GACCGGCTCC GCCCGGCGCT GAAGGACTGC GATGCGGTGC TGCATCTGGC GGCCGAGGTG GGCGTCGGAC AGTCCATGTA CGAGATCGCG CGCTATGTCG GCGCGAACGA CCTCGGCACG GCGGTGCTGC TCGAGGCGCT GATCGACCGG CCGGTGTCGC GGATCGTCGT GGCCTCGTCG ATGAGCGTCT ATGGCGAGGG GCACTATGCC CGCGAGGACG GGTCGCGGCT GGAGAAGGTG CGGCGCAGGG CGGCGGACAT CCGCGCCGCC CGCTGGAACC CGGTGGATGC GGACGGCCGG TCGCTGATGG CCGTGCCCAC CGACGAGGAG AAGCGGGTGG ATCTGGCCTC GATCTACGCG CTCACCAAAT ATGTGCAGGA GCAGGCGGTG CTGATCCATG GCGAGGCCTA CGGGGTCGAT GCCGTGGCGC TGCGGCTCTT CAATGTGTTC GGCGCGGGGC AGGCGCTGTC GAACCCTTAC ACCGGGGTGC TCGCGAACTT CGCCGCGCGG CTGGCCAACG GCGAGCGGCC GACGATCTTC GAGGATGGCG AGCAGAAGCG CGATTTCGTC CATGTGCGCG ACGTGGCCTG CGCCTTCCGC CTCGCGCTCG AGACGCCGGA CGCGGCGGGC GAGGTCATCA ATGTGGGGTC GGGCGCGGCC TATACGATCG CCGGCGTGGC GCGCCTTCTG GCCGAAGCGA TGGGGCGGCC CGAGCTCACG CCCGAGATCC TCAACCGCGC CCGGTCAGGC GATATCCGCA ACTGTTTCGC CGATATCTCG AAGGCGCGGT CGATCCTCAA CTTCGAGCCG CGCCACCGGC TCGAGGATTC GCTCGGCGAT TTCGTGGCCT GGGTGGCGGG CAGCGCTGCC GAGGATCGCG GTGCCGACAT GCGACGCCAG CTCGAGGAGC GGGGGCTCGT GACATGA
|
Protein sequence | MARILITGGC GFIGRHVAEE LLAHGYEVRL YDALIDQVHG GTSAELPEGA EVVRGDMRDA DRLRPALKDC DAVLHLAAEV GVGQSMYEIA RYVGANDLGT AVLLEALIDR PVSRIVVASS MSVYGEGHYA REDGSRLEKV RRRAADIRAA RWNPVDADGR SLMAVPTDEE KRVDLASIYA LTKYVQEQAV LIHGEAYGVD AVALRLFNVF GAGQALSNPY TGVLANFAAR LANGERPTIF EDGEQKRDFV HVRDVACAFR LALETPDAAG EVINVGSGAA YTIAGVARLL AEAMGRPELT PEILNRARSG DIRNCFADIS KARSILNFEP RHRLEDSLGD FVAWVAGSAA EDRGADMRRQ LEERGLVT
|
| |