Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_0520 |
Symbol | |
ID | 3718433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 2258269 |
End bp | 2259375 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640071729 |
Product | NAD-dependent dehydratase/epimerase |
Protein accession | YP_353593 |
Protein GI | 77464089 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGGA TACTCATCAC TGGCGGCTGC GGGTTCATCG GCCGGCATGT GGCCGAGGAA CTGCTGGCGC ACGGCTATGA GGTGCGTCTC TACGATGCGC TGATCGACCA GGTGCACGGC GGCACGTCGG CCGAGCTGCC CGAGGGCGCC GAGGTCGTGC GTGGCGACAT GCGCGACGTC GACCGGCTCC GCCCGGCGCT GAAGGACTGC GATGCGGTGC TGCATCTGGC GGCCGAGGTG GGCGTCGGAC AGTCCATGTA CGAGATCGCG CGCTATGTCG GCGCGAACGA CCTCGGCACG GCGGTGCTGC TCGAGGCGCT GATCGACCGG CCGGTGTCGC GGATCGTCGT GGCCTCGTCG ATGAGCGTCT ATGGCGAGGG GCACTATGCC CGCGAGGACG GGTCGCGGCT GGAGAAGGTG CGGCGCAGGG CGGCGGACAT CCGCGCCGCC CGCTGGAACC CGGTGGATGC GGACGGCCGG TCGCTGATGG CCGTGCCCAC GGACGAGGAG AAGCGGGTGG ATCTGGCCTC GATCTACGCG CTCACCAAAT ATGTGCAGGA GCAGGCGGTG CTGATCCATG GCGAGGCCTA TGGGGTCGAT GCCGTGGCGC TGCGGCTCTT CAATGTGTTC GGCGCGGGGC AGGCGCTGTC GAACCCCTAT ACCGGGGTGC TCGCGAACTT CGCCGCGCGA CTGGCCAACG GCGAGCGGCC TACGATCTTC GAGGATGGCG AGCAGAAGCG CGATTTCGTC CATGTGCGCG ACGTGGCCCG CGCCTTCCGG CTCGCGCTCG AGACGCCGGA CGCGGCGGGC GAGGTCATCA ATGTGGGGTC GGGCGCGGCC TACACGATCG CCGGCGTGGC GCGCCTTCTG GCCGAAGCGA TGGGGCGGCC CGAGCTCACG CCCGAGATCC TCGACCGCGC CAGGTCGGGC GATATCCGCA ACTGTTTCGC CGATATCTCG AAGGCGCGGT CGATCCTCAA CTTCGAGCCG CGCCACCGGC TCGAGGATTC GCTCGGCGAT TTCGTGGCCT GGGTGGCGGG CAGCGCTGCC GAGGATCGCG GCGCCGACAT GCGACGCCAG CTCGAGGAGC GGGGGCTCGT GACATGA
|
Protein sequence | MARILITGGC GFIGRHVAEE LLAHGYEVRL YDALIDQVHG GTSAELPEGA EVVRGDMRDV DRLRPALKDC DAVLHLAAEV GVGQSMYEIA RYVGANDLGT AVLLEALIDR PVSRIVVASS MSVYGEGHYA REDGSRLEKV RRRAADIRAA RWNPVDADGR SLMAVPTDEE KRVDLASIYA LTKYVQEQAV LIHGEAYGVD AVALRLFNVF GAGQALSNPY TGVLANFAAR LANGERPTIF EDGEQKRDFV HVRDVARAFR LALETPDAAG EVINVGSGAA YTIAGVARLL AEAMGRPELT PEILDRARSG DIRNCFADIS KARSILNFEP RHRLEDSLGD FVAWVAGSAA EDRGADMRRQ LEERGLVT
|
| |