Gene RSP_0520 details


Gene Information

Locus tag: RSP_0520
Symbol:
ID: 3718433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 2258269
End bp: 2259375
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 70%
IMG OID: 640071729
Product: NAD-dependent dehydratase/epimerase
Protein accession: YP_353593
Protein GI: 77464089
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID:
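
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The record does not state either convention, but both are consistent with the reported values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2258269
END_BP = 2259375


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Length in aa, assuming the CDS ends with one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no amino acid


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```

Both results agree with the Gene Length and Protein Length fields.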


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAGGA TACTCATCAC TGGCGGCTGC GGGTTCATCG GCCGGCATGT GGCCGAGGAA 
CTGCTGGCGC ACGGCTATGA GGTGCGTCTC TACGATGCGC TGATCGACCA GGTGCACGGC
GGCACGTCGG CCGAGCTGCC CGAGGGCGCC GAGGTCGTGC GTGGCGACAT GCGCGACGTC
GACCGGCTCC GCCCGGCGCT GAAGGACTGC GATGCGGTGC TGCATCTGGC GGCCGAGGTG
GGCGTCGGAC AGTCCATGTA CGAGATCGCG CGCTATGTCG GCGCGAACGA CCTCGGCACG
GCGGTGCTGC TCGAGGCGCT GATCGACCGG CCGGTGTCGC GGATCGTCGT GGCCTCGTCG
ATGAGCGTCT ATGGCGAGGG GCACTATGCC CGCGAGGACG GGTCGCGGCT GGAGAAGGTG
CGGCGCAGGG CGGCGGACAT CCGCGCCGCC CGCTGGAACC CGGTGGATGC GGACGGCCGG
TCGCTGATGG CCGTGCCCAC GGACGAGGAG AAGCGGGTGG ATCTGGCCTC GATCTACGCG
CTCACCAAAT ATGTGCAGGA GCAGGCGGTG CTGATCCATG GCGAGGCCTA TGGGGTCGAT
GCCGTGGCGC TGCGGCTCTT CAATGTGTTC GGCGCGGGGC AGGCGCTGTC GAACCCCTAT
ACCGGGGTGC TCGCGAACTT CGCCGCGCGA CTGGCCAACG GCGAGCGGCC TACGATCTTC
GAGGATGGCG AGCAGAAGCG CGATTTCGTC CATGTGCGCG ACGTGGCCCG CGCCTTCCGG
CTCGCGCTCG AGACGCCGGA CGCGGCGGGC GAGGTCATCA ATGTGGGGTC GGGCGCGGCC
TACACGATCG CCGGCGTGGC GCGCCTTCTG GCCGAAGCGA TGGGGCGGCC CGAGCTCACG
CCCGAGATCC TCGACCGCGC CAGGTCGGGC GATATCCGCA ACTGTTTCGC CGATATCTCG
AAGGCGCGGT CGATCCTCAA CTTCGAGCCG CGCCACCGGC TCGAGGATTC GCTCGGCGAT
TTCGTGGCCT GGGTGGCGGG CAGCGCTGCC GAGGATCGCG GCGCCGACAT GCGACGCCAG
CTCGAGGAGC GGGGGCTCGT GACATGA
 
Protein sequence
MARILITGGC GFIGRHVAEE LLAHGYEVRL YDALIDQVHG GTSAELPEGA EVVRGDMRDV 
DRLRPALKDC DAVLHLAAEV GVGQSMYEIA RYVGANDLGT AVLLEALIDR PVSRIVVASS
MSVYGEGHYA REDGSRLEKV RRRAADIRAA RWNPVDADGR SLMAVPTDEE KRVDLASIYA
LTKYVQEQAV LIHGEAYGVD AVALRLFNVF GAGQALSNPY TGVLANFAAR LANGERPTIF
EDGEQKRDFV HVRDVARAFR LALETPDAAG EVINVGSGAA YTIAGVARLL AEAMGRPELT
PEILDRARSG DIRNCFADIS KARSILNFEP RHRLEDSLGD FVAWVAGSAA EDRGADMRRQ
LEERGLVT
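
As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates space-separated DNA blocks and computes GC content. It assumes only that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons). Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. `translate`, `gc_percent`, and `clean` are hypothetical helper names, not part of any database tool.

```python
# Codon table in the conventional TCAG ordering (amino-acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGGCGAGGA TACTCATCAC TGGCGGCTGC"
print(translate(first_blocks))  # MARILITGGC
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.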