Gene Rsph17029_2172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2172 
Symbol 
ID4897367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2303147 
End bp2304253 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content69% 
IMG OID640112766 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001044047 
Protein GI126462933 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0235632 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGGA TACTCATCAC TGGCGGCTGC GGGTTCATCG GCCGGCATGT GGCCGAGGAA 
CTGCTGGCGC ACGGCTATGA GGTGCGTCTC TACGATGCGC TGATCGATCA GGTGCATGGC
GGCACGTCGG CCGAGCTGCC CGAGGGCGCC GAGGTCGTGC GCGGCGACAT GCGCGACGCC
GACCGGCTCC GCCCGGCGCT GAAGGACTGC GATGCGGTGC TGCATCTGGC GGCCGAGGTG
GGCGTCGGAC AGTCCATGTA CGAGATCGCG CGCTATGTCG GCGCGAACGA CCTCGGCACG
GCGGTGCTGC TCGAGGCGCT GATCGACCGG CCGGTGTCGC GGATCGTCGT GGCCTCGTCG
ATGAGCGTCT ATGGCGAGGG GCACTATGCC CGCGAGGACG GGTCGCGGCT GGAGAAGGTG
CGGCGCAGGG CGGCGGACAT CCGCGCCGCC CGCTGGAACC CGGTGGATGC GGACGGCCGG
TCGCTGATGG CCGTGCCCAC CGACGAGGAG AAGCGGGTGG ATCTGGCCTC GATCTACGCG
CTCACCAAAT ATGTGCAGGA GCAGGCGGTG CTGATCCATG GCGAGGCCTA CGGGGTCGAT
GCCGTGGCGC TGCGGCTCTT CAATGTGTTC GGCGCGGGGC AGGCGCTGTC GAACCCTTAC
ACCGGGGTGC TCGCGAACTT CGCCGCGCGG CTGGCCAACG GCGAGCGGCC GACGATCTTC
GAGGATGGCG AGCAGAAGCG CGATTTCGTC CATGTGCGCG ACGTGGCCTG CGCCTTCCGC
CTCGCGCTCG AGACGCCGGA CGCGGCGGGC GAGGTCATCA ATGTGGGGTC GGGCGCGGCC
TATACGATCG CCGGCGTGGC GCGCCTTCTG GCCGAAGCGA TGGGGCGGCC CGAGCTCACG
CCCGAGATCC TCAACCGCGC CCGGTCAGGC GATATCCGCA ACTGTTTCGC CGATATCTCG
AAGGCGCGGT CGATCCTCAA CTTCGAGCCG CGCCACCGGC TCGAGGATTC GCTCGGCGAT
TTCGTGGCCT GGGTGGCGGG CAGCGCTGCC GAGGATCGCG GTGCCGACAT GCGACGCCAG
CTCGAGGAGC GGGGGCTCGT GACATGA
 
Protein sequence
MARILITGGC GFIGRHVAEE LLAHGYEVRL YDALIDQVHG GTSAELPEGA EVVRGDMRDA 
DRLRPALKDC DAVLHLAAEV GVGQSMYEIA RYVGANDLGT AVLLEALIDR PVSRIVVASS
MSVYGEGHYA REDGSRLEKV RRRAADIRAA RWNPVDADGR SLMAVPTDEE KRVDLASIYA
LTKYVQEQAV LIHGEAYGVD AVALRLFNVF GAGQALSNPY TGVLANFAAR LANGERPTIF
EDGEQKRDFV HVRDVACAFR LALETPDAAG EVINVGSGAA YTIAGVARLL AEAMGRPELT
PEILNRARSG DIRNCFADIS KARSILNFEP RHRLEDSLGD FVAWVAGSAA EDRGADMRRQ
LEERGLVT