Gene Rsph17029_3681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3681 
Symbol 
ID4898508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp785939 
End bp786970 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content66% 
IMG OID640114289 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001045543 
Protein GI126464430 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0230856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.593918 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGAAC ACAAGCGCAT TCTGGTGACG GGCGGTCTCG GCTTCCTCGG CTCCTTCCTG 
TGCGAGAGCC TGCTTGCGGA CGGCCACGAG GTCATCTGCG TCGACAGCTT CCAGACCGGC
TCCCGCGAGA ATGTGGCCCA TCTCCGGGAC CATCCCAACT TCGAGATCAT GCGGCATGAC
GTGACCGTGC CGCTGCATGT CGAGGCCGAC GAGATCTTCA ACCTCGCCTG CCCGGCCTCG
CCGATCCACT ATCAGGTCGA TCCGGTGAAG ACGGTGAAGA CCAGCGTCAT GGGGGCGATC
AACCTGCTCG ACCTCGCGCG GCGCACCAAG TCGAAGATCT TTCAGGCCTC GACCTCCGAG
GTCTACGGCG ATCCGAAGGT CCATCCCCAG CCCGAGGGCT ACTGGGGCCA TGTGAACCCC
AACGGCCCGC GCTCCTGCTA CGACGAGGGC AAGCGCTGCG CCGAGACCCT GTTCTTCGAC
TATCACCGCC AATATGGCGT CAACATCCGC ATCGCCCGGA TCTTCAACAC CTACGGGCCG
CGGATGCACC CGAACGACGG GCGGGTGGTC TCGAACTTCA TCGTTCAGGC GCTGAGCGGC
AAGCCGATCA CCATCTACGG CGACGGCACG CAGACCCGCT CCTTCTGCTA CGTCACCGAC
CTGATCCGGG GCTTCCGCGC CCTGATGGAC GCGCCGGACG GGATCGAGCT GCCGGTGAAC
CTCGGCAACC CGGGCGAGTT CACCATGCTC GAGCTGGCGA CGCTGGTGAT CGAGCTGACC
GGCTCGCGCT CCAAGGTCGT GCATCTGCCG CTGCCGAAGG ACGATCCCAC CCAGCGCAAA
CCCGACATCA CCCGCGCCAC CGAGACGCTC GGCTGGAAGC CCGAGATCCC GCTGTTCGAC
GGCCTGCAGC GCACGATCGC CCATTTCGAT CAGCTGCTGA GCCGGACGCA GAAGCGGGCC
GTCCCCGAGA TGTCGATGGC GATGGTCGCG AACGGTCTCG CCCGCAACGG CGCCTCCGAA
GCGCTGCGCT GA
 
Protein sequence
MHEHKRILVT GGLGFLGSFL CESLLADGHE VICVDSFQTG SRENVAHLRD HPNFEIMRHD 
VTVPLHVEAD EIFNLACPAS PIHYQVDPVK TVKTSVMGAI NLLDLARRTK SKIFQASTSE
VYGDPKVHPQ PEGYWGHVNP NGPRSCYDEG KRCAETLFFD YHRQYGVNIR IARIFNTYGP
RMHPNDGRVV SNFIVQALSG KPITIYGDGT QTRSFCYVTD LIRGFRALMD APDGIELPVN
LGNPGEFTML ELATLVIELT GSRSKVVHLP LPKDDPTQRK PDITRATETL GWKPEIPLFD
GLQRTIAHFD QLLSRTQKRA VPEMSMAMVA NGLARNGASE ALR