Gene Rsph17029_1794 details


Gene Information

Locus tag: Rsph17029_1794
Symbol:
ID: 4896379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1891725
End bp: 1892762
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 66%
IMG OID: 640112388
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001043673
Protein GI: 126462559
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
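
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1891725, 1892762)
print(length_bp)                  # 1038
print(protein_length(length_bp))  # 345
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.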


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.381286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGAA GCCTTATTCC GGTCGATGTG ACCGGTCATG ATTCTTTTAT GCCTCGTTTC 
CATCGCCGGA AGGTCATTCT CGTCACGGGA GGAGCCGGAT TTGTGGGCTC TCATCTTTGC
GAGCGGCTGA TTGCCGAAGG TCATTCCGTC GTCTGTCTCG ATAATCTTCT GACCGGCCGC
AAAGAGAATG TCGCTGGGCT GCTCGGCCAT CCCCAATTCC GCTTTCTCGA GCAGGACATC
CTGAGCCGGA TCGACTGGCA GGGGCCGCTG GACGAGATCT ACAACCTTGC CTGCGCGGCC
TCTCCGCCGC TCTACCAGCG CGACCCGATC CATACGTTCC GCACCTGCAC CGAGGGCGTG
CTGAACCTGC TCGCGCTGGC GCGGGCCACG GGCGCGCGCA TCCTGCAGGC CTCGACCTCC
GAGGTCTATG GCGATCCCGA GATCTCGCCC CAGCACGAGG GCTACCGCGG CTGCGTCAAT
ACGGTGGGTC CGCGGGCCTG CTACGACGAG GGCAAGCGCG CGGCCGAGAC GCTGTTCTGG
GAGTTCGGGG CCCATCAGGG CCTCGAGGTG CGGATCGCGC GGATCTTCAA CACCTACGGG
CCGCGGATGA GCCCCGAGGA CGGCCGCGTT GTCTCGAACT TCATCGTCCA GGCGCTGACC
CGCAGCGACA TCACCCTCTA TGGCGACGGG ATGCAGACGC GCTCCTTCTG CTATGTGGAC
GATCTGGTGA CCGGGCTGAT GGCGCTGATG GCGTCGGAGG TGAGCGAACC GGTCAACCTC
GGCAATCCGG GCGAATTCAC CATGCGGGAG CTGGCCGAGA TGGTGCTGGC TCAGACCGGC
TCTTCCTCGC GGCTGGTTCA TCGGCCGCTG CCGGTGGACG ATCCGCGCCA GCGCCGGCCC
GACATCGCGC AGGCCGCGCG GCTTCTCGGC TGGGCGCCGA CGGTGCCGCT GGCCGAAGGC
ATCGCCCGGA CCATCCGGCA TTTCGCGGGC GAACCTCAGG TCGTCCGGGC GCGCGAGGCT
CTGCTGGTCC ATGCCTGA
 
Protein sequence
MDGSLIPVDV TGHDSFMPRF HRRKVILVTG GAGFVGSHLC ERLIAEGHSV VCLDNLLTGR 
KENVAGLLGH PQFRFLEQDI LSRIDWQGPL DEIYNLACAA SPPLYQRDPI HTFRTCTEGV
LNLLALARAT GARILQASTS EVYGDPEISP QHEGYRGCVN TVGPRACYDE GKRAAETLFW
EFGAHQGLEV RIARIFNTYG PRMSPEDGRV VSNFIVQALT RSDITLYGDG MQTRSFCYVD
DLVTGLMALM ASEVSEPVNL GNPGEFTMRE LAEMVLAQTG SSSRLVHRPL PVDDPRQRRP
DIAQAARLLG WAPTVPLAEG IARTIRHFAG EPQVVRAREA LLVHA
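
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It builds a codon table in NCBI codon order; translation table 11 uses the same amino-acid assignments as the standard code, so the same table applies here. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, and only the first and last blocks of the gene sequence are translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: each base position cycles T, C, A, G,
# with the third position varying fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGACGGAA GCCTTATTCC GGTCGATGTG"))  # MDGSLIPVDV
# Last 18 bases, ending in the TGA stop codon.
print(translate("CTGCTGGTCC ATGCCTGA"))               # LLVHA
```

Both outputs match the corresponding ends of the listed protein sequence. A dedicated library would normally be used for full-length translation; this sketch only shows the codon-by-codon mapping.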