Gene Rsph17029_3798 details

Gene Information

Locus tag: Rsph17029_3798
Symbol:
ID: 4898263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 925872
End bp: 926906
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 72%
IMG OID: 640114402
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001045650
Protein GI: 126464537
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
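
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a protein length that excludes the stop codon. Both assumptions are consistent with the values listed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(925872, 926906)
print(length)                  # 1035
print(protein_length(length))  # 344
```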


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.387174
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.267254
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGA CCGCCCCCCT GCCAGCCCCG CGCCGGATCC TGATGCTGGG CGCCACCGGC 
ACCATCGGGC AGGCGACGGC GAAGGCGCTC CTCGCGCGCG GCCATGAGGT CGTCTGCCTT
CTCCGCCCTC GCGGCACCCG GCGGCAGGCG CGCCTGCCGG ACGGCGCGGT CCTGCGCTAC
GGCGACGTGA CCGACCCGCA GTCGCTCACC CGCGACGGCT TCTGCGGCGA GCGGTTCGAT
GCGCTCGTCT CCTGCCTCGC CTCGCGCACG GGCGCGCCGC GCGACGCCTG GGCCATCGAC
CATGCGGCCC ATTCCCATGC GCTCGCAGCG GCCCGCGCGG CGGGGGTGAC GCAGGTCGTG
CTCCTCTCGG CGATCTGCGT GCAGAGGCCG CTTCTCGCCT TCCAGCAGGC GAAGCTCGCC
TTCGAGGAAG AGCTCATGCG CTCGGGGCTA AACTGGTCCA TCGTGCGGCC GACCGCCTTC
TTCAAGTCGC TCTCGGGACA GGTGAAGCGG GTGCAGGAGG GCCGGCCCTT CCTCGTCTTC
GGCGACGGCA CCCTCACCGC CTGCAAGCCG ATCAGCGACG ACGACCTCGG CCGCTACATG
GCGCTCTGCC TCGAGGATCC CGCGCTCAGG AACCGGATCC TGCCCATCGG CGGCCCGGGG
CCGGCGCTGA CCCCGCGGGC GCAGGCCGAG ATGCTGTTCC GGCTCATGGG CCGCCCGCCG
AAGATCCGGC AGGTGCCGGT GGCGCTGCTC GATGCGATCA TCGCGGTTCT CTCCCTCGGC
GGCCTATTGC TGCCCTCGCT CCGCGACAAG GCCGAACTCG CCCGCATCGG GCGCTATTAC
GCGACCGAGT CGATGCTGGT CCTCGATCCC GCCACGGGGC GCTACGATGC GGAGGCCACG
CCCTCCTTCG GCACCGAGAC GCTGGAGGAC TTCTACCGGC AGCTCCTCGC GGGCGAAGCG
ACGGTCGATC TGGGCGAGCA CGCGGTCTTC CGCGAGCGGG CCGTCACGGC AGAAGGATCG
CCCGGAAATC GTTGA
 
Protein sequence
MPETAPLPAP RRILMLGATG TIGQATAKAL LARGHEVVCL LRPRGTRRQA RLPDGAVLRY 
GDVTDPQSLT RDGFCGERFD ALVSCLASRT GAPRDAWAID HAAHSHALAA ARAAGVTQVV
LLSAICVQRP LLAFQQAKLA FEEELMRSGL NWSIVRPTAF FKSLSGQVKR VQEGRPFLVF
GDGTLTACKP ISDDDLGRYM ALCLEDPALR NRILPIGGPG PALTPRAQAE MLFRLMGRPP
KIRQVPVALL DAIIAVLSLG GLLLPSLRDK AELARIGRYY ATESMLVLDP ATGRYDAEAT
PSFGTETLED FYRQLLAGEA TVDLGEHAVF RERAVTAEGS PGNR
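
The two sequence blocks above are related by translation, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch under stated assumptions, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard genetic code, whose codon assignments translation table 11 shares (table 11 differs only in the alternative start codons it permits, which this sketch does not handle).

```python
# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCCGAGA CCGCCCCCCT GCCAGCCCCG"))  # MPETAPLPAP
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content listed in this record.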