Gene Rsph17025_1101 details


Gene Information

Locus tag: Rsph17025_1101
Symbol:
ID: 5084704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1126046
End bp: 1127416
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 68%
IMG OID: 640482659
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001167307
Protein GI: 146277148
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
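
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon; the function name `check_lengths` is hypothetical and only used for illustration.

```python
def check_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) implied by the coordinates.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with one stop codon that is not counted
    in the protein length.
    """
    span_bp = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = span_bp // 3             # whole codons in the CDS
    protein_aa = codons - 1           # drop the terminal stop codon
    return span_bp, protein_aa


# Values copied from the record above.
span_bp, protein_aa = check_lengths(1126046, 1127416)
print(span_bp, protein_aa)
print(span_bp % 3 == 0)  # a CDS length should be a multiple of 3
```

Under these assumptions the computed values agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields.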


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0457175
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGGG GCTGGACCAA GACCGACTGG CGCGCCAAGC CGCGCATCCA GATGCCCGAC 
TATCCGGAGG CCGCCGCCGT CGAGGCGGTC GAGGCGCAGC TTGCGAAATA TCCGCCCCTC
GTCTTCGCGG GCGAGGCCCG CAAGCTGAAG GCGGCGCTGG CCGAGGCGGC CGAGGGCCGC
GCCTTCCTGC TGCAGGGCGG CGACTGCGCC GAGAGCTTTG CGGAATTCTC GGCCGACAAC
ATCCGCGACA CGTTCCGCGT GCTCCTGCAG ATGGCGGTCG TGCTCACCTA CGGGGCCAAG
GTGCCGGTCG TGAAGATCGG CCGCATGGCG GGGCAGTTCG CCAAGCCGCG CTCGGCCCCG
ACCGAGGTCA TCAACGGGAT GGAGCTGCCG TCCTACCGGG GCGACATCAT CAACGGCTTC
GACCCGAGCC CCGAGTCGCG CATCCCCGAT CCCCGGCGGA TGCTGCAGGC CTACACACAG
GCCGCGGCCT CGCTCAATCT GCTGCGCGCC TTCTCGACGG GCGGCTTCGC CGACATCCAC
CGCGTCCATT CCTGGACGCT GGGCTTCTGC GAGCAGGACA AGGCCGAGCG GTATCGCGAC
ATCTCGAACC GGATCTCGGA CGCGCTCGAC TTCATGTCGG CCGCGGGCGT GAACGGTTCG
ACCTCGCACG ATCTGGCGAC GGTGGACTTC TACACCTCGC ACGAGGCGCT GCTGCTGGAA
TATGAAGAGG CGCTCTGCCG GATCGATTCG ATCACCGGCC AGCCGATCGC GGGCTCGGGC
CACATGATCT GGATCGGCGA CCGCACGCGC CAGATCGATG GCGCGCATGT CGAATTCTGC
CGCGGCGTGC TGAACCCGAT CGGGCTGAAA TGCGGCCCCT CGACCACGGT CGAGGATCTC
AAGGTGCTGA TGGCCAAGCT CAACCCGCAG AACGAGGCGG GGCGGCTCAC GCTGATCGCG
CGCTTCGGCG CGGGCAAGGT GGGCGAGCAT CTGCCGCGGC TGATCAAGGC CGTGCGCGAG
GAGGGCGCCA AGGTTACCTG GTGCTGCGAT CCGATGCACG GCAACACGAT CAAGGCGGCC
TCGGGCTACA AGACCCGCCC GTTCGACTCG GTGCTGCGCG AGGTGCGCGA GTTCTTCGCG
ATCCACAAGG CCGAGGGCAC GATCCCCGGC GGCGTGCATT TCGAGATGAC CGGGCAGGAC
GTGACCGAAT GCACCGGCGG CCTGCGTGCG GTGACGGACG AGGATCTCTC GAACCGCTAC
CACACGGCCT GCGATCCCCG CCTCAACGCC TCGCAGTCGC TGGAGCTGGC CTTCCTCGTG
GCCGAGGAAC TGACCACGAT GCGCGAAGCG GGCCGGCGCG TGGCGCTGTA G
 
Protein sequence
MSRGWTKTDW RAKPRIQMPD YPEAAAVEAV EAQLAKYPPL VFAGEARKLK AALAEAAEGR 
AFLLQGGDCA ESFAEFSADN IRDTFRVLLQ MAVVLTYGAK VPVVKIGRMA GQFAKPRSAP
TEVINGMELP SYRGDIINGF DPSPESRIPD PRRMLQAYTQ AAASLNLLRA FSTGGFADIH
RVHSWTLGFC EQDKAERYRD ISNRISDALD FMSAAGVNGS TSHDLATVDF YTSHEALLLE
YEEALCRIDS ITGQPIAGSG HMIWIGDRTR QIDGAHVEFC RGVLNPIGLK CGPSTTVEDL
KVLMAKLNPQ NEAGRLTLIA RFGAGKVGEH LPRLIKAVRE EGAKVTWCCD PMHGNTIKAA
SGYKTRPFDS VLREVREFFA IHKAEGTIPG GVHFEMTGQD VTECTGGLRA VTDEDLSNRY
HTACDPRLNA SQSLELAFLV AEELTTMREA GRRVAL
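
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, not the database's own method: it hard-codes the standard amino acid assignments, which translation table 11 shares with the standard code (the tables differ in permitted start codons, which this sketch does not model). The names `BASES`, `AMINO_ACIDS`, and `translate` are hypothetical. Only the first 30 bases and the final line of the gene sequence are used here.

```python
# Codon table in the conventional T, C, A, G ordering: the amino acid for
# codon b1 b2 b3 sits at index 16*i(b1) + 4*i(b2) + i(b3).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = seq[i:i + 3]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# Fragments copied from the gene sequence above.
first_30_bases = "ATGAGCAGGG GCTGGACCAA GACCGACTGG"
last_line = "GCCGAGGAAC TGACCACGAT GCGCGAAGCG GGCCGGCGCG TGGCGCTGTA G"

print(translate(first_30_bases))  # MSRGWTKTDW
print(translate(last_line))       # AEELTTMREAGRRVAL*
```

The first fragment reproduces the opening residues of the protein sequence, and the last fragment reproduces its final residues followed by the stop codon, which is consistent with the protein being one residue shorter than the number of codons in the gene.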