Gene RPD_2058 details

Gene Information

Locus tag: RPD_2058
Symbol:
ID: 4022540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2306602
End bp: 2307990
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 66%
IMG OID: 637962251
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_569194
Protein GI: 91976535
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
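
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2306602
end_bp = 2307990

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)  # prints: 1389 0 462
```

Under these assumptions the result agrees with the listed gene length (1389 bp) and protein length (462 aa).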


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.210314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAAC GTTGGACACC CGATAGCTGG CGCAGCAAGC CGGTTCAGCA GATGCCGGAT 
TACCCGGACG CGAAGGCGTT GGGCGATGTC GAGGCGCAGC TGTCGACCTT TCCGCCGCTG
GTTTTTGCAG GTGAGGCGCG CAACCTGAAG AAGGCGCTGG CGAGCGTGGC GGCTGGCGAA
TCTTTCCTGC TTCAGGGCGG CGATTGCGCC GAGAGCTTTG CCGAGCACGG CGCCAACAAC
ATCCGCGACC TGTTCCGCGT CTTCTTGCAG ATGGCGATCG TGCTGACCTA TGCGGGCGCC
TCGCCCGTGG TGAAGGTCGG CCGCATCGCC GGTCAGTTCG CCAAGCCGCG CTCGGCGCCA
GTCGAGAAGC GCGACGGCGT CGAACTGCCG AGCTACCGCG GCGACATCGT CAACGACATC
GCCTTCACCG AGGACGCGCG CCGGCCTGAT CCGCGCCGCC AGCTCGAGGC TTATCGCCAG
TCCGCCGCGA CGCTCAACCT GCTGCGCGCC TTCGCCAAGG GCGGCTACGC CAGCGTGGAG
AACGTTCACC GCTGGATGCT GCAGTCGGTC AGCGACAGTC CGCAGTCGAA GGCCTATGCG
GATCTTGCCG ACCGGGTCTC CGGCGCGCTC GATTTCATGC GCGCCTGCGG CCTGACCTTC
GCGGTCGACA GCGCGCTCGG CACCACCGAT TTCTACACCA GCCACGAAGC GCTGCTGCTC
GGCTACGAGC AGGCGATGAC CCGGATCGAT TCGACAACCG GCGACTGGTA CGCGACCTCC
GGCCACATGA TCTGGATCGG CGATCGCACC CGTCAGCTCG ATCACGCCCA TATCGAGTAT
TTCCGCGGCA TCAAGAATCC GATCGGCCTG AAATGCGGTC CGTCGCTGAA GACCGATGAG
CTGCTGAAGC TGATCGACGT GCTCAATCCC GAGAACGAGG CGGGCCGGCT GACGCTGATC
GGCCGGTTCG GCGCCGACAA GATTGGCGAC AGCCTTCCGG CGATGATCCG CACCGTACAG
CGCGAGGGCC GCAAGGTGGT GTGGTCGTGC GATCCGATGC ACGGCAACAC CATCACCTCG
ACCTCGGGCT ACAAGACCCG GCCGTTCGAC CGCATTTTGT CGGAAGTTCG CTCGTTCTTC
ACGATCCACG CCGCAGAGGG CACCCATGCC GGCGGCGTGC ATCTGGAGAT GACCGGGCAG
AACGTCACCG AATGCATCGG CGGCGCGCGC GCGATCACCG ACGAGGACCT CAACAACCGC
TATCACACCG CCTGCGATCC GCGGCTCAAT GCCGAGCAGT CGATCGACAT GGCGTTCCTG
ATTGCGGATC TGCTGAAGCA GGGCCGCGAC GGCAAGGTGA GCCCGCTGCC GGTCGCCGCG
GGACTGTGA
 
Protein sequence
MSERWTPDSW RSKPVQQMPD YPDAKALGDV EAQLSTFPPL VFAGEARNLK KALASVAAGE 
SFLLQGGDCA ESFAEHGANN IRDLFRVFLQ MAIVLTYAGA SPVVKVGRIA GQFAKPRSAP
VEKRDGVELP SYRGDIVNDI AFTEDARRPD PRRQLEAYRQ SAATLNLLRA FAKGGYASVE
NVHRWMLQSV SDSPQSKAYA DLADRVSGAL DFMRACGLTF AVDSALGTTD FYTSHEALLL
GYEQAMTRID STTGDWYATS GHMIWIGDRT RQLDHAHIEY FRGIKNPIGL KCGPSLKTDE
LLKLIDVLNP ENEAGRLTLI GRFGADKIGD SLPAMIRTVQ REGRKVVWSC DPMHGNTITS
TSGYKTRPFD RILSEVRSFF TIHAAEGTHA GGVHLEMTGQ NVTECIGGAR AITDEDLNNR
YHTACDPRLN AEQSIDMAFL IADLLKQGRD GKVSPLPVAA GL
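
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the tool used to generate this record. It relies on the fact that translation table 11 uses the standard codon assignments for elongation and differs only in which start codons are permitted. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the sketch does not handle alternative start codons such as GTG or TTG, which is harmless here because the gene begins with ATG.

```python
# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate(seq, to_stop=True):
    """Translate DNA codon by codon; stop at the first stop codon by default."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGTCCGAAC GTTGGACACC CGATAGCTGG CGCAGCAAGC CGGTTCAGCA GATGCCGGAT"
print(translate(first_row))    # prints: MSERWTPDSWRSKPVQQMPD
print(translate("GGACTGTGA"))  # last three codons; prints: GL
```

The two printed fragments match the start and the end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.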