Gene RPC_2080 details

Gene Information

Locus tag: RPC_2080
Symbol:
ID: 3971845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2276967
End bp: 2278355
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 65%
IMG OID: 637925188
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_531953
Protein GI: 90423583
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
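
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are inclusive, 1-based coordinates, and that Gene Length counts one terminal stop codon that is not part of the protein. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2276967
END_BP = 2278355
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start, end):
    """Length of an interval, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2278355 - 2276967 + 1 = 1389 bp, and 1389 / 3 - 1 = 462 aa, which agrees with the listed Gene Length and Protein Length.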


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.273143
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAGC GGTGGACACT CGACAGTTGG CGCAGCAAGC CGGTGCAGCA AATGCCGGAT 
TATCCGGATG CCAAGGCGCT TGGCGAGGTC GAGGCGCAGC TGGCCACGTT TCCGCCGCTG
GTTTTTGCAG GTGAGGCGCG CAATCTGAAG CGGGCACTGG CGCGGGTTTG CGCCGGCGAA
GCCTTCCTGT TGCAGGGCGG CGACTGCGCC GAGAGCTTCG CCGAGCACGG CGCCAACAAT
ATCCGGGATT TCTTCCGCGT GCTGCTGCAG ATGTCGGTGG TCTTGACCTA TGCCGGCGCG
CTGCCGGTGG TGAAGGTCGG CCGCATCGCC GGGCAATTCG CCAAGCCGCG GTCCTCGCCG
ATGGAAAAGC GCGGCGACGT CGAATTGCCG AGCTATCGCG GCGACATCGT CAACGACATC
GGCTTCACTG CGGCGTCGCG GGTGCCCGAT CCGCAGCGCC AGCTGATGGC CTATCGGCAG
TCGGCGGCGA CGCTGAACCT GCTGCGGGCG TTTGCCACTG GCGGCTTCGC CAATCTCGGC
AGCGTGCACC AGTGGATGCT TGGTTTCCTG AAGGACTCCC ACCAGTCGCG GCGTTACAAA
GAGTTGGCCG ACCGGATCTC CGACGCGCTG AACTTCATGC GCGCCTGCGG CCTCAACCTG
GAAAGCCATC CGGAGTTGCG CGCCACCGAG ATCTACACCA GCCATGAGGC GCTGCTGCTC
GGCTACGAGC AGGCCTTCAC CCGGGTGGAT TCCACCACCG GCGATTGGTA CGCCACCTCC
GGCCACATGT TGTGGATCGG CGACCGCACC CGCCAGCTCG ATCACGCCCA TATCGAATAT
TTCCGCGGCA TCAAGAATCC GATCGGGTTG AAGTGCGGCC CGTCGCTCAA GACCGATGAA
TTGCTGAAGC TGATCGACGT GCTCAATCCG GACAACGAGC CGGGTCGCCT CACGCTGATC
GGCCGGTTCG GCGCCGACAA GATCGGCGAC AGCCTGCCGG GGATGATCCG CGCCGTGCAG
CGCGAGGGCC GCGCCGTGGT GTGGTCGTGC GATCCGATGC ACGGCAACAC CATCACCTCG
ACCTCGGGCT ACAAGACCCG GCCGTTCGAC CGCATCCTGT CGGAGGTGAA ATCGTTCTTC
ACCATCCACG CCGCGGAAGG CACCCACGCC GGCGGCGTGC ACCTCGAGAT GACCGGCCAG
GACGTCACCG AATGCATCGG CGGGGCGCGG GCGATCACCG ACGAGGACCT CAACAACCGC
TATCACACCG CCTGCGATCC GCGGCTCAAT GCCGAGCAGT CGATCGACAT GGCGTTCCTG
ATCGCGGAAC TGTTGAAGCA GGATCGGGTC GGCAAGGCCA GCCCGTTGCC GGTCGCCGCT
GGACTGTGA
 
Protein sequence
MSERWTLDSW RSKPVQQMPD YPDAKALGEV EAQLATFPPL VFAGEARNLK RALARVCAGE 
AFLLQGGDCA ESFAEHGANN IRDFFRVLLQ MSVVLTYAGA LPVVKVGRIA GQFAKPRSSP
MEKRGDVELP SYRGDIVNDI GFTAASRVPD PQRQLMAYRQ SAATLNLLRA FATGGFANLG
SVHQWMLGFL KDSHQSRRYK ELADRISDAL NFMRACGLNL ESHPELRATE IYTSHEALLL
GYEQAFTRVD STTGDWYATS GHMLWIGDRT RQLDHAHIEY FRGIKNPIGL KCGPSLKTDE
LLKLIDVLNP DNEPGRLTLI GRFGADKIGD SLPGMIRAVQ REGRAVVWSC DPMHGNTITS
TSGYKTRPFD RILSEVKSFF TIHAAEGTHA GGVHLEMTGQ DVTECIGGAR AITDEDLNNR
YHTACDPRLN AEQSIDMAFL IAELLKQDRV GKASPLPVAA GL
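
The relationship between the gene sequence, the protein sequence, and the GC content field can be spot-checked with a small script. This is a minimal sketch, not the method used to produce the record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may serve as starts), and it builds the codon table from the conventional TCAG ordering. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
FIRST_CODONS = "ATGTCCGAGC GGTGGACACT CGACAGTTGG"
LAST_CODONS = "GGACTGTGA"

if __name__ == "__main__":
    print(translate(FIRST_CODONS))
    print(translate(LAST_CODONS))
```

Here `translate(FIRST_CODONS)` yields `MSERWTLDSW`, the first ten residues of the protein sequence, and `translate(LAST_CODONS)` yields `GL`, its last two residues followed by the TGA stop codon. Passing the full gene sequence to `gc_content` gives the fraction to compare with the GC content field.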