Gene RPC_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0533 
SymbolaroB 
ID3970774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp575703 
End bp576851 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content69% 
IMG OID637923649 
Product3-dehydroquinate synthase 
Protein accessionYP_530427 
Protein GI90422057 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.630888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGC CGCTCAATCA CTCCGCCCCG ATCACCGTCG AGGTCGCGCT CGGCGACCGC 
GGCTATGACA TCGTGATCGG CCGCGACGTG CTGCGTTCGC TCGGGACCCG CATCGCCGCG
TTGCGGCCCG GCGCCCGCAC GGCGATCGTC ACCGACCGTA ATGTCGCCAC CTGCTGGCTG
GCGCAGACCC AGGCCGCGCT CGACGATGTC GGCATCGTTT CGATGCCGAT CGTGGTCGAG
GGCGGCGAAG GCTCGAAGAG CTATGCCGGG CTGCAGCAGG TCTGCGAGGC GCTGATCGCC
GCCAAGATCG AACGCAACGA TCTGGTGATC GCGCTCGGCG GCGGCGTGGT CGGCGATCTC
GCTGGCTTTG CCGCCTCCAT CGTCCGCCGC GGCCTCGATT TCGTGCAGGT GCCGACCTCG
CTGCTGGCGC AGGTGGATTC CTCGGTCGGC GGCAAGACCG GGATCAACTC GCCGCACGGC
AAGAATCTGG TCGGCGCGTT TCATCAGCCG GTGCTGGTGA TCGCCGACAC CGCGGTGCTC
GACACGCTGT CGCCGCGGCA GTTTCGCGCC GGCTATGCCG AAGTGGCGAA GTACGGCGCG
CTCGGCGACG AGGCGTTCTT CGCCTGGCTC GAGGCCAACC ACGCCGAGAT CGTGCGCGGC
GGCAGCGCCC GCGAACACGC CATCGCCACC TCCTGCCGCG CCAAGGCGGC GATCGTGGCG
CGCGACGAGC GCGAGACCGG CGAGCGCGCG CTGCTCAATC TCGGCCACAC CTTCGGCCAC
GCGCTGGAAG CCGCCACCGG CTTCTCCGAA CGGCTGTTCC ACGGCGAAGG CGTCGCCGTC
GGCATGGTGC TGGCGGCGCA GTTTTCCGCG GAACGTGGCA TGTTGTCGAA CGACGCCGCG
GCGCGGCTGT CGCATCACCT CGCCGAAGTG GGACTGCCGA CAAGGCTGCA GGACATCGCC
GGTTTCGCGC AGGAGGGCCT GGCCGACGCC GACGCCTTGA TGGCGCTGAT GGCGCAGGAC
AAGAAGGTCA AGCGCGGCCG GCTCACCTTC ATTCTGCTGG AAGCGATCGG CCGCGCGGTG
ATCGCACACG ACGTCGAGCC GGAACCGGTT CGCGATTTTC TGGCGCGCAA GCTCGCGGAC
AAGACTTGA
 
Protein sequence
MTAPLNHSAP ITVEVALGDR GYDIVIGRDV LRSLGTRIAA LRPGARTAIV TDRNVATCWL 
AQTQAALDDV GIVSMPIVVE GGEGSKSYAG LQQVCEALIA AKIERNDLVI ALGGGVVGDL
AGFAASIVRR GLDFVQVPTS LLAQVDSSVG GKTGINSPHG KNLVGAFHQP VLVIADTAVL
DTLSPRQFRA GYAEVAKYGA LGDEAFFAWL EANHAEIVRG GSAREHAIAT SCRAKAAIVA
RDERETGERA LLNLGHTFGH ALEAATGFSE RLFHGEGVAV GMVLAAQFSA ERGMLSNDAA
ARLSHHLAEV GLPTRLQDIA GFAQEGLADA DALMALMAQD KKVKRGRLTF ILLEAIGRAV
IAHDVEPEPV RDFLARKLAD KT