Gene RPB_0536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0536 
SymbolaroB 
ID3909575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp601413 
End bp602561 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content70% 
IMG OID637882424 
Product3-dehydroquinate synthase 
Protein accessionYP_484158 
Protein GI86747662 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCGC CGCTGAACCA TTCCGCTCCG ATCAAGGTCG AGGTCGCGCT CGGCGATCGC 
GCCTACGACA TCGTGATCGG CCGCAACGTG CTCGGCACGC TCGGCGAGCG GATCGCCAAG
CTGCGGCCCG GCGCGCGCAC CGCGATCGTC ACCGATCGCA CCGTGGCGCG GACCTGGCTG
GCGCCGACCG AGGCGGCGCT GGATGCCGCC GGCATCGCGC ATGCGCGCGT CGTGGTCGGC
GAGGGCGAAA GCTCCAAGAC CTATGCGGGG CTGGCGGAAG TCAGCGAGGC GCTGATCGCC
GCCAAGATCG AACGCAACGA TCTGGTGATC GCGCTCGGCG GCGGCGTGGT CGGCGATCTC
GCCGGCTTCG CGGCGTCGAT CCTGCGCCGC GGCGTCGATT TCGTGCAGGT GCCGACCTCG
CTGCTGGCAC AGGTTGATTC GTCGGTCGGC GGCAAGACCG GCATCAACTC GCCGCAGGGC
AAGAACCTGC TCGGCGCGTT CCATCAGCCC GTATTGGTGA TCGCCGACAC CGCGGTGCTC
GACACGCTGT CGCCGCGCCA GTTCCGTGCC GGCTATGCCG AAGTGGCGAA ATACGGCGCG
CTCGGCGACG AGGCGTTCTT CGCCTGGCTC GAAGCCAATC ACGCCGAGAT CGTCTCAGGC
GGGCCGGCGC GCGAGCACGC CATCGCCACG TCGTGCCGGG CGAAGGCGGC GATCGTGGCG
CGCGACGAGC GCGAAAACGG CGAGCGCGCG CTGCTCAATC TCGGCCACAC GTTCGGCCAT
GCGCTGGAGG CCGCGACCGG CTTCTCCGAC CGGCTGTTTC ACGGCGAGGG CGTGGCGATC
GGCATGGTGC TGGCGGCGCG GTTCTCCGCC GAGCGCGGCA TGATGCCGGA GGCCGACGCC
ATCCGGCTGC AGCGCCATCT CGCCGATGTC GGCCTGCCGA CCCGGCTGCA GGACATCGCC
GGCTTCGCCC AGGAAGGCCT CGCCGACGCC GACGCGCTGT TGGCGCTGAT GACTCAGGAC
AAGAAGGTCA AACGCGGCCA GCTCACCTTC ATCCTGATGG AAGGGATCGG CCGCGCGGTG
ATCGCCGACA AGGTCGAGCC GGCGCCGGTT CGCGATTTCC TGGCCCGGCA GCTCGCGCGC
GCATCGTGA
 
Protein sequence
MTAPLNHSAP IKVEVALGDR AYDIVIGRNV LGTLGERIAK LRPGARTAIV TDRTVARTWL 
APTEAALDAA GIAHARVVVG EGESSKTYAG LAEVSEALIA AKIERNDLVI ALGGGVVGDL
AGFAASILRR GVDFVQVPTS LLAQVDSSVG GKTGINSPQG KNLLGAFHQP VLVIADTAVL
DTLSPRQFRA GYAEVAKYGA LGDEAFFAWL EANHAEIVSG GPAREHAIAT SCRAKAAIVA
RDERENGERA LLNLGHTFGH ALEAATGFSD RLFHGEGVAI GMVLAARFSA ERGMMPEADA
IRLQRHLADV GLPTRLQDIA GFAQEGLADA DALLALMTQD KKVKRGQLTF ILMEGIGRAV
IADKVEPAPV RDFLARQLAR AS