Gene Pnap_0679 details


Gene Information

Locus tag: Pnap_0679
Symbol: aroB
ID: 4689702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 723057
End bp: 724175
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 63%
IMG OID: 639833673
Product: 3-dehydroquinate synthase
Protein accession: YP_980919
Protein GI: 121603590
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
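
The start, end, gene length and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon. The helper names `span_length` and `coding_codons` are hypothetical and not part of this record.

```python
# Values copied from the Gene Information fields above.
START_BP = 723057
END_BP = 724175
GENE_LENGTH_BP = 1119
PROTEIN_LENGTH_AA = 372


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 724175 - 723057 + 1 = 1119 bp, and 1119 / 3 = 373 codons, of which 372 encode amino acids.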


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.241981
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCAA ATCCTCCTTC TCTCGCAACG GCCCAGGTGC AAATCAACCT GGCCGAACGC 
AGCTATCCCA TCCTGATTGG CACTTCACTG CTGGCCAATG CACTGACCTA TCAGCATCTG
CCCCAAGCCG CAACGGCACT CGTGGTGTCC AACACGACCG TGGCGCCCCT GTACGCGGCG
CAATTGACTG AAGCGCTGCA GGCGCACTAC GGCAAGGTTC TGCTGGTGAC CCTGCCCGAT
GGCGAAGTCC ACAAGGACTG GCCGACCCTG CAACTGATTT TTGATGCGCT GCTTGAAAAC
GGCTGCGACC GCAAGACGGT GCTTTTCGCG CTCGGCGGCG GTGTGGTGGG CGACATGACC
GGCTTTGCGG CGGCCAGCTA CATGCGCGGC GTGCCGTTTG TGCAGGTGCC GACAACCCTG
CTGGCCCAGG TGGACTCGTC GGTCGGTGGC AAGACCGCGA TCAATCACCC GCTGGGCAAG
AACATGATTG GCGCGTTCTA CCAGCCCCAG CAGGTAATCT GCGACCTGGA GGTGCTGAAG
ACCTTGCCCG ACCGTGAACT GAGCGCCGGA CTGGCCGAAG TCATCAAGTA CGGGCCGATT
GCCGACATGG CCTTCCTCGA CTGGATCGAA GCCAACCTGG ATGCGCTGCT GGCCAAGGAG
CCTGCCGCGC TGGCGCACGC CATTCAGCGC AGCTGCGAGA TCAAGGCCTG GGTCGTCGGC
CAGGATGAGC GCGAGTCGGG TCTGCGGGCG ATTCTGAATT TCGGCCACAC CTTTGGCCAT
GCGATTGAAT CCGGGCTGGG CTATGGCGAA TGGCTGCACG GCGAAGGCGT GGGCTGCGGC
ATGGTGATGG CCGCGCACCT GTCGCAGCGC CTGGGCCGGA TTGACATGGC GTTTGTGCAG
CGCCTGACCA CGCTGATCCA GCGCGCCGGA CTGCCGGTCA AGGCGCCGCT GCTCTCAAGC
ACGGACAATG CAGGCCGCTA CCTCGACCTG ATGCGGATTG ACAAGAAATC CGAAGCCGGC
GAGATTCGCT TCGTGGTGAT TGATGGACCG GGCAAGGCCG CCGTGTGCGC CGCGCCCGAT
GCCGTGGTGC GTGAAGTCAT CGACTTGTGC TGCGCCTGA
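
The GC content reported for this gene can in principle be recomputed from the gene sequence. The following is a rough sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length. How the record itself computes or rounds the value is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Toy inputs to show the behaviour; paste the full gene sequence above
# (the 10-base blocks can stay as they are) to check the reported value.
print(gc_content("ATGC"))        # 50.0
print(gc_content("GGCC AATT"))   # 50.0
```

Because whitespace is stripped first, the sequence can be passed exactly as it is laid out in the record.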
 
Protein sequence
MQANPPSLAT AQVQINLAER SYPILIGTSL LANALTYQHL PQAATALVVS NTTVAPLYAA 
QLTEALQAHY GKVLLVTLPD GEVHKDWPTL QLIFDALLEN GCDRKTVLFA LGGGVVGDMT
GFAAASYMRG VPFVQVPTTL LAQVDSSVGG KTAINHPLGK NMIGAFYQPQ QVICDLEVLK
TLPDRELSAG LAEVIKYGPI ADMAFLDWIE ANLDALLAKE PAALAHAIQR SCEIKAWVVG
QDERESGLRA ILNFGHTFGH AIESGLGYGE WLHGEGVGCG MVMAAHLSQR LGRIDMAFVQ
RLTTLIQRAG LPVKAPLLSS TDNAGRYLDL MRIDKKSEAG EIRFVVIDGP GKAAVCAAPD
AVVREVIDLC CA
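
The protein sequence follows from the gene sequence by codon translation. The following is a minimal sketch using the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and that is not handled here). The names `CODON_TABLE` and `translate` are hypothetical, and the code assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCAAGCAA ATCCTCCTTC TCTCGCAACG"))  # MQANPPSLAT
print(translate("TGCTGCGCCTGA"))                      # CCA
```

The two fragments reproduce the first ten residues and the last three residues of the protein sequence listed above.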