Gene Pnap_3398 details

Gene Information

Locus tag: Pnap_3398
Symbol:
ID: 4686280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3603412
End bp: 3604533
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 63%
IMG OID: 639836412
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_983616
Protein GI: 121606287
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.677013
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCCA AAGCCACCCC CGCCAGCCCA AGCTGGTATG CGGACGTCGA AAAAACCAGC 
CAGACCGACG ACAAACGCAT CAAGGACATC ACCGTGCTAC CCCCTCCCGA GCATCTCATC
CGCTTTTTCC CGATCAACGG CACCGCCGTC GAATCGCTGA TCACCCAGAC GCGCCAGAAC
ATCCACAACA TCATGGCCGG CACCGACGAC CGCCTGCTGG TGGTGATTGG CCCGTGCTCG
ATCCACGACC CGATGGCCGC GCTCGACTAT GCCCGCCGCC TGGCCGAGCA GCGCAAGAAA
TACGCCGGAA CGCTGGAAAT CGTGATGCGG GTGTACTTTG AAAAGCCGCG CACCACCGTC
GGCTGGAAAG GCTTGATCAA CGACCCGTAC CTCGATGAAA CCTTCCGCAT CGACGAGGGC
CTGCGCATTG CCCGCCAGCT GCTGATCGAC ATCAACCGCC TGGACGTGCC GGCCGGCAGC
GAGTTCCTGG ACGTGATCTC CCCGCAGTAC ATCGGCGACC TGATTTCCTG GGGCGCGATT
GGCGCGCGCA CCACCGAAAG CCAGGTGCAC CGCGAACTGG CTTCGGGCCT GTCGGCGCCA
ATTGGCTTCA AGAACGGCAC CGACGGCAAC ATCCGCATCG CCACCGATGC CATCCAGGCG
GCGGCGCGCG GCCACCATTT CCTGTCGGTC CACAAAAACG GCCAGGTCGC GATTGTGCAA
ACCCAGGGCA ACAAGGACTG CCACGTCATC CTGCGCGGCG GCAAGGCGCC CAACTATGAC
GCCGCCAGCG TGAGCGCTGC CTGCAAGGAA CTGCATGCCG CAGGCCTGCC GGCCACCTTG
ATGGTCGATT GCAGCCATGC CAACAGTTCC AAAAAGCACG AAAGGCAGAT GGACGTGGCG
CGCGACATTG CCGCCCAGAT CGCCGATGGC TCACGCAATG TGTTTGGCCT GATGGTTGAA
AGCCACCTGA AGGCCGGCGC GCAAAAGTTC ACGGCCGGCA AGGACGATGC CCGGGCGCTC
GAATATGGCC AGAGCATCAC CGACGCCTGC CTGGGCTGGG ACGACTCGCT GGCCATGCTG
GAGTCGCTGT CAGCCGCCGT AGAGGCTCGT CGGGCAAAAT AA
 
Protein sequence
MNAKATPASP SWYADVEKTS QTDDKRIKDI TVLPPPEHLI RFFPINGTAV ESLITQTRQN 
IHNIMAGTDD RLLVVIGPCS IHDPMAALDY ARRLAEQRKK YAGTLEIVMR VYFEKPRTTV
GWKGLINDPY LDETFRIDEG LRIARQLLID INRLDVPAGS EFLDVISPQY IGDLISWGAI
GARTTESQVH RELASGLSAP IGFKNGTDGN IRIATDAIQA AARGHHFLSV HKNGQVAIVQ
TQGNKDCHVI LRGGKAPNYD AASVSAACKE LHAAGLPATL MVDCSHANSS KKHERQMDVA
RDIAAQIADG SRNVFGLMVE SHLKAGAQKF TAGKDDARAL EYGQSITDAC LGWDDSLAML
ESLSAAVEAR RAK
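
Consistency check (illustrative)

The derived fields in this record (gene length, protein length, GC content) can be cross-checked against the coordinates and the sequence blocks. The sketch below is one way to do that, not part of the database itself. It assumes the start and end coordinates are 1-based and inclusive, that the CDS is unspliced and ends in a single stop codon (the gene sequence above ends in TAA), and that spaces and line breaks in the sequence blocks are only visual grouping. The function names are hypothetical.

```python
def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq_block.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    return gene_len_bp // 3 - 1


def gc_percent(seq_block: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) nucleotide sequence."""
    seq = clean(seq_block)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Values taken from the Gene Information table above.
    print(gene_length(3603412, 3604533))   # 1122
    print(protein_length(1122))            # 373

    # First line of the gene sequence block, as displayed above.
    first_line = "ATGAATGCCA AAGCCACCCC CGCCAGCCCA AGCTGGTATG CGGACGTCGA AAAAACCAGC"
    print(len(clean(first_line)))          # 60
```

Passing the full gene sequence block to gc_percent gives a value to compare with the "GC content" field, which the record reports rounded to a whole percent.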