Gene Pnap_4700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4700 
Symbol 
ID4685954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008760 
Strand
Start bp83264 
End bp84334 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID639826693 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_973856 
Protein GI121583425 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.0467137 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCC AAATCTCAGA CATCCACATC GCCCAGGCCG ACCCGCTTCC GCAACCCCGG 
CTGCTGCAGG GCGAACTGCC AGCGGGCGAA GCCGAAGCCG CATTCATTGC CGCCTCGCGC
GCCGCCACCC GCAATATATT GCGCGGCCTG GATGACAGGC TGCTGGTTAT CGTGGGCCCG
TGCTCGATCC ACGAGCCTGA GTCGGCCCTG GAATACGCTG CACGGCTGCG CCGGCTGGCC
CCGCGCCTGG ACGATTCGCT GCTGCTGGTG ATGCGCGTCT ACTTCGAAAA ACCGCGCACG
CGCATGGGCT GGAAGGGTTT GATCTACGAT CCGGAACTCG ACGGCCAGGG CGACATTGGC
GCGGGCCTGC GCCATGCGCG GCGCATCTTG CTGGAATGCG CGCGGCTGGG CGTGCCGGCG
GCCTCTGAAA TCCTGGACCT GGTGACGCCG CAGTATTACG CCGAACTGCT CACCTGGGGC
GCGATCGGCG CCCGCACGGT GCAAAGCCCG CTGCACCGGC AGATGGCTTC GGCCCTGTCG
GCGCCCGTGG GCTTCAAGAA CGCTACCAAC GGCAGCGTGG GCGCCGCCAT CGACGCCATC
CATGTGGCCG TCCAGTCGCA TCGCTTTCCC TCCATCTCGC TCGAAGGCAA GGCCATCGTC
ATCACGACCA CCGGCAACCC TGATGGCCAC CTGGTGCTGC GCGGCGCCAG TGACGGGCCG
AACTACGACG CCGCCAGCGT CAGCCGCGCC GCGGCGAGCC TGTCCCAGGC CGGCCTGCCC
GCGCGGCTGG TGATCGACTG CAGCCACGGC AACAGCAACA AGGACTTTTC CAGGCAGCCC
GCCGTGGCGG CCGATATCGC GCAGCAGATC GCCAGCGGCT CGAGCAGCAT CTGCGGCCTC
ATGATTGAGA GCCACCTGGT CGAAGGCCGG CAGGACATCG TCGATGGCCG CCAAGGCCTG
CGCTACGGGC AGAGCGTCAC CGACGCCTGC ATCGGCTGGG AGGCGACCGT GGCCGTGCTG
GAGCAGCTGG CGGCGTCCGT GCGCCAGCGC CGGGCGGGCG CCAGGGCTTG A
 
Protein sequence
MSTQISDIHI AQADPLPQPR LLQGELPAGE AEAAFIAASR AATRNILRGL DDRLLVIVGP 
CSIHEPESAL EYAARLRRLA PRLDDSLLLV MRVYFEKPRT RMGWKGLIYD PELDGQGDIG
AGLRHARRIL LECARLGVPA ASEILDLVTP QYYAELLTWG AIGARTVQSP LHRQMASALS
APVGFKNATN GSVGAAIDAI HVAVQSHRFP SISLEGKAIV ITTTGNPDGH LVLRGASDGP
NYDAASVSRA AASLSQAGLP ARLVIDCSHG NSNKDFSRQP AVAADIAQQI ASGSSSICGL
MIESHLVEGR QDIVDGRQGL RYGQSVTDAC IGWEATVAVL EQLAASVRQR RAGARA