Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_4700 |
Symbol | |
ID | 4685954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008760 |
Strand | + |
Start bp | 83264 |
End bp | 84334 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639826693 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_973856 |
Protein GI | 121583425 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.0467137 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCC AAATCTCAGA CATCCACATC GCCCAGGCCG ACCCGCTTCC GCAACCCCGG CTGCTGCAGG GCGAACTGCC AGCGGGCGAA GCCGAAGCCG CATTCATTGC CGCCTCGCGC GCCGCCACCC GCAATATATT GCGCGGCCTG GATGACAGGC TGCTGGTTAT CGTGGGCCCG TGCTCGATCC ACGAGCCTGA GTCGGCCCTG GAATACGCTG CACGGCTGCG CCGGCTGGCC CCGCGCCTGG ACGATTCGCT GCTGCTGGTG ATGCGCGTCT ACTTCGAAAA ACCGCGCACG CGCATGGGCT GGAAGGGTTT GATCTACGAT CCGGAACTCG ACGGCCAGGG CGACATTGGC GCGGGCCTGC GCCATGCGCG GCGCATCTTG CTGGAATGCG CGCGGCTGGG CGTGCCGGCG GCCTCTGAAA TCCTGGACCT GGTGACGCCG CAGTATTACG CCGAACTGCT CACCTGGGGC GCGATCGGCG CCCGCACGGT GCAAAGCCCG CTGCACCGGC AGATGGCTTC GGCCCTGTCG GCGCCCGTGG GCTTCAAGAA CGCTACCAAC GGCAGCGTGG GCGCCGCCAT CGACGCCATC CATGTGGCCG TCCAGTCGCA TCGCTTTCCC TCCATCTCGC TCGAAGGCAA GGCCATCGTC ATCACGACCA CCGGCAACCC TGATGGCCAC CTGGTGCTGC GCGGCGCCAG TGACGGGCCG AACTACGACG CCGCCAGCGT CAGCCGCGCC GCGGCGAGCC TGTCCCAGGC CGGCCTGCCC GCGCGGCTGG TGATCGACTG CAGCCACGGC AACAGCAACA AGGACTTTTC CAGGCAGCCC GCCGTGGCGG CCGATATCGC GCAGCAGATC GCCAGCGGCT CGAGCAGCAT CTGCGGCCTC ATGATTGAGA GCCACCTGGT CGAAGGCCGG CAGGACATCG TCGATGGCCG CCAAGGCCTG CGCTACGGGC AGAGCGTCAC CGACGCCTGC ATCGGCTGGG AGGCGACCGT GGCCGTGCTG GAGCAGCTGG CGGCGTCCGT GCGCCAGCGC CGGGCGGGCG CCAGGGCTTG A
|
Protein sequence | MSTQISDIHI AQADPLPQPR LLQGELPAGE AEAAFIAASR AATRNILRGL DDRLLVIVGP CSIHEPESAL EYAARLRRLA PRLDDSLLLV MRVYFEKPRT RMGWKGLIYD PELDGQGDIG AGLRHARRIL LECARLGVPA ASEILDLVTP QYYAELLTWG AIGARTVQSP LHRQMASALS APVGFKNATN GSVGAAIDAI HVAVQSHRFP SISLEGKAIV ITTTGNPDGH LVLRGASDGP NYDAASVSRA AASLSQAGLP ARLVIDCSHG NSNKDFSRQP AVAADIAQQI ASGSSSICGL MIESHLVEGR QDIVDGRQGL RYGQSVTDAC IGWEATVAVL EQLAASVRQR RAGARA
|
| |