Gene Pnap_4121 details

Gene Information

Locus tag: Pnap_4121
Symbol:
ID: 4686188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4409056
End bp: 4410153
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 67%
IMG OID: 639837133
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_984332
Protein GI: 121607003
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
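
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4409056
end_bp = 4410153

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the terminal stop codon adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1098 365"
```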


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.752451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.115252
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACCC ACGTCCGCCC TGCCACGCCG CTTTCCACCC ACGACACCAC TCGCATCGAC 
GACCTGCGCA TCGGCGCGGT GCGTCCGCTG ATCACGCCCG CGCTGCTGCA GGAATGGCTG
CCCACGCCGG TCAGCGTCCA GGCGCTGGTC GCGGCCAGCC GCGCCGCGAT CTCGCGTGTG
CTGCACGGCG CCGACGACCG GCTGGTGGTC GTGGTCGGGC CGTGTTCCAT CCACGACCAT
GCGCAGGCCA TGGACTACGC CCGCCAATTC AAGGCGCAGG CCGATGCGCT CAAGGACGAT
TTGCTGGTCG TGATGCGCGT GTATTTTGAA AAGCCGCGCA CCACCGTTGG CTGGAAGGGC
TACATCAACG ACCCGCACCT GGACGGCAGC TTTGCCATCA ACGAAGGGCT GGAAATGGCG
CGCCAGCTGC TGCTCGACGT GCTGGCGCTC GGCCTGCCGG TGGGCACCGA ATTCCTTGAT
CTGCTGAGCC CGCAGTTCAT CAGCGACCTG GTGAGCTGGG GCGCGATTGG CGCGCGCACC
ACCGAAAGCC AGAGCCACCG CCAGCTCGCC AGCGGCCTGT CGTGCCCGGT CGGCTTCAAG
AACGGCACCG ACGGCGGCGT GAAGGTGGCA GCCGACGCCA TCCAGGCGGC GCAGGCCACG
CACGCCTTCA TGGGCATGAC CAAGATGGGC CAGGCGGCAA TTTTTGAAAC CCGGGGCAAT
GACGACTGCC ATGTGATTCT GCGCGGCGGC AAGCAGACCA ATTATTCAAA GGCCGACGTG
GACGCGACCT GCGCGCAACT CAGGGCCGCC GGCCTGCGCG AGCAGGTGAT GATCGACGTG
TCGCACGCCA ACAGCAGCAA GCAGCACCAG CGGCAAATCG AAGTCGCCGC CGACGTGGCG
AGCCAGGTGG CGGCGGGCGA CCACCGCATC ATGGGCCTGA TGATTGAAAG CCACCTCAAC
GAAGGCCGGC AGGACATCGT CGCCGGCCAG CCCTTGAAGC ACGGCGTGTC GGTGACCGAT
GCGTGCATCA GTTTTGCGCA GACCGTGCCC GTGCTGCAAG GGCTGGCGGC GGCGGTGCGG
GCGCGGCGCC TGGCTTGA
 
Protein sequence
MNTHVRPATP LSTHDTTRID DLRIGAVRPL ITPALLQEWL PTPVSVQALV AASRAAISRV 
LHGADDRLVV VVGPCSIHDH AQAMDYARQF KAQADALKDD LLVVMRVYFE KPRTTVGWKG
YINDPHLDGS FAINEGLEMA RQLLLDVLAL GLPVGTEFLD LLSPQFISDL VSWGAIGART
TESQSHRQLA SGLSCPVGFK NGTDGGVKVA ADAIQAAQAT HAFMGMTKMG QAAIFETRGN
DDCHVILRGG KQTNYSKADV DATCAQLRAA GLREQVMIDV SHANSSKQHQ RQIEVAADVA
SQVAAGDHRI MGLMIESHLN EGRQDIVAGQ PLKHGVSVTD ACISFAQTVP VLQGLAAAVR
ARRLA
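
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It assumes the sequence is given 5' to 3' in the coding frame, and it uses the standard codon-to-residue assignments, which translation table 11 shares (table 11 differs mainly in the start codons it allows, which does not matter here because the gene begins with ATG).

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (standard assignments,
# which NCBI translation table 11 also uses for elongation).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGAACACCC ACGTCCGCCC TGCCACGCCG"
print(translate(first_blocks))  # prints "MNTHVRPATP"
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and the GC content given in the record.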