Gene Pnap_4068 details


Gene Information

Locus tag: Pnap_4068
Symbol:
ID: 4689055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4342068
End bp: 4343312
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 69%
IMG OID: 639837081
Product: allantoate amidohydrolase
Protein accession: YP_984280
Protein GI: 121606951
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01879] amidase, hydantoinase/carbamoylase family
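
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4342068, 4343312)
print(length, protein_length_aa(length))  # prints: 1245 414
```

Under these assumptions the computed values agree with the Gene Length (1245 bp) and Protein Length (414 aa) fields.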


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCCAG TACATAACCT GAAAATCAAC CCCCAGCGGC TGGCGCAAAG CCTGGCGGCG 
CTGGGGCAAG TCGGCACGCT GGAGGGCGGC GGCGTGAACC GGCTGGCCCT CACCGAGGCC
GACCGGCTCG GCCGCGAATG GACGCTGGCG CGCATGCGCG AGCTGGGCAT GGCCGTCACC
ATCGACGCCA TCGGCAACGT CACTGCCATC TATGCCGGCA CCGAAGACCT GCCGCCGGTG
ATGACCGGCT CGCATATTGA CACCGTGCGC ACCGGGGGGC TGTACGACGG CAACTACGGC
GTTCTGGCCG GCCTGGAAGT GGTGGCCACG CTGCGCGATG CCGGTGTGCG GCCGCGCCGG
CCGATTGCGG TCGCGTTCTT CACCAACGAG GAAGGCGCGC GCTTTCAGCC CGACATGATG
GGCAGCCTGG TGTATGTCGG CGGCCTGCCG CTGGCGCAGG CGCTGGCCAC CCGCGCCGCA
GACGGCCACA CCGTGGAAGA AGAACTGCAG CGTATAGGCT ACCGGGGACC GGCGGCCGTC
GGCTGCCCGG TGGTGGACAG CTTCGTGGAG CTGCACATCG AGCAGGGTCC GGTGCTGCAC
CAGCAAGGCC TGCAGATTGG CGTGGTCGAG GGCGTTCAGG GCATCTCGTG GACCGAGTTC
ACGATTGAAG GCGTGTCCAA CCATGCCGGC ACGACGCCTA TGGCCCTGCG CCATGACGCG
GGCGTGGTGG CGGTGCGCAT CGCGGCTTTT GTTCATGACC TGGCCTTGCG TTACGGCGGC
CGCCAGCTGG CGACCGTGGG CTCGATGCAG CTGTCGCCCA ACCTGGTCAA CGTCATTGCC
CAGCGCGCCG TGTTCACGGT GGACCTGCGC AACACCGACG AGGCCACGCT GGCCTGCGCC
GAGGCCGAGG TGCATGCGTT CGCCGCGCAG TGCGCGGCTG CGCAAGGCGT TGCGTGCAGC
CAGCGGCGCC TGGCGCGCTT CGAGCCGGTG GCGTTCGACC CGCTGGTGGT TAGCCTGATC
GAGCAGGAAA CCCGGGCGCT GGGCCTGTCC GCCCTGCGCC TGCCCAGCGG CGCCGGACAC
GACGCGCAGA TGCTGGCGCG GGTCTGCCCC GCCGGGATGA TCTTCGTGCC CAGCGTCAAT
GGACTGAGCC ACAACGTGAA CGAGTTCACC GAGCCCGACG ACCTGGCGCA GGGCGCGCAG
GTCTTGCTGC AGGTGCTGAT GCGGCTGGCC CAGCGTGGTG TTTGA
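
The GC content field can in principle be recomputed from a sequence laid out in space-separated blocks of ten bases, as above. This is a minimal sketch; the function name `gc_percent` is hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: pass the full gene sequence text, whitespace included.
first_blocks = "ATGAGTCCAG TACATAACCT"
print(gc_percent(first_blocks))
```

Passing the complete gene sequence rather than the first two blocks gives a value that can be compared with the GC content field.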
 
Protein sequence
MSPVHNLKIN PQRLAQSLAA LGQVGTLEGG GVNRLALTEA DRLGREWTLA RMRELGMAVT 
IDAIGNVTAI YAGTEDLPPV MTGSHIDTVR TGGLYDGNYG VLAGLEVVAT LRDAGVRPRR
PIAVAFFTNE EGARFQPDMM GSLVYVGGLP LAQALATRAA DGHTVEEELQ RIGYRGPAAV
GCPVVDSFVE LHIEQGPVLH QQGLQIGVVE GVQGISWTEF TIEGVSNHAG TTPMALRHDA
GVVAVRIAAF VHDLALRYGG RQLATVGSMQ LSPNLVNVIA QRAVFTVDLR NTDEATLACA
EAEVHAFAAQ CAAAQGVACS QRRLARFEPV AFDPLVVSLI EQETRALGLS ALRLPSGAGH
DAQMLARVCP AGMIFVPSVN GLSHNVNEFT EPDDLAQGAQ VLLQVLMRLA QRGV