Gene Pnap_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4067 
Symbol 
ID4688990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4340848 
End bp4342071 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content69% 
IMG OID639837080 
Productdiaminopropionate ammonia-lyase 
Protein accessionYP_984279 
Protein GI121606950 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01747] diaminopropionate ammonia-lyase family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGCC TTCCAAGCAT TTCAACCGAC TACTTTCTCA ACCCGGCGTT CCAGCGCGAG 
GAGCCCTATG GCGAGCAGCG CGCCAGCATT CTCAACGCCG GCGCGCTGGC CAGTGCCGGG
CAGGAAATCA CCGCCTGGCC CGGCTACCAG GTCACGCCCC GGCATTCGCT GGGCGGCCTG
GCCGATGCGC TGGGCGTGGC CTCAATCAAT TACAAGGACG AAGGCTCGCG CTTCGGCCTG
GGCAGCTTCA AGGCCCTCGG GGGCGCCTAC GCGGTCGGCC GGCTGCTGTG CCGCGTGCTG
GGGCAGCAGT TGGGCCGGGT CGTGAACCCG TCCGACCTGC TGCAGGGCGA GTTGCGCCAG
GCCGCCTCAA GCATCACGGT GACCTGCGCC ACCGACGGCA ACCATGGCCG GTCGGTGGCC
TGGGGCGCGC AGCTGTTCGG CTGCCGCTGC GTGATCTACA TTCACGCCTC GGTCAGCGAC
GGACGCAGGC AGGCCATCGA ACGCTATGGC GCGCAGGTGG TGCGCACCGA AGGCAACTAC
GACGACGCCG TGCGGCAGGC CGACCGCGAC GCCCGGGCCA AGGGCTGGCA CGTCATCTCC
GACACGTCCT ACCCGGGCTA CATGGACGTT CCGCGCGACG TCATGCAGGG CTACCAGCTG
ATGGTCCAAG AGGTCGTCCA GCAGCTGGGC GCCTGGCCGA CGCATGTCTT CGTGCAAGCC
GGCGTGGGCG GATTTTCCGC AGCCGTCTGC GCCTATTTCT GGGAGCGCGA CGCGCAGCGG
CGGCCCTTCT TCACGGTGGT GGAGCCGACC CGCGCCGACT GCCTGCTGCA AAGCGCGCGC
AACGGCCGCA TCACGGCGGT CACGGGCGAG CTGGACACGC TGATGGCCGG CCTGGCCTGC
GGCGAGGTGT CGCTGCTGGC CTGGGACATC CTGGAACGCG GCGCCAACGC CTTCTGCACC
GTGGACGACG ACGCGGCCGT AGCGGTGATG CGGCTGCTGG CGCATCCGCC ACGGAACGAC
GCCGCCATTG TGGCCGGTGA ATCGGCCGTG GCCGGGCTGG CTGCGGCGAT CGGCATCGCC
AGCAACCCCG AAGCCCGGGC CGCGTTCAGA ATCAATGCCG AAAGCCGGAT TCTGTTCTTT
GGCAGCGAAG CCGACACGGA CCCGGCGCTT TACCGGCAGC TGGTCGGCGC CAGCGCGGCC
GACGTCCTCG AGACGCAGGC ATGA
 
Protein sequence
MHSLPSISTD YFLNPAFQRE EPYGEQRASI LNAGALASAG QEITAWPGYQ VTPRHSLGGL 
ADALGVASIN YKDEGSRFGL GSFKALGGAY AVGRLLCRVL GQQLGRVVNP SDLLQGELRQ
AASSITVTCA TDGNHGRSVA WGAQLFGCRC VIYIHASVSD GRRQAIERYG AQVVRTEGNY
DDAVRQADRD ARAKGWHVIS DTSYPGYMDV PRDVMQGYQL MVQEVVQQLG AWPTHVFVQA
GVGGFSAAVC AYFWERDAQR RPFFTVVEPT RADCLLQSAR NGRITAVTGE LDTLMAGLAC
GEVSLLAWDI LERGANAFCT VDDDAAVAVM RLLAHPPRND AAIVAGESAV AGLAAAIGIA
SNPEARAAFR INAESRILFF GSEADTDPAL YRQLVGASAA DVLETQA