Gene Pnap_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4021 
Symbol 
ID4686123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4286928 
End bp4288373 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content66% 
IMG OID639837035 
Productphenylhydantoinase 
Protein accessionYP_984234 
Protein GI121606905 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCA TCTTGATCCG TGGCGGCACC GTGGTGAACG CCGACCGCGC CTTCCGCGCC 
GATGTGCTGA CGCAGGGCGG CCGCATCGCC GCCGTCGGTG AAGCGCTGGA GGCGCCTGCT
GGCGCCCTGG TCGTCGATGC CGGCGGCCAG TACGTGATGC CCGGCGGCAT CGACCCGCAC
ACGCATATGC AGCTGCCCTT CATGGGAACG GTGACGATGG ACGATTTCTT CAGCGGCACG
GCGGCCGGCC TGGCGGGCGG CACCACCAGC ATCATCGACT TCGTGATTCC CGCGCCGCAA
CAATCGCTGA TGGACGCCTA CCAGACCTGG CGCGGCTGGG CCGAAAAATC CGCCGCCGAC
TACGGCTTTC ATGTCGCCGT CACCTGGTGG GACGAGTCGG TTCGGCGCGA TATGGGCACG
CTGGTGCAGC ACGAAGGCGT GAACAGCTTC AAGCATTTCA TGGCCTACAA GAACGCCATC
ATGTGCGACG ACGAAACGCT GGTGAACAGC TTCAGGCGCT GCCTGGAACT GGGCGCCATG
CCCACGGTGC ATGCCGAAAA CGGCGAACTG GTGTTCATGC TGCAAAAGGA AATCGCTGCC
CAGGGCATCA CCGGCCCCGA AGGCCACCCG CTGTCGCGCC CGCCGATGGT CGAGGCCGAG
GCGGCGAACC GGGCGATTGC GATTGCCGAT GTGCTGAACG TGCCGATCTA CGTCGTGCAT
GTGTCGTGCG TCGAAGCGCT GGAAGCCATT GCACGCGCCA GAGCCCGCGG CCAGCGCGTC
TATGGCGAGG TGCTGGCCGG GCACCTGGTG GTCGATGACA GCGTCTACCG CCACCCCGAC
TTCGCCACCG CCGCCGCGCA TGTGATGAGC CCGCCTTTCA GGCCCAAGGC CAATCAGGAA
TTCCTGTGGC GCGGCCTGCA GGCGGGCAAC CTGCACACCA CGGCGACCGA CCACTGCACC
TTCTGCGCCG CGCAAAAAGC GGCGGGCAAG GACGATTTCG CCAAGATTCC GAACGGCTGC
GGCGGCGTCG AGGAACGCCT GGCCGTGGTC TGGGACGCGG GCGTGAACAC CGGCCGCCTG
ACGCCCAGCG AATTCGTCGC CGTCACCTCG GCCAACACCG CCAAACTGTT CAACATCTAC
CCGCAAAAAG GCAGCGTGTC GGTCGGTGCC GACGCCGACC TGGTGGTCTG GGACCCCGAG
GGCACGAAAA CCCTGTCCGC CAAGACCCAG CACAGCAAGG GCGACTTCAA CATCTTCGAA
GGCCGCACCG TGCGCGGCAT CCCCAGCCAC ACGCTCAGCC AGGGCGAACT GGTGTTCGTG
CAGGGCGACC TGCGCGCCGT TCAGGGCAAG GGCCGCTATA TCAAACGGCC GGCTTTTGGA
GCAAACTTCG CGGCGGCCAA GCTGCGCGCT GAAACGCTGG CACCCAGCCC CGTCGTGCGC
GCCTGA
 
Protein sequence
MTSILIRGGT VVNADRAFRA DVLTQGGRIA AVGEALEAPA GALVVDAGGQ YVMPGGIDPH 
THMQLPFMGT VTMDDFFSGT AAGLAGGTTS IIDFVIPAPQ QSLMDAYQTW RGWAEKSAAD
YGFHVAVTWW DESVRRDMGT LVQHEGVNSF KHFMAYKNAI MCDDETLVNS FRRCLELGAM
PTVHAENGEL VFMLQKEIAA QGITGPEGHP LSRPPMVEAE AANRAIAIAD VLNVPIYVVH
VSCVEALEAI ARARARGQRV YGEVLAGHLV VDDSVYRHPD FATAAAHVMS PPFRPKANQE
FLWRGLQAGN LHTTATDHCT FCAAQKAAGK DDFAKIPNGC GGVEERLAVV WDAGVNTGRL
TPSEFVAVTS ANTAKLFNIY PQKGSVSVGA DADLVVWDPE GTKTLSAKTQ HSKGDFNIFE
GRTVRGIPSH TLSQGELVFV QGDLRAVQGK GRYIKRPAFG ANFAAAKLRA ETLAPSPVVR
A