Gene PA14_23240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_23240 
Symbol 
ID4381163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp2011997 
End bp2013331 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content69% 
IMG OID639324425 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_790010 
Protein GI116051159 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00576591 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCAACG TCCGTAACCC CTTCGACCTC CTCCTGCTGC CGACCTGGAT CGTCCCCGTG 
GAGCCCGCCG GGGTGGTGCT GCGCGATCAC GCGCTGGGCA TCCGCGACGG CCAGATCGCC
CTGGTCGCGC CGCGCGAGCA GGCCATGCGC CATGGCGCCA CGGAAATCCG CGAATTGCCC
GGCATGCTGC TCGCCCCCGG CCTGGTCAAC GCCCACGGCC ATTCGGCAAT GAGTCTGTTC
CGCGGTCTCG CCGACGACCT GCCGCTGATG ACCTGGCTGC AGGACCACAT CTGGCCGGCC
GAAGGCCAAT GGGTCAGCGA GGACTTCATC CGCGACGGCA CGGAGCTGGC CATCGCCGAA
CAGGTAAAGG GCGGCATCAC CTGTTTCTCC GACATGTACT TCTATCCACA GGCCATCTGC
GGCGTGGTCC ATGACAGCGG GGTACGCGCC CAGGTGGCGA TCCCGGTGCT GGACTTCCCG
ATCCCCGGCG CCCGCGACAG CGCCGAGGCG ATCCGCCAGG GCATGGCACT GTTCGACGAC
CTCAAGCACC ACCCGCGCAT CCGCATCGCC TTCGGCCCGC ACGCCCCTTA TACGGTGAGC
GACGACAAGC TGGAGCAGAT CCTGGTGCTC ACCGAGGAAC TCGACGCCAG CATCCAGATG
CACGTCCACG AGACCGCCTT CGAGGTGGAG CAGGCCATGG AGCGCAACGG CGAGCGCCCG
TTGGCCCGCC TGCACCGCCT CGGCCTGCTC GGCCCGCGCT TCCAGGCGGT GCACATGACC
CAGGTAGACG ACGACGACCT GGCGATGCTG GTGGAAACCA ACAGTTCGGT GATCCACTGC
CCGGAATCCA ACCTCAAGCT GGCCAGCGGC TTCTGCCCGG TGGAAAAGCT CTGGCAGGCC
GGGGTCAACG TGGCCATCGG CACCGACGGC GCGGCCAGCA ACAACGACCT CGACCTGCTC
GGCGAGACCC GCACCGCGGC GCTGCTGGCC AAGGCAGTGT ACGGCCAGGC CACCGCCCTC
GACGCCCACC GCGCGCTGCG CATGGCCACC CTGAACGGAG CCCGCGCGCT TGGCCTGGAG
CGCCTGATCG GCTCCCTGGA AGCCGGCAAG GCCGCCGACC TGGTGGCCTT CGACCTGTCC
GGCCTGGCCC AGCAACCGGT CTACGACCCG GTTTCGCAAC TTATCTATGC CAGCGGCCGC
GACTGCGTGC GGCATGTCTG GGTCGGCGGC AGGCAACTCC TCGACGACGG CCGCCTGCTC
CGTCACGACG AACAGCGCCT GATCGCCAGG GCCCGCGAAT GGGGGGCGAA GATCGCCGCC
AGCGACAGGT CCTGA
 
Protein sequence
MPNVRNPFDL LLLPTWIVPV EPAGVVLRDH ALGIRDGQIA LVAPREQAMR HGATEIRELP 
GMLLAPGLVN AHGHSAMSLF RGLADDLPLM TWLQDHIWPA EGQWVSEDFI RDGTELAIAE
QVKGGITCFS DMYFYPQAIC GVVHDSGVRA QVAIPVLDFP IPGARDSAEA IRQGMALFDD
LKHHPRIRIA FGPHAPYTVS DDKLEQILVL TEELDASIQM HVHETAFEVE QAMERNGERP
LARLHRLGLL GPRFQAVHMT QVDDDDLAML VETNSSVIHC PESNLKLASG FCPVEKLWQA
GVNVAIGTDG AASNNDLDLL GETRTAALLA KAVYGQATAL DAHRALRMAT LNGARALGLE
RLIGSLEAGK AADLVAFDLS GLAQQPVYDP VSQLIYASGR DCVRHVWVGG RQLLDDGRLL
RHDEQRLIAR AREWGAKIAA SDRS