Gene PA14_01750 details

Gene Information

Locus tag: PA14_01750
Symbol:
ID: 4383541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 159868
End bp: 161217
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 71%
IMG OID: 639322698
Product: hydroxydechloroatrazine ethylaminohydrolase
Protein accession: YP_788299
Protein GI: 116053862
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
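
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 159868
END_BP = 161217
REPORTED_GENE_LENGTH_BP = 1350
REPORTED_PROTEIN_LENGTH_AA = 449


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print(computed_gene == REPORTED_GENE_LENGTH_BP)        # True
print(computed_protein == REPORTED_PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 161217 - 159868 + 1 gives 1350 bp, and 1350 / 3 - 1 gives 449 residues, which agrees with the reported values.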


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000738453
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.736322
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGCA CCTGGATTCG CAAACCCCTC GCCATCTTCA CCGCCAACGG TCTCGACGCC 
GCCGGCGGCC TGGTCGTCGA AGACGGCCGC ATCGTCGAGC TGCTCGGCGC CGGCCAGCAG
CCCGCGCAGC CCTGCGCCAG CCAGTTCGAC GCCAGCCGGC ACGTGGTCTT GCCGGGACTC
ATCAACACCC ATCACCACTT CTACCAGACC CTCACCCGCG CCTGGGCGCC GGTGGTCAAC
CAGCCGCTGT TCCCCTGGCT GAAAACCCTC TACCCGGTCT GGGCGCGGCT GACCCCGGAG
AAGCTCGAAC TGGCCACCAA GGTGGCGCTG GCCGAGCTGC TGCTGTCTGG CTGCACCACT
GCCGCCGACC ACCACTACCT GTTCCCCGGC GGCCTCGAAC AGGCTATCGA CGTGCAGGCC
GGGGTGGTCG AGGAACTGGG CATGCGCGCC ATGCTCACCC GCGGCTCGAT GAGCCTCGGC
GAGAAGGACG GCGGCTTGCC GCCGCAGCAG ACGGTGCAGG AGGCCGAGAC CATCCTCGCC
GACAGCGAGC GACTGATCGC TCGCTACCAC CAGCGCGGCG AAGGCGCCCG GGTGCAGATC
GCCCTGGCGC CCTGCTCGCC GTTCTCGGTG ACTCCGGAGA TCATGCGCGC CAGCGCCGAA
CTGGCGGCGC GCCATGACGT ACGCCTGCAC ACCCACCTGG CGGAGACCCT CGACGAGGAA
GACTTCTGCC TGCAGCGCTT CGGCCTGCGC ACCGTGGACT ACCTGGATAG CGTCGGCTGG
CTCGGCCCGC GCACCTGGCT GGCCCACGGC ATCCACTTCA ACGCCGAGGA GATCCGCCGG
CTCGGCGAGG CGGGCACCGG TATCTGCCAT TGCCCGAGTT CGAACATGCG CCTGGCCTCG
GGCATCTGCC CGACCGTGGA GCTGGAGGCG GCCGGCGCGC CGATTGGCCT GGGAGTCGAT
GGTTCGGCCT CCAACGACGC CTCGAACATG ATCCTCGAGG CGCGCCAGGC CCTGTACCTG
CAACGCCTGC GCTACGGCGC CGAGCGAATC ACCCCGGAAC TCGCCCTGGG CTGGGCCACC
CGTGGCTCGG CACGCCTGCT CGGACGCAGC GACATCGGCG AGCTGGCCCC CGGCAAGCAG
GCCGACCTGG CCTTGTTCAA GCTCGACGAG CTGCGCTTCT CGGGTAGCCA CGACCCGCTC
TCGGCGCTGC TGCTGTGCGC TGCCGACCGT GCCGACCGGG TAATGGTCGG CGGCGCCTGG
CGAGTGGTCG ATGGTGCCGT GGAAGGGCTC GACCTGGCCG CCCTGATCGC CCGCCACCGC
GAGGCGGCGA GCGCCCTGAT CGCCGGGTGA
 
Protein sequence
MSRTWIRKPL AIFTANGLDA AGGLVVEDGR IVELLGAGQQ PAQPCASQFD ASRHVVLPGL 
INTHHHFYQT LTRAWAPVVN QPLFPWLKTL YPVWARLTPE KLELATKVAL AELLLSGCTT
AADHHYLFPG GLEQAIDVQA GVVEELGMRA MLTRGSMSLG EKDGGLPPQQ TVQEAETILA
DSERLIARYH QRGEGARVQI ALAPCSPFSV TPEIMRASAE LAARHDVRLH THLAETLDEE
DFCLQRFGLR TVDYLDSVGW LGPRTWLAHG IHFNAEEIRR LGEAGTGICH CPSSNMRLAS
GICPTVELEA AGAPIGLGVD GSASNDASNM ILEARQALYL QRLRYGAERI TPELALGWAT
RGSARLLGRS DIGELAPGKQ ADLALFKLDE LRFSGSHDPL SALLLCAADR ADRVMVGGAW
RVVDGAVEGL DLAALIARHR EAASALIAG
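
The sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before any analysis. The sketch below shows one way to do that, to compute the GC fraction, and to translate a gene sequence into protein. It is an illustration only, not the database's own pipeline. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTCCCGCA CCTGGATTCG CAAACCCCTC GCCATCTTCA CCGCCAACGG TCTCGACGCC"
print(translate(first_line))  # MSRTWIRKPLAIFTANGLDA
```

The translated fragment matches the first two blocks of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` allows the reported GC content and protein sequence to be checked in the same way.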