Gene PA14_21050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_21050 
Symbol 
ID4385436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp1819004 
End bp1820782 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content69% 
IMG OID639324241 
Productshort chain dehydrogenase 
Protein accessionYP_789828 
Protein GI116051339 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities
[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGC TCCTCGCCCT CCCCGACCTG CAGCCGGAAC GGCTGTTCGT GCAGTCCGGC 
GACGTGCGCC TGGCGGTTCA TTGCTGGGGC GCGCCGGACA ACGACAAGCC GACGCTGCTG
ATGGTGCATG GCTACCCGGA CAACCACGAG ACCTGGCTGC CGCTGATCCG CCAACTGGCT
GGGCGCTACC GCATCGTCGC CTACGACGTG CGCGGCGCCG GCGCTTCGGA CAAGCCACGC
TGGAGCCGCG ACTACCACCT GCAACGGCTC AGCGAAGACC TGCAGGCGGT GATTCGCGCC
ACCAGCCCGG ATCGCCCGGT GCACCTGCTG GCCCATGACT GGGGCTCGAT CCAGACCTGG
GAAAGCGTCA CCGATCCGCA GTGCCGCCCG CTGATCGCGT CCTATACCTC GATCTCCGGG
CCCTGCCTGG ACCATGTCGG CTTCTGGATG CGCGAACACC TCAGGCAGCG CAGTCCGAAG
GCCCTGAAAG CGGTATTCGG CCAGTTGCTC CACTCCTGGT ACATCGCCTT CTTCCATACC
CCCGTCGTAC CCGAGCTGCT CTGGAGCGTC GGCCTGGCCC GGCTCTGGCC GCAGTTCTTG
AAGCGCGCCG AAGGCGTCCG CCATCCACAG GTCAACCCGA CCCAGGCCAG CGACGGCCGG
CACGGGGTCA AGCTGTACCG CGGCAACTTC ATCCGCAGCC TGTTCCGTCC GCGCAAGCGC
CACACCGAAG TGCCGGTGCA ACTGATCGTA CCGACCCGCG ACCGTTACGT CGGCGCCCAG
CTGTTCCAGC ACCTCAGCCT GTGGGCGCCA CGCCTGTGGC GGCGCGAGGC GAGCGTCGGC
CATTGGCAAC TGCTGGCCGA GCCGGAGCAA CTGGCCGGCT GGCTTGGCGA GTTCATCGAC
GCCCAGGAAA CCGGCGAGTC GCCGCCGGCC CTGCAACGCG CACAGGTCCG CCCCGACGCG
CGCTCGATGA GCGGCAAGCT GGTGGTGGTC ACCGGCGCCG GCGGCGGCAT CGGCCGTTCG
ACCCTGCTCA GCTTCGCCGA GCGCGGCGCC AGCCTGCTCG CCGCCGATCT CGACCTGGAA
GCCGCCGAAC GCAGCGCCGA ACTGGCCCGC GCCCTTGGCG CCACGGCCCA TGCCTACCAG
GTCGACGTCG GCGATACCCA GGCCATGGAG CGCTTCGCCG AGTGGGTCCG CGACACCCTC
GGAGTCCCGG ATGTGGTGGT CAGCAATGCC GGCATCGGCA TGGCCGGACC GATGCTCGAC
ACCTCGCCCG CGGAATGGGA GCGCCTGCTG CGGGTCAACC TGTGGAGCGT GATCGACGGC
TGCCGCCTGT TCGGCCGGCA GATGATCGCG GCGAACAAGC CGGGGCACCT GGTCAATGTC
GCCTCCGGCG TGGCCTTCGC GCCGTCGCGC AACTACCCCG CCTACGCCAC CAGCAAGGCC
GCGGTACTGA TGCTCAGCGA ATGCCTGCGG GCGGAACTGG CCGGACGTTC GATCGGAGTC
ACCGCGGTGT GCCCGGGCTT CGTCGATACC GGCATCGTCC AGGCCACCCG CTTCGTCGGC
ATGGACGCCG AACGCCAGGC GCGGCGCCAG GCGAAGATCC AGCGCTTCTA CAAGCGACGC
CGGCTCAGCC CGGACACCGT TGGGGAAAAG CTGGTGCGCG CCGTGGAGCG CAACAAGGCG
GTGGTCTCGG TGGGCAGCGA GGTGCACCTC GGCGCCCTGC AATGGCGCTT CGCGCCCTGG
GCCACGCGGT TTCTCGCGCG CTTCGACCTG ACTTCCTGA
 
Protein sequence
MNALLALPDL QPERLFVQSG DVRLAVHCWG APDNDKPTLL MVHGYPDNHE TWLPLIRQLA 
GRYRIVAYDV RGAGASDKPR WSRDYHLQRL SEDLQAVIRA TSPDRPVHLL AHDWGSIQTW
ESVTDPQCRP LIASYTSISG PCLDHVGFWM REHLRQRSPK ALKAVFGQLL HSWYIAFFHT
PVVPELLWSV GLARLWPQFL KRAEGVRHPQ VNPTQASDGR HGVKLYRGNF IRSLFRPRKR
HTEVPVQLIV PTRDRYVGAQ LFQHLSLWAP RLWRREASVG HWQLLAEPEQ LAGWLGEFID
AQETGESPPA LQRAQVRPDA RSMSGKLVVV TGAGGGIGRS TLLSFAERGA SLLAADLDLE
AAERSAELAR ALGATAHAYQ VDVGDTQAME RFAEWVRDTL GVPDVVVSNA GIGMAGPMLD
TSPAEWERLL RVNLWSVIDG CRLFGRQMIA ANKPGHLVNV ASGVAFAPSR NYPAYATSKA
AVLMLSECLR AELAGRSIGV TAVCPGFVDT GIVQATRFVG MDAERQARRQ AKIQRFYKRR
RLSPDTVGEK LVRAVERNKA VVSVGSEVHL GALQWRFAPW ATRFLARFDL TS