Gene PA14_41920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_41920 
SymbolaroF-1 
ID4381830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp3742858 
End bp3743934 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content63% 
IMG OID639325930 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_791495 
Protein GI116049700 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATT TACAGATCGA CGACCTTAAC GTTGCCTCCA ACGAGACCCT GATCACGCCG 
GAGCAGCTCA AGCGTGAAAT TCCCCTGACC GACAAGGCCC TGCAGACCGT GGCCCATGGT
CGCCAGGTGG TGCGCGACAT CCTGGATGGC AAGGACCACC GTCTGTTCGT GGTGATCGGC
CCCTGCTCCA TCCACGACAT CAAGGCCGCC CACGAATACG CCGACCGCCT CAAGGCGCTC
GCGGCCGAAG TGGCGGATAC GCTGTTCCTG GTGATGCGCG TGTACTTCGA GAAGCCGCGT
ACCACGGTGG GCTGGAAAGG CCTGATCAAC GATCCGTACC TGGACGACTC GTTCAAGATC
CAGGATGGCC TGCACATCGG TCGCCAACTG CTCCGCGACC TCGCCGAGAA AGGCTTGCCC
ACCGCCACCG AAGCGCTCGA CCCGATTTCC CCGCAGTACC TGCAGGACCT GATCAGCTGG
TCGGCGATCG GCGCCCGTAC CACCGAATCG CAGACCCACC GCGAGATGGC CTCCGGCCTG
TCTTCCGCGG TCGGCTTCAA GAACGGCACC GATGGCAGCC TGACCGTGGC GATCAATGCC
CTGCAGTCGG TCTCCAGCCC GCATCGCTTC CTCGGCATCA ACCAGCAGGG CGGCGTATCC
ATCGTCACCA CCAAGGGCAA CCGCTACGGT CACGTGGTGT TGCGCGGCGG CAACGGCAAG
CCGAACTACG ATTCGGTCAG CGTCGCGCTC TGCGAGCAGG ACCTGAACAA GGCGAAAATC
CCGCTGAACA TCATGGTCGA CTGCAGCCAC GCCAACTCCA ACAAGGACCC GGCCCTGCAA
CCGCTGGTGA TGGACAACGT CAGCAACCAG ATCGTCGAAG GCAACAACTC GATCGTCGGC
CTGATGGTGG AAAGCCACCT GGGCTGGGGC AGCCAGCCGA TTCCGAAGGA TCTCGACCAA
CTTCAGTACG GCGTCTCCAT CACCGACGCC TGCATCGACT GGGCGACCAC CGAGAAGAGC
ATCCGCAGCA TGCACGCCAA GCTCAAGGAC GTGCTGCCGA AACGCCAGCG CGGCTGA
 
Protein sequence
MADLQIDDLN VASNETLITP EQLKREIPLT DKALQTVAHG RQVVRDILDG KDHRLFVVIG 
PCSIHDIKAA HEYADRLKAL AAEVADTLFL VMRVYFEKPR TTVGWKGLIN DPYLDDSFKI
QDGLHIGRQL LRDLAEKGLP TATEALDPIS PQYLQDLISW SAIGARTTES QTHREMASGL
SSAVGFKNGT DGSLTVAINA LQSVSSPHRF LGINQQGGVS IVTTKGNRYG HVVLRGGNGK
PNYDSVSVAL CEQDLNKAKI PLNIMVDCSH ANSNKDPALQ PLVMDNVSNQ IVEGNNSIVG
LMVESHLGWG SQPIPKDLDQ LQYGVSITDA CIDWATTEKS IRSMHAKLKD VLPKRQRG