Gene PA14_30540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_30540 
Symbol 
ID4385918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp2643371 
End bp2644327 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content68% 
IMG OID639325028 
Productputative periplasmic aliphatic sulfonate-binding protein 
Protein accessionYP_790599 
Protein GI116050582 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.558339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAGA CCCTGAGCCT GTTCCTGCTT CTCTGCCTGC TCGTCGGCCA GGCCTGTGCC 
GACGAGCGCG TGACCCTGCG CCTGGCCGAC CAGAAGGGCA ACATGCGCGC CCAGTTGGAG
GCTGCCGGCG CGCTGGACGA CCTGACCTAC GACATCCGCT GGTTCGAATT CCCCGCCGCC
GCCGCCCTCG CCGAAGCACT CAATGCCGGT GCGGTGGATG CCGGCATCAT CGGCGACGCG
CCATTGCTCT TCGCCCTGGC CGCCGGTGCC CGGCTGAAGG CCATCGCCGT CGACAAATCC
GATCCCTACG GTACCGCCGT GCTGGTGAGA GGGGATTCGC CGCTGCGCTC GGCCAACGAC
CTGAAAGGCC AGCGCATTGC CACCGGGCGC GGTTCCATCG GCCACTTCGT CGCGCTCAAG
GCCTTGGCCT CGGTCGGGCT GGGCGAGAAG GACGTCGAAT TCCGCTTCCT CGGCCCGGTG
GACGCGAAAA TGGCCCTCGC CAATGGTTCC GTGGATGCCT GGGCGACCTG GGAGCCCTAC
ACCGCGTTCG CGGAGACCGC CGACAAGGCT CGGGTACTGG TCGACGGTCG CGGCCTGTGG
GCTGGCAACA GCTTCCTCGC TGCTACCGAT AGCGCACTGG CCGACCCGGC GAAACGCGCC
GTGCTGCAGG ACTACCTGCA ACGCCTGGCC AGCGCCCAGC GCTGGGCGTA CCTGCACCTG
GACGAGTATT CGCGAAGCCT GGCGCAGATC ATCGGCTTCC CCGAGGACGC CGCGCGCTTG
CAGTTCGAAC GTCGGCGCTT GCGTTGGCAG GCGCTGGATG AGCGAACGCT CGGCCAGCAG
CAGGAGACCG CCGATTTCTA CCAGGCCCAT GGATTGATTC CGCAGCGCCT CGATGTGCGG
CCGACCTTCG CCAGCGGCTT TGCGGTCAGG CAGGCGTCGG ATGCGGATAT CCGGTAG
 
Protein sequence
MRQTLSLFLL LCLLVGQACA DERVTLRLAD QKGNMRAQLE AAGALDDLTY DIRWFEFPAA 
AALAEALNAG AVDAGIIGDA PLLFALAAGA RLKAIAVDKS DPYGTAVLVR GDSPLRSAND
LKGQRIATGR GSIGHFVALK ALASVGLGEK DVEFRFLGPV DAKMALANGS VDAWATWEPY
TAFAETADKA RVLVDGRGLW AGNSFLAATD SALADPAKRA VLQDYLQRLA SAQRWAYLHL
DEYSRSLAQI IGFPEDAARL QFERRRLRWQ ALDERTLGQQ QETADFYQAH GLIPQRLDVR
PTFASGFAVR QASDADIR