Gene PA14_00380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_00380 
SymbolbetC 
ID4384749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp33021 
End bp34532 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content68% 
IMG OID639322585 
Productcholine sulfatase 
Protein accessionYP_788186 
Protein GI116053751 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.103488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCT CGCCGAACAT CCTGTTCATC ATGGCCGACC AGATGGCCGC GCCGCTGCTG 
CCGCTTCACG ATCCGCGCTC GGTGCTGCGC ATGCCTCACC TCTCGCGCCT CGCCGAACGG
GCCGTGGTGT TCGACTCGGC GTACTGCAAC AGCCCGCTCT GCGCGCCGTC GCGCTTCACC
CTGGTCAGCG GTCGCTTGCC TACTCGCATC GGCGCCTGGG ACAACGCCGC CGACTTCGCC
GCCGATACCC CCACCTACGC CCACTACCTG CGCAACCTCG GCTATCGCAC GGCGCTGTCG
GGCAAGATGC ACTTCTGCGG TCCCGACCAG TTGCACGGCT ACGAGGAACG CCTGACCAGC
GACATCTATC CGGCGGACTA TGGCTGGGCG GTGAACTGGG ACGAGCCGGA GGTGCGCCCG
AGCTGGTACC ACAACATGTC CTCGGTTTTG CAGGCCGGTC CCTGCGTGCG CACCAACCAG
CTGGACTTCG ACGAGGAGGT GGTGTTCAAG GCCCGCCAGT ACCTCTACGA CCATGTTCGC
CAGCACGCCG GCCAGCCATT CTGCCTGACC GTGTCGATGA CCCATCCGCA CGACCCCTAC
AGCATCCCGG CGAGCTACTG GAATCTCTAC CGCGACGAGG ACATCCCGCT GCCGCGCCAG
CGCTTCGCCC AGGAGGAGCA GGACCCTCAT TCGCAACGCC TGCTGAAGGT CATCGACCTG
TGGGACAAGC CGTTGCCCGA GGAGCGCATC CGCGCCGCCC GGCGTGCCTA CTTCGGCGCC
TGCAGCTACG TCGACGCGCA GATCGGTGCG CTGCTGGCGA CCCTGGAGGA ATGCGGGCTG
GCCGACGACA CCATCGTGGT GTTCTCCGGC GACCATGGCG ACATGCTCGG CGAGCGCGGC
CTCTGGTACA AGATGCACTG GTTCGAGATG GCCGCGCGCG TGCCGCTGCT GGTCCATGCG
CCGGCGCGCT TCGCGCCGCG CCGCATCGGC GCTTCGGTAT CCACCGTGGA CCTGCTGCCG
ACCCTGGTGG AGCTGGCCGG CGGCCAGGTC GATCCACGCC TGCCGCTGGA AGGCCGCTCG
CTGCTGCCGC ACCTGCGCGA CGGCAGCGGG CATGACGAGG TGATCGGCGA ATACACCGCC
GAGGGCACCC TCAGCCCGCT GATGATGATC CGCCGCGGCG ACTACAAGTT CATCTACTCC
GAGCAGGACC CCTGCCTGCT CTACGACCTG CGCAACGACC CGCAGGAGCG CGAGAACCTC
GCCGCCAGTC CGGCCCATCG CGGAACGTTC GAGGCGTTCC TCGACGAGGC CCGGCGACGC
TGGGACATCC CCGCGATCAC CCGCGCCGTA CTCGACAGCC AGCGCCGCCG ACGCCTGGTG
GCCGCCGCGC TGGCGCGAGG GCGGCTGGCC AGTTGGGACC ACCAGCCGTG GATCGACGCC
AGCCAGCAGT ACATGCGCAA CCATATCGAC CTGGACGATC TCGAGCGCCG CGCGCGCTTC
CCGCAACCCT GA
 
Protein sequence
MKTSPNILFI MADQMAAPLL PLHDPRSVLR MPHLSRLAER AVVFDSAYCN SPLCAPSRFT 
LVSGRLPTRI GAWDNAADFA ADTPTYAHYL RNLGYRTALS GKMHFCGPDQ LHGYEERLTS
DIYPADYGWA VNWDEPEVRP SWYHNMSSVL QAGPCVRTNQ LDFDEEVVFK ARQYLYDHVR
QHAGQPFCLT VSMTHPHDPY SIPASYWNLY RDEDIPLPRQ RFAQEEQDPH SQRLLKVIDL
WDKPLPEERI RAARRAYFGA CSYVDAQIGA LLATLEECGL ADDTIVVFSG DHGDMLGERG
LWYKMHWFEM AARVPLLVHA PARFAPRRIG ASVSTVDLLP TLVELAGGQV DPRLPLEGRS
LLPHLRDGSG HDEVIGEYTA EGTLSPLMMI RRGDYKFIYS EQDPCLLYDL RNDPQERENL
AASPAHRGTF EAFLDEARRR WDIPAITRAV LDSQRRRRLV AAALARGRLA SWDHQPWIDA
SQQYMRNHID LDDLERRARF PQP