Gene PSPA7_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPSPA7_0032 
SymbolbetC 
ID5356908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa PA7 
KingdomBacteria 
Replicon accessionNC_009656 
Strand
Start bp32880 
End bp34391 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content69% 
IMG OID640809086 
Productcholine sulfatase 
Protein accessionYP_001345428 
Protein GI152989262 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.193076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACCC CGCCGAACAT ACTGTTCATC ATGGCCGACC AGATGGCCGC GCCGCTGCTG 
CCGTTTCACG ATCCCCGCTC GGTGCTACGC ATGCCCAACC TGTCGCGCCT CGCCGAACGG
GCGGTGGTAT TCGATTCGGC CTACTGCAAC AGTCCGCTCT GCGCGCCCTC GCGCTTCACC
CTGGTCAGCG GCCGGTTGCC CAGCCGCATC GGCGCCTGGG ACAACGCCGC CGACTTCGCC
GCCGATACCC CGACCTACGC CCACTACCTG CGTCGCCTCG GTTATCGCAC GGCGCTGTCG
GGCAAGATGC ACTTCTGCGG CCCCGACCAG CTGCATGGCT ACGAGGAGCG CCTGACCAGC
GATATCTATC CGGCCGACTA CGGCTGGGCG GTGAACTGGG ACGAGCCGGA GGTGCGCCCG
AGCTGGTACC ACAACATGTC CTCGGTCCTG CAGGCCGGTC CCTGCGTGCG CACCAACCAG
CTGGACTTCG ACGAGGAGGC GGTGTTCAAG GCCCGCCAGT ACCTCTACGA CCATGTTCGC
CAGCACGCCG GCCAGCCATT CTGCCTGACC GTGTCGATGA CCCACCCGCA CGATCCCTAC
TGCATCCCGG CGAGCTACTG GAACCTCTAC CGCGACGAGG ACATCCCGCT GCCGCGCCAG
AGCTTCGCCC AGGAAGAGCA GGACCCTCAT TCGCAACGCC TGCTGAAGGT CATCGACCTG
TGGGACAAGC CGTTGCCCGA GGAGCGCATC CGCGCCGCCC GGCGCGCCTA CTTCGGCGCC
TGCAGCTACG TCGACGCGCA GGTCGGCGCG CTGCTGGCGA CGCTGGAGGA ATGCGGGCTG
GCCGACGACA CCATCCTGGT GTTCTCCGGC GACCATGGCG ACATGCTCGG CGAGCGCGGC
CTCTGGTACA AGATGCACTG GTTCGAGATG GCCGCGCGCG TGCCGCTGCT GGTCCACGCG
CCAGGTCGCT TCGCGGCGCG CCGGATCGGC GCCTCGGTAT CCACCGTCGA CCTGCTGCCG
ACCCTGGTGG AACTGGCCGG CGGCCAGGTC GACGCGCGCC TGCCGCTGGA CGGACGCTCG
CTGCTGCCGC ACCTGCGTGA CGGCAGCGGG CATGACGAGG TGATCGGCGA ATACACCGCC
GAGGGCACCC TCGGCCCGCT GATGATGATC CGTCGCGGCG ACTACAAGTT CATCTATTCA
GAGCAGGACC CCTGCCTGCT CTACGACCTG CGCAACGACC CCCAGGAGCG CGAGAACCTC
GCCGCCAGCC CGGCCCATCG CGGAGCATTC GAGGCGTTCC TCGATGAGGC ACGGCGGCGC
TGGGACATCC CCGCGCTCAC CCGCGCCGTG CTCGACAGCC AGCGCCGCCG GCGCCTGGTA
GCCGAGGCGC TGACGCGCGG CCGGTTGACC AGCTGGGACC ATCAGCCGTG GGTCGACGCC
AGCCAGCAAT ACATGCGCAA CCATATCGAC CTCGACGACC TCGAGCGCCG CGCGCGCTTC
CCGCAACCCT GA
 
Protein sequence
MKTPPNILFI MADQMAAPLL PFHDPRSVLR MPNLSRLAER AVVFDSAYCN SPLCAPSRFT 
LVSGRLPSRI GAWDNAADFA ADTPTYAHYL RRLGYRTALS GKMHFCGPDQ LHGYEERLTS
DIYPADYGWA VNWDEPEVRP SWYHNMSSVL QAGPCVRTNQ LDFDEEAVFK ARQYLYDHVR
QHAGQPFCLT VSMTHPHDPY CIPASYWNLY RDEDIPLPRQ SFAQEEQDPH SQRLLKVIDL
WDKPLPEERI RAARRAYFGA CSYVDAQVGA LLATLEECGL ADDTILVFSG DHGDMLGERG
LWYKMHWFEM AARVPLLVHA PGRFAARRIG ASVSTVDLLP TLVELAGGQV DARLPLDGRS
LLPHLRDGSG HDEVIGEYTA EGTLGPLMMI RRGDYKFIYS EQDPCLLYDL RNDPQERENL
AASPAHRGAF EAFLDEARRR WDIPALTRAV LDSQRRRRLV AEALTRGRLT SWDHQPWVDA
SQQYMRNHID LDDLERRARF PQP