Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PA14_00380 |
Symbol | betC |
ID | 4384749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | - |
Start bp | 33021 |
End bp | 34532 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639322585 |
Product | choline sulfatase |
Protein accession | YP_788186 |
Protein GI | 116053751 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.103488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCT CGCCGAACAT CCTGTTCATC ATGGCCGACC AGATGGCCGC GCCGCTGCTG CCGCTTCACG ATCCGCGCTC GGTGCTGCGC ATGCCTCACC TCTCGCGCCT CGCCGAACGG GCCGTGGTGT TCGACTCGGC GTACTGCAAC AGCCCGCTCT GCGCGCCGTC GCGCTTCACC CTGGTCAGCG GTCGCTTGCC TACTCGCATC GGCGCCTGGG ACAACGCCGC CGACTTCGCC GCCGATACCC CCACCTACGC CCACTACCTG CGCAACCTCG GCTATCGCAC GGCGCTGTCG GGCAAGATGC ACTTCTGCGG TCCCGACCAG TTGCACGGCT ACGAGGAACG CCTGACCAGC GACATCTATC CGGCGGACTA TGGCTGGGCG GTGAACTGGG ACGAGCCGGA GGTGCGCCCG AGCTGGTACC ACAACATGTC CTCGGTTTTG CAGGCCGGTC CCTGCGTGCG CACCAACCAG CTGGACTTCG ACGAGGAGGT GGTGTTCAAG GCCCGCCAGT ACCTCTACGA CCATGTTCGC CAGCACGCCG GCCAGCCATT CTGCCTGACC GTGTCGATGA CCCATCCGCA CGACCCCTAC AGCATCCCGG CGAGCTACTG GAATCTCTAC CGCGACGAGG ACATCCCGCT GCCGCGCCAG CGCTTCGCCC AGGAGGAGCA GGACCCTCAT TCGCAACGCC TGCTGAAGGT CATCGACCTG TGGGACAAGC CGTTGCCCGA GGAGCGCATC CGCGCCGCCC GGCGTGCCTA CTTCGGCGCC TGCAGCTACG TCGACGCGCA GATCGGTGCG CTGCTGGCGA CCCTGGAGGA ATGCGGGCTG GCCGACGACA CCATCGTGGT GTTCTCCGGC GACCATGGCG ACATGCTCGG CGAGCGCGGC CTCTGGTACA AGATGCACTG GTTCGAGATG GCCGCGCGCG TGCCGCTGCT GGTCCATGCG CCGGCGCGCT TCGCGCCGCG CCGCATCGGC GCTTCGGTAT CCACCGTGGA CCTGCTGCCG ACCCTGGTGG AGCTGGCCGG CGGCCAGGTC GATCCACGCC TGCCGCTGGA AGGCCGCTCG CTGCTGCCGC ACCTGCGCGA CGGCAGCGGG CATGACGAGG TGATCGGCGA ATACACCGCC GAGGGCACCC TCAGCCCGCT GATGATGATC CGCCGCGGCG ACTACAAGTT CATCTACTCC GAGCAGGACC CCTGCCTGCT CTACGACCTG CGCAACGACC CGCAGGAGCG CGAGAACCTC GCCGCCAGTC CGGCCCATCG CGGAACGTTC GAGGCGTTCC TCGACGAGGC CCGGCGACGC TGGGACATCC CCGCGATCAC CCGCGCCGTA CTCGACAGCC AGCGCCGCCG ACGCCTGGTG GCCGCCGCGC TGGCGCGAGG GCGGCTGGCC AGTTGGGACC ACCAGCCGTG GATCGACGCC AGCCAGCAGT ACATGCGCAA CCATATCGAC CTGGACGATC TCGAGCGCCG CGCGCGCTTC CCGCAACCCT GA
|
Protein sequence | MKTSPNILFI MADQMAAPLL PLHDPRSVLR MPHLSRLAER AVVFDSAYCN SPLCAPSRFT LVSGRLPTRI GAWDNAADFA ADTPTYAHYL RNLGYRTALS GKMHFCGPDQ LHGYEERLTS DIYPADYGWA VNWDEPEVRP SWYHNMSSVL QAGPCVRTNQ LDFDEEVVFK ARQYLYDHVR QHAGQPFCLT VSMTHPHDPY SIPASYWNLY RDEDIPLPRQ RFAQEEQDPH SQRLLKVIDL WDKPLPEERI RAARRAYFGA CSYVDAQIGA LLATLEECGL ADDTIVVFSG DHGDMLGERG LWYKMHWFEM AARVPLLVHA PARFAPRRIG ASVSTVDLLP TLVELAGGQV DPRLPLEGRS LLPHLRDGSG HDEVIGEYTA EGTLSPLMMI RRGDYKFIYS EQDPCLLYDL RNDPQERENL AASPAHRGTF EAFLDEARRR WDIPAITRAV LDSQRRRRLV AAALARGRLA SWDHQPWIDA SQQYMRNHID LDDLERRARF PQP
|
| |