Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPA7_0032 |
Symbol | betC |
ID | 5356908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa PA7 |
Kingdom | Bacteria |
Replicon accession | NC_009656 |
Strand | - |
Start bp | 32880 |
End bp | 34391 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640809086 |
Product | choline sulfatase |
Protein accession | YP_001345428 |
Protein GI | 152989262 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.193076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCC CGCCGAACAT ACTGTTCATC ATGGCCGACC AGATGGCCGC GCCGCTGCTG CCGTTTCACG ATCCCCGCTC GGTGCTACGC ATGCCCAACC TGTCGCGCCT CGCCGAACGG GCGGTGGTAT TCGATTCGGC CTACTGCAAC AGTCCGCTCT GCGCGCCCTC GCGCTTCACC CTGGTCAGCG GCCGGTTGCC CAGCCGCATC GGCGCCTGGG ACAACGCCGC CGACTTCGCC GCCGATACCC CGACCTACGC CCACTACCTG CGTCGCCTCG GTTATCGCAC GGCGCTGTCG GGCAAGATGC ACTTCTGCGG CCCCGACCAG CTGCATGGCT ACGAGGAGCG CCTGACCAGC GATATCTATC CGGCCGACTA CGGCTGGGCG GTGAACTGGG ACGAGCCGGA GGTGCGCCCG AGCTGGTACC ACAACATGTC CTCGGTCCTG CAGGCCGGTC CCTGCGTGCG CACCAACCAG CTGGACTTCG ACGAGGAGGC GGTGTTCAAG GCCCGCCAGT ACCTCTACGA CCATGTTCGC CAGCACGCCG GCCAGCCATT CTGCCTGACC GTGTCGATGA CCCACCCGCA CGATCCCTAC TGCATCCCGG CGAGCTACTG GAACCTCTAC CGCGACGAGG ACATCCCGCT GCCGCGCCAG AGCTTCGCCC AGGAAGAGCA GGACCCTCAT TCGCAACGCC TGCTGAAGGT CATCGACCTG TGGGACAAGC CGTTGCCCGA GGAGCGCATC CGCGCCGCCC GGCGCGCCTA CTTCGGCGCC TGCAGCTACG TCGACGCGCA GGTCGGCGCG CTGCTGGCGA CGCTGGAGGA ATGCGGGCTG GCCGACGACA CCATCCTGGT GTTCTCCGGC GACCATGGCG ACATGCTCGG CGAGCGCGGC CTCTGGTACA AGATGCACTG GTTCGAGATG GCCGCGCGCG TGCCGCTGCT GGTCCACGCG CCAGGTCGCT TCGCGGCGCG CCGGATCGGC GCCTCGGTAT CCACCGTCGA CCTGCTGCCG ACCCTGGTGG AACTGGCCGG CGGCCAGGTC GACGCGCGCC TGCCGCTGGA CGGACGCTCG CTGCTGCCGC ACCTGCGTGA CGGCAGCGGG CATGACGAGG TGATCGGCGA ATACACCGCC GAGGGCACCC TCGGCCCGCT GATGATGATC CGTCGCGGCG ACTACAAGTT CATCTATTCA GAGCAGGACC CCTGCCTGCT CTACGACCTG CGCAACGACC CCCAGGAGCG CGAGAACCTC GCCGCCAGCC CGGCCCATCG CGGAGCATTC GAGGCGTTCC TCGATGAGGC ACGGCGGCGC TGGGACATCC CCGCGCTCAC CCGCGCCGTG CTCGACAGCC AGCGCCGCCG GCGCCTGGTA GCCGAGGCGC TGACGCGCGG CCGGTTGACC AGCTGGGACC ATCAGCCGTG GGTCGACGCC AGCCAGCAAT ACATGCGCAA CCATATCGAC CTCGACGACC TCGAGCGCCG CGCGCGCTTC CCGCAACCCT GA
|
Protein sequence | MKTPPNILFI MADQMAAPLL PFHDPRSVLR MPNLSRLAER AVVFDSAYCN SPLCAPSRFT LVSGRLPSRI GAWDNAADFA ADTPTYAHYL RRLGYRTALS GKMHFCGPDQ LHGYEERLTS DIYPADYGWA VNWDEPEVRP SWYHNMSSVL QAGPCVRTNQ LDFDEEAVFK ARQYLYDHVR QHAGQPFCLT VSMTHPHDPY CIPASYWNLY RDEDIPLPRQ SFAQEEQDPH SQRLLKVIDL WDKPLPEERI RAARRAYFGA CSYVDAQVGA LLATLEECGL ADDTILVFSG DHGDMLGERG LWYKMHWFEM AARVPLLVHA PGRFAARRIG ASVSTVDLLP TLVELAGGQV DARLPLDGRS LLPHLRDGSG HDEVIGEYTA EGTLGPLMMI RRGDYKFIYS EQDPCLLYDL RNDPQERENL AASPAHRGAF EAFLDEARRR WDIPALTRAV LDSQRRRRLV AEALTRGRLT SWDHQPWVDA SQQYMRNHID LDDLERRARF PQP
|
| |