Gene Information |
Locus tag | Bphyt_5075 |
Symbol | |
ID | 6279038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | + |
Start bp | 1236569 |
End bp | 1238107 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642616166 |
Product | choline-sulfatase |
Protein accession | YP_001888809 |
Protein GI | 187919778 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
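The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1236569, 1238107)
print(length)                  # 1539
print(protein_length(length))  # 512
```

Both values agree with the Gene Length (1539 bp) and Protein Length (512 aa) rows of this record.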
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.287023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0038506 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTGACA GCAAAAAGAA TATTCTCATC CTGATGGCCG ACCAGATGAC GCCGTTCGCG CTGGCCGCAT ACGGGCACCC TCTGACGAAG ACGCCCAATC TCGATCGCCT CGCGAAACAG GGCGTGGTGT TCGAATCGGC CTATTGCGCG AGTCCGCTGT GTGCGCCGTC GCGATTCTCT TTTTTATCGG GCAAGCTGCC GTCCGCAATC GGTGCCTATG ACAATGCAGC GGAATTTCCG TCGCAAACGC TGACCTTCGC GCATTATCTC CGCGCTGAAG GTTATCGAAC CATTCTGTCC GGCAAGATGC ATTTCTGCGG CGCCGATCAG TTGCATGGCT TCGAAGAGCG GCTCACCACC GACATCTATC CCGCCGACTT CGGCTGGACA CCCGACTGGG ACAATTTCGA AGCACGTCCC ACGTGGTATC ACAACATGAG TTCGGTAATC GACGCGGGTC CGTGCGTGCG CACCAATCAA CTCGACTTCG ACGACGAGGT CACGTTCACC GCTCGCCAGA AACTCTTCGA CATTGCGCGC GAACGTCATG CGGGTAAAGA TGCGCGGCCG TTTTGCATGG TCGCCTCGCT GACGCACCCA CACGACCCGT ACGCGATTCC GCAAAAGTAC TGGGACATGT ATCGCGACGA AGACATCGAC ATGCCCGCGT TTCGCGATTC ATTCGAAGAC GCTGACCCGC ACTCAAAGCG CCTGCGCCAT GTCTGCGAAA CCGATCGCAC GCCGCCTACC GATCAGCAGA TCCGCAACGC GCGGCGCGCT TATTACGGCG CGATCTCCTA CGTCGACGAC CAGTTCGGCG CGATCCTCGA AGCGCTCGAT CAGGCCGGCC TCGCGCAAGA CACGGTGATC GTGGTGACGT CCGATCACGG CGAGATGCTC GGCGAACGTG GACTCTGGTA CAAGATGACT TTCTTCGAAG GCGGTTGCCG CGTGCCCTTG ATCGTGCACG CGCCGCAGCA ATTCGACGCG CATCGCGTGA AGGACTCGGT CTCGCATCTC GATCTCGTGC CGACGCTGGT CGAACTGGCG CGCGGCGAAC AGCCGGCCGT GTGGCCCGAT TCGCTGGATG GACAAAGCCT CGTGCCGCAT CTGTTCGGTA AGCAAGGTGG TCATGACGAA GCGATCGGCG AATATCTGGC CGAGGGTGCG ATTGCGCCGA TTGTGATGCT GCGGCGTGGG CGCTTCAAGT TTATTCACAC ACCTGCCGAT CCCGATCAAC TCTACGACGT CGCAGCCGAT CCGCTTGAAC GAGAAAACCT TGCCGCGCGC AGCGAATATG CATCACAGGT CGCGGCATTT CGGGAGGAAG TCGCGCAACG CTGGAATCTC GCCGCGCTGC ACAATGAAGT GCTGCAAAGC CAGCGGCGCC GCCATTTTCA CTTCGCGTCA ACGACACAAG GCACGGTCGC ATCGTGGGAT TGGCAACCGC TCGTCGATGC GAGCCAGCGT TATATGCGCA ACCACATCGA TCTCGATACG CTCGAAGCGA TGGCGCGCTT TCCCGCCGTT GCCCGTTAA
Protein sequence | MLDSKKNILI LMADQMTPFA LAAYGHPLTK TPNLDRLAKQ GVVFESAYCA SPLCAPSRFS FLSGKLPSAI GAYDNAAEFP SQTLTFAHYL RAEGYRTILS GKMHFCGADQ LHGFEERLTT DIYPADFGWT PDWDNFEARP TWYHNMSSVI DAGPCVRTNQ LDFDDEVTFT ARQKLFDIAR ERHAGKDARP FCMVASLTHP HDPYAIPQKY WDMYRDEDID MPAFRDSFED ADPHSKRLRH VCETDRTPPT DQQIRNARRA YYGAISYVDD QFGAILEALD QAGLAQDTVI VVTSDHGEML GERGLWYKMT FFEGGCRVPL IVHAPQQFDA HRVKDSVSHL DLVPTLVELA RGEQPAVWPD SLDGQSLVPH LFGKQGGHDE AIGEYLAEGA IAPIVMLRRG RFKFIHTPAD PDQLYDVAAD PLERENLAAR SEYASQVAAF REEVAQRWNL AALHNEVLQS QRRRHFHFAS TTQGTVASWD WQPLVDASQR YMRNHIDLDT LEAMARFPAV AR
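The sequences in this record are printed in space-separated blocks of ten characters, so they need to be normalised before any analysis. The following is a minimal sketch of that clean-up plus two simple checks (GC percentage and a rough completeness test for a CDS). All helper names are hypothetical. The completeness test assumes an ATG start codon for simplicity, although translation table 11 also permits alternative starts such as GTG and TTG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Rough check: length divisible by 3, ATG start (assumption), stop at end."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] == "ATG"
        and seq[-3:] in STOP_CODONS
    )


# Usage with the first two blocks of the gene sequence from this record:
print(clean_sequence("ATGCTTGACA GCAAAAAGAA"))
```

Passing the full Gene sequence value to `gc_content` gives a figure that can be compared with the GC content row, and `looks_like_complete_cds` can be used to confirm that the sequence starts with ATG and ends with a stop codon.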