Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1855 |
Symbol | |
ID | 3845901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2245821 |
End bp | 2247374 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637839156 |
Product | choline sulfatase |
Protein accession | YP_440049 |
Protein GI | 83716187 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.250441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGATA CCGCCGATTC CACCGATATC CAGCCGAACA TTCTCGTCCT GATGGCCGAC CAGCTCACGC CGTTCGCGTT GCGCGCGTAC GGCCATCGCG CGACGCGCAC GCCGACGATC GATCGGCTCG CCGCCGGGGG CGTCGTCTTC GATGCCGCCT ACTGCGCGAG CCCGCTCTGC GCGCCGTCGC GCTTCGCGCT GATGGCGGGC AAGCTGCCGT CGGCGCTCGG CGCTTACGAT AACGCCGCCG AATTGCCGGC GCAAACGCTG ACGTTCGCGC ACTACCTGCG CGCGGCCGGC TACCGGACGA TGCTGTCGGG CAAAATGCAC TTCTGCGGGC CCGATCAGTT GCACGGCTTC GAGGAACGGC TCACGACCGA CATCTATCCG GCCGATTTCG GCTGGGTGCC GGACTGGACG CGTCCCGCCG AGCGGCCGAG CTGGTATCAC AACATGAGCT CGGTGCTCGA CGCCGGCCCG TGCGTGCGGA CCAACCAGCT CGATTTCGAC GACGATGCGA CGTTCGCGGC GCGTCAGAAG ATCTTCGACG TCGCGCGCGA GCGCGCGGCC GGCCGGGACG CGCGGCCGTT CTGCATGGTC GTGTCGCTCA CGCATCCGCA CGATCCGTAT GCGATCACAC GCGAATACTG GGATCTGTAC CGGGACGAGG ATATCGATCT GCCCGCCGTG CGGATGGATT TCGACGCGAG CGACCCGCAT TCACGGCGCC TGCGCGCCGT ATGCGAGGTC GATCGCACGC CGCCGGACGA CTTGCAAATC CGGCGCGCGC GGCGCGCGTA CTACGGCGCG ACGTCCTATG TCGACGCGCA GTTCGGCGCG CTGCTCGCGG CGCTCGAGCA ATGCGGGCTC GCCGACGACA CGATCGTGAT CGTCACCGCG GATCACGGCG ACATGCTCGG CGAGCGCGGC CTCTGGTACA AGATGACGTT CTTCGAAGGC GCATGTCGCG TGCCGCTCAT CGTGCACGCG CCGGGCCGGT TTTCTGCCGC GCGCGTGCCG GCGGCCGTGT CGCACGTCGA TCTGCTGCCG ACGCTCGTCG AACTCGCGAC GGGCGAGCGC CGCGCCGACT GGCCCGACGC TGTCGACGGC CGCAGCCTCG TTCCGCATCT GCGCGGCGAG GGCGGCCATG ACGAGGCGTT CGGCGAATAC CTGGCCGAAG GCGCGATCGC GCCGATCGTG ATGATTCGCC GCGGCAACCA CAAGTACATC CATTCGCCTG CGGACCCTGA TCAGCTCTTC GATCTGAAGA ATGATCCGAG CGAGCTCGAC AACCTCGCGA ACGCGCCCAC CGAGGCAGCG CGTGTCGTTG CGTTTCGCGC GGAGTGCGCC GCACGCTGGG ATCTCGATGC GCTGCATCAA CAGGTGCTCG CGAGCCAGCG CAGGCGGCGC TTCCATTTCA AGGCGACGAC CACGGGGCGG ATTCGGTCGT GGGACTGGCA GCCGTTCCAG GATGCGAGCC AGCGTTACAT GCGCAATCAC CTCGAGCTCG ACGCGCTCGA GGCGACCGCG CGTTTTCCCC GTCCGCACGC GTGA
|
Protein sequence | MPDTADSTDI QPNILVLMAD QLTPFALRAY GHRATRTPTI DRLAAGGVVF DAAYCASPLC APSRFALMAG KLPSALGAYD NAAELPAQTL TFAHYLRAAG YRTMLSGKMH FCGPDQLHGF EERLTTDIYP ADFGWVPDWT RPAERPSWYH NMSSVLDAGP CVRTNQLDFD DDATFAARQK IFDVARERAA GRDARPFCMV VSLTHPHDPY AITREYWDLY RDEDIDLPAV RMDFDASDPH SRRLRAVCEV DRTPPDDLQI RRARRAYYGA TSYVDAQFGA LLAALEQCGL ADDTIVIVTA DHGDMLGERG LWYKMTFFEG ACRVPLIVHA PGRFSAARVP AAVSHVDLLP TLVELATGER RADWPDAVDG RSLVPHLRGE GGHDEAFGEY LAEGAIAPIV MIRRGNHKYI HSPADPDQLF DLKNDPSELD NLANAPTEAA RVVAFRAECA ARWDLDALHQ QVLASQRRRR FHFKATTTGR IRSWDWQPFQ DASQRYMRNH LELDALEATA RFPRPHA
|
| |