Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bxe_B1573 |
Symbol | |
ID | 4007938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia xenovorans LB400 |
Kingdom | Bacteria |
Replicon accession | NC_007952 |
Strand | + |
Start bp | 1635503 |
End bp | 1637050 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637951121 |
Product | putative choline-sulfatase |
Protein accession | YP_553741 |
Protein GI | 91778533 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGAGA CCAAAAAGAA TATTCTTATC CTGATGGCAG ACCAGATGAC GCCGTTCGCA CTAGCCGCGT ACGGACACCG TCTGACGAAA ACCCCCAATC TGGATCGCCT CGCGAAACAA GGCGTGGTAT TCGAATCGGC CTACTGCGCC AGTCCGCTGT GTGCGCCTTC GCGGTTTTCT TTTCTGTCGG GCAAGTTGCC TTCGGCAATC GGCGCCTATG ACAATGCAGC GGAGTTTCCG TCGCAAACGC TGACCTTCGC ACATTATCTT CGCGCAGAAG GATATCGCAC CATTCTGTCC GGCAAGATGC ATTTCTGCGG CGCCGATCAG TTGCATGGTT TCGAGGAGCG GCTCACCACG GACATCTATC CGGCCGACTT CGGCTGGACG CCCGACTGGG AGCATTTTGA AACACGCCCC ACGTGGTATC ACAACATGAG TTCCGTGATC GACGCCGGCC CGTGCGTGCG TACCAACCAG CTCGACTTCG ACGACGAAGT CACGTTCACG ACTCGCCAGA AACTCTTCGA TATCGCACGC GAACGTCATG CGGGAAAGGA TGCGCGGCCG TTCTGCCTGG TCGCCTCGCT GACGCATCCG CACGATCCCT ACGCGATTCC GCAAAAGTAT TGGGACATGT ATCGCGACGA AGAGATCGAC ATGCCCGCGT TTCGCGATTC GTTCGAAGAC GCCGACCCGC ACTCGAAGCG CCTGCGCCAT GTCTGCGAAA CCGACCGCAC GCCGCCCACC GATCAGCAGA TCCGCAATGC GCGCCGCGCG TATTACGGCG CGATCTCCTA CGTCGACGAT CAGTTCGGCG CGATCCTCGA AGCGCTCGAA CAATCCGGCC TCGCGAAAGA CACTGTGATC GTGGTGACTT CCGATCACGG CGAGATGCTC GGCGAGCGCG GGCTCTGGTA CAAGATGACC TTCTTCGAAG GCGGTTGCCG CGTTCCGCTG ATCGTGCACG CACCGCAGCA ATTCGACGCG CATCGCGTAG AGGCGAGCGT CTCGCATCTC GATCTGCTGC CCACGCTGGT CGAGTTGGCA CGCGGCGAAC AACCGGCCGT GTGGCCCGAT TCGCTCGACG GACAAAGCCT TGTTCCGCAT CTTATGAGCG GGCAAGGCGG GCAAGGCGGC CATGACGAAG CGATCGGCGA ATATCTGGCC GAAGGGGCCA TTGCGCCGAT CGTGATGTTG CGGCGCGGCC GCTTCAAGTT CATTCACACA CCGGCCGATC CGGATCAACT CTACGATGTC GCCGCAGACC CGCTGGAACG CGAGAACCTT GCCGCGCGCA GCGAATACGC GTCACAGGTC ACGGCTTTCC GTGAGGAGAT CGCGCAACGC TGGGATCTCG CCGCGCTGCA CCACGAAGTG CTGCAAAGCC AGCGGCGCCG TCATTTTCAT TTTGCCTCCA CGACGCAGGG CGTGGTCGCG TCATGGGACT GGCAGCCGCA CGTGGATGCG AGCCAGCGTT ACATGCGCAA TCACATCGAT CTGGATTCGC TCGAAGCGAT GGCGCGCTTT CCCGCCGTCG TGCGTTGA
|
Protein sequence | MLETKKNILI LMADQMTPFA LAAYGHRLTK TPNLDRLAKQ GVVFESAYCA SPLCAPSRFS FLSGKLPSAI GAYDNAAEFP SQTLTFAHYL RAEGYRTILS GKMHFCGADQ LHGFEERLTT DIYPADFGWT PDWEHFETRP TWYHNMSSVI DAGPCVRTNQ LDFDDEVTFT TRQKLFDIAR ERHAGKDARP FCLVASLTHP HDPYAIPQKY WDMYRDEEID MPAFRDSFED ADPHSKRLRH VCETDRTPPT DQQIRNARRA YYGAISYVDD QFGAILEALE QSGLAKDTVI VVTSDHGEML GERGLWYKMT FFEGGCRVPL IVHAPQQFDA HRVEASVSHL DLLPTLVELA RGEQPAVWPD SLDGQSLVPH LMSGQGGQGG HDEAIGEYLA EGAIAPIVML RRGRFKFIHT PADPDQLYDV AADPLERENL AARSEYASQV TAFREEIAQR WDLAALHHEV LQSQRRRHFH FASTTQGVVA SWDWQPHVDA SQRYMRNHID LDSLEAMARF PAVVR
|
| |