Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_B0587 |
Symbol | |
ID | 3752351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | + |
Start bp | 658337 |
End bp | 659872 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637765435 |
Product | sulfatase |
Protein accession | YP_371345 |
Protein GI | 78061437 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC CGACACCCAA CATCCTGATC CTGATGGCCG ACCAGCTCAC GCCGTTCGCG CTGCCGGCCT ATGGCAATCG CGTCGCGCGC ACGCCGACGC TGGACCGTCT CGCCGCGCAA GGCGTCGTAT TCGACGCCGC ATACTGCGCG AGCCCGCTGT GCGCGCCGTC GCGTTTCTCG CTGCTGACCG GCAAGCTGCC GTCGGGGATC GGCGCCTACG ATAACGCCGC CGAATTGCCG GCGCAAACAT TGACGTTCGC ACACTACCTG CGCGCGGGCG GCTACCGGAC GATGCTGTCC GGCAAGATGC ATTTCTGCGG GCCCGACCAG TTGCACGGCT TCGAGGAGCG GCTGACGACC GACATCTATC CGGCCGATTT CGGCTGGGTG CCCGACTGGG ACAGCCCGAC CGAGCGGCCG AGCTGGTATC ACAACATGAG TTCGGTGCTG GAGGCCGGCC CGTGCGTGCG CACGAACCAG CTCGACTTCG ACGACGAAGT CACGTTTGCC GCGAAGCAGA AGCTGTACGA CGTCGCGCGC GAGCGTGCGG CCGGGCACGA TGCGCGGCCG TTCTGCATGG TCGTGTCGCT GACTCATCCG CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCAAT ACAGCGACGA CGAGATCGAC ATGCCGGCCG TGCACCTCGA TGCGGCGGAA AGCGACCCGC ATTCGCAGCG GCTGCGCTTC GTCTGCGAAA ACGACCGCAC GCCGCCCACC GACGCGCAGA TTCGCGCCGC GCGCCGTGCC TACTACGGTG CGACGTCCTA CGTCGACGCG CAGTTCGGCA GCGTGCTGGG CGCGCTCGAA CAGTGCGGAT TCGCCGACGA TACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC GGGGAGCGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG ATCGTGCATG CGCCGGGCCG CTTCGATGCC GCGCGTGTGC GCGGGCCTGT GTCGCACGTC GACCTGCTGC CGACGCTCGT CGATCTCGCC GGCGCCGCAC CGGCCGGCGG CTGGCCTGAC CCGGTCGATG GCGCGAGTCT CGTGCCGCAC CTGCACGGCA CGCCGGCGCA CGACGTCGCG CTCGGCGAAT ACCTTGCGGA AGGCGCAGTT GCACCGGTCG TGATGATCCG CCGCGGCGAC TGGAAGTACG TCCATTGCCC GGCCGATCCC GACCAGCTCT ACAACCTCTC CGACGACCCG CGCGAGCTCA CGAACCTCGC CGACACGCCG GAAGCGGCCG ACGTGCTCGC TACGTTCCGC GCGCAAGCCG CGCAGCGCTG GAACCTGCCC GAGCTGGACC GGCAGGTGCG CGCGAGCCAG CGGCGCCGGC GCTTCCATTA CGCGGCGACG ACGCAGGGCC GCATCCAGGC GTGGGACTGG CAGCCGTTCA CCGACGCGAG CCAGCGCTAC ATGCGCAATC ACATCGAACT CGACACGCTC GAGGCGATGG CGCGTTTTCC GCGCGTCGGG CGCTGA
|
Protein sequence | MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAQ GVVFDAAYCA SPLCAPSRFS LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT DIYPADFGWV PDWDSPTERP SWYHNMSSVL EAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR ERAAGHDARP FCMVVSLTHP HDPYAITREY WDQYSDDEID MPAVHLDAAE SDPHSQRLRF VCENDRTPPT DAQIRAARRA YYGATSYVDA QFGSVLGALE QCGFADDTIV IVTSDHGDML GERGLWYKMT FFEGGCRVPL IVHAPGRFDA ARVRGPVSHV DLLPTLVDLA GAAPAGGWPD PVDGASLVPH LHGTPAHDVA LGEYLAEGAV APVVMIRRGD WKYVHCPADP DQLYNLSDDP RELTNLADTP EAADVLATFR AQAAQRWNLP ELDRQVRASQ RRRRFHYAAT TQGRIQAWDW QPFTDASQRY MRNHIELDTL EAMARFPRVG R
|
| |