Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_B2021 |
Symbol | |
ID | 3753786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | + |
Start bp | 2316211 |
End bp | 2318109 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637766869 |
Product | sulfatase |
Protein accession | YP_372778 |
Protein GI | 78062870 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.46218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.427112 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCC CCGGCACGAG CCGGCACGAT CAGGAGACAG CCGACGTGAA AGACGATCAG GACACCCCCG CCGCCGCCGA ACCCACCCCG CGGCGCCAGT TCCTCAAGCT GGCCGGCGCG GCGGTCGCGG CGGCCGGCTT CGGCACCGAT GCGCTGGCCG CGAGCACGCC GCCGGCCGCC CCGGTTGCCG CCGGCGCGGG CGTCGATCCC GTGCCCGCAT TCCACGGCGC GACGACCGCG CCCGCCGCGC CGCCCGCCGG CTACAACATC CTGTTCATCC TGACCGACCA GGAACGCCAC TTCGACCGCT GGCCGTTTCC GGTGCCCGGC CGCGAAGCGC TGCGCCGCGA CGGCATCACG TTCATGCATC ACCAGATCGC CGCATGCGTG TGCTCGCCGT CGCGCTCGAC GGTCTACACG GGGCAGCACA TCCAGCACAC CGGCGTGCTC GACAACGCGG GCGTGCCCTG GCAGAAGGAC ATGTCGCCGG ACGTGCGCAC CGTCGGCCAC ATGCTGCGCG ACGCCGGCTA CTACGCCGCG TATCTCGGCA AGTGGCACCT GAGCGCGTCG ATGCACGAAA CCGCAAGCCC GTACACGGCG CCGGTGGCCG ACTACAACCG CACGATCCGG TCGTACGGTT TCGACGACTA TTTCGGCGTG GGCGACCTGA TCGGGATGGT GCGCGGCGGC TACCAGTACG ACGGGATCAC GGCCGAGGCC GCGGTGAGCT GGATGCGCAA CCACGCGCCG CGTCTCGCGA AGGAAGGCAA GCCGTGGTTC CTCGCGGTGA ATCTGGTGAA CCCGCACGAC GCGATGTTCG TGAACACCGA CACCAACGGC TCGACGGTGC AGGACGCGAA CCACCCGATG CTCGGCAACG CGCCGCCGCC GAACGACGCG CTGTATCGCA CGTCGTGGCA CGACAAGCCG CTCGCGGCAT CGCGGCGGCA GCCGTACGAC GAACCGGGAC GGCCGCCCGC GCACGGGATG TTCAACGCCG CGCATGCGAA CCTCGTCGGG CGCTATCCGT TCACCGACGA ACGCCTGCGC ATCTATCAGG ACAACTACTT CAACTGCGTG CGCGACTGCG ACACGCACGT CGTGCGCCTG CTGCAGTCGC TGCAGGCGCT CGGCCTCGAC GAACGCACGA TCGTCGTGAT GACGGCCGAC CACGGCGATC ACATCGGCGC GCACCAGCTC GTCGGCAAGG GCGCCACCGC GTACCAGCCG CAGAACCACG TGCCGCTCGT GATCCGCCAT CCCGCGTATC CGGGCGGCAT GCAGTGCGAT GCGCTGACAT CGCACATCGA CATCGCGCCG ACGCTGCTCG GGCTCACCGG TCTCGACGAC GCGCGTCTCG CGTCGATCCG CGGCAGCGCA CTCAAGGGGC ACGACCTCAC GCGCTGGCTC GCGAAGCCGG CCGGGGCGAA GCTGCACGCG GCGCGTGATG CCACGCTGTT CAACTACGCG ATGCTGCTCT ACTACGACAG CGAATGGATG CTGAAGGAAC TGGGCACGAT GCGCCAGAAA GGCGTGCCGG AGGACGAGCT GCTGCGCCGC GCGCTCGCGC AGCAGCCGGA TTTCCGGTTG CGCGGCACGA TCCGCAGCGT GTTCGACGGC CGCTACCGGT TCACGCGCTA TTTCTCGCCG CTCGAATTCA ACCGGCCGAC GACGATGGAG GACCTGTTCG CGCGCAACGA CGTCGAGCTG TTCGATATCG CGAGCGATCC GGGCGAGATG CGCAATCTCG CGATGGACCG GAAACAGCAC GGCGAGCTGC TGCTCGCGAT GAACGGCCGC CTGAACGACC TGATCGCGAG CGAAGTCGGC GACGACAGCC CCGACGTCAT GCCGATCCGC GACGGCAAGG TGCAGGTGCA GATCCGCAAG TGGCATTGA
|
Protein sequence | MKTPGTSRHD QETADVKDDQ DTPAAAEPTP RRQFLKLAGA AVAAAGFGTD ALAASTPPAA PVAAGAGVDP VPAFHGATTA PAAPPAGYNI LFILTDQERH FDRWPFPVPG REALRRDGIT FMHHQIAACV CSPSRSTVYT GQHIQHTGVL DNAGVPWQKD MSPDVRTVGH MLRDAGYYAA YLGKWHLSAS MHETASPYTA PVADYNRTIR SYGFDDYFGV GDLIGMVRGG YQYDGITAEA AVSWMRNHAP RLAKEGKPWF LAVNLVNPHD AMFVNTDTNG STVQDANHPM LGNAPPPNDA LYRTSWHDKP LAASRRQPYD EPGRPPAHGM FNAAHANLVG RYPFTDERLR IYQDNYFNCV RDCDTHVVRL LQSLQALGLD ERTIVVMTAD HGDHIGAHQL VGKGATAYQP QNHVPLVIRH PAYPGGMQCD ALTSHIDIAP TLLGLTGLDD ARLASIRGSA LKGHDLTRWL AKPAGAKLHA ARDATLFNYA MLLYYDSEWM LKELGTMRQK GVPEDELLRR ALAQQPDFRL RGTIRSVFDG RYRFTRYFSP LEFNRPTTME DLFARNDVEL FDIASDPGEM RNLAMDRKQH GELLLAMNGR LNDLIASEVG DDSPDVMPIR DGKVQVQIRK WH
|
| |