Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_B2584 |
Symbol | |
ID | 3754351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | - |
Start bp | 2938524 |
End bp | 2940482 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637767432 |
Product | sulfatase |
Protein accession | YP_373339 |
Protein GI | 78063431 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT CCGCGTCGCC GTTACATGCT TTTCGTTTCC GTGTCGTCTG CGCCGCGATT GCCGGCGCGC TGTCGCTCGC ATCGTGCGGC GGCGTCGACA GCGATCCGCC GCCGTCACCC ACCAGCACCA CGCCGCCGCT GGCTCAGAAG CGCCCGAACA TCCTGTACAT CATGGCCGAC GATCTCGGCT ATTCCGACAT CCATGCATTC GGCGGCGAGA TCAACACGCC GAACCTCGAC GCGCTCGTCG CGTCGGGCCG CATCCTGTCG AACCATCACA CGGGCACCGT CTGCGCGATC ACGCGCGCGA TGCTGGTGTC CGGCACCGAC CACCATCTCG TCGGCGAAGG CACGATGGGC GTGCCGACCG ACGAACGGCG CGGGCTGCCC GGCTACGAGG GCTACCTGAA CGATCGTGCA TTGTCGTTCG CACAACTATT GAAGGATGCC GGCTATCACA CGTACATCGC GGGCAAGTGG CACATCGGCT CGGGGATCGT CGGCAGCGCG ACGGGCAGCG GGCAGACGCC GGACCAGTGG GGCTTCGAGC GCAGCTACGT GCTGCTCGGC GGCGCGGCGA CGAACCACTT CGCGCACGAG CCGGCCGGCT CGTCGAACTA CACGGAAGAC GGCCGCTACG TGCAGCCTGG CCAGCCCGGA CAGCCGGGCG GCACGGGCGG CAGCCCGGCC GTGTTCTATT CGACCGACTT CTATACGCAG AAGCTGATTT CGTACATCGA CTCGAACCAG CGCGACGGCA AGCCGTTCTT CGCGTACGCG GCCTTCACGT CGCCGCACTG GCCGCTGCAG GTACCCGATC CGTGGCTGCA CAAGTACGCG GGCGTGTACG ACGCCGGCTA CGACGCGATC CGCAACGCGC GGATCGCGCG GCAGAAGGCG CTCGGCCTGA TCCCCGCCGA TTTCAAGCCG TTCGACGGCC TGCCTGAAAC GACGGCCGCG TCGCCCGCGA CCGCGAACAA CGGCACGGCC GCCGCGAAAT ACATCAGCGC GGTGCATTCG GCCGCGGACG GCTACAGCGA CTACGGCCCC GGCAAGGTCG ACAAGCTGTG GTCGAGCCTG ACGCCGGCCG AGCGCAAGGC GCAGGCGCGC TACATGGAGA TCTATGCGGG GATGGTCGAG AACCTCGACT ACAACATCGG CCTGCTGATC CAGCACCTGA AGGACATCGG CGAATACGAC AACACGTTCA TCATGTTCCA GTCGGACAAC GGCGCGGAAG GCTGGCCGAT CGACGGCGGC GCCGACCCGA CGGCGACCGA CACCGCGAAC GGGCAGGACC CGATCTATTC GACGCTCGGC ACCGACAACG GCAAGCAGAA CGCGCAGCGC CTGCAGTACG GGTTGCGCTG GGCCGAAGTG AGCGCGTCGC CGTTCCGGCT CACGAAGGGG TATTCCGCTG AAGGCGGCGT ATCGACGCCG ACGATCGTCC GCCTGCCGGG CCAGACGCAG CAGTTGCCGA CGCTGCGCGC CTTCACGCAC GTGACCGACA ACACGGCGAC GTTCCTCGCG GTCGCGGGCG TCACGCCGCC GTCGCAGCCG GCGCCGCCGC TCGTCAACAC ACTGACGGGT GTCGACCAGA ACAAGGGCAA GGTCGTCTAC AACAACCGCT ACGTGTATCC GGTCACCGGC CAGTCGTTGC TGCCGGTGCT GACGGGCACG GCGACGGGCG AAGTGCATAC CGCGCCGTTC GGCGACGAAG CCTACGGCCG TGCATACCTG CGCAGCGCCG ACGGCCGCTG GAAGGCCTTG TGGACCGAGC CGCCGCTCGG GCCGCCCGAC GGTCACTGGC AGCTGTACGA CCTCGCGTCG GATCGCGGCG AGACGACCGA CGTGTCCGCG CAGAACCCGT CGGTGATCAG CACGCTCGTC GACCAGTGGA AGACCTACAT GAGCAACGTC GGCGGTGTCG AACCGCTGCG TCCGCGCGGC TACTACTGA
|
Protein sequence | MKKSASPLHA FRFRVVCAAI AGALSLASCG GVDSDPPPSP TSTTPPLAQK RPNILYIMAD DLGYSDIHAF GGEINTPNLD ALVASGRILS NHHTGTVCAI TRAMLVSGTD HHLVGEGTMG VPTDERRGLP GYEGYLNDRA LSFAQLLKDA GYHTYIAGKW HIGSGIVGSA TGSGQTPDQW GFERSYVLLG GAATNHFAHE PAGSSNYTED GRYVQPGQPG QPGGTGGSPA VFYSTDFYTQ KLISYIDSNQ RDGKPFFAYA AFTSPHWPLQ VPDPWLHKYA GVYDAGYDAI RNARIARQKA LGLIPADFKP FDGLPETTAA SPATANNGTA AAKYISAVHS AADGYSDYGP GKVDKLWSSL TPAERKAQAR YMEIYAGMVE NLDYNIGLLI QHLKDIGEYD NTFIMFQSDN GAEGWPIDGG ADPTATDTAN GQDPIYSTLG TDNGKQNAQR LQYGLRWAEV SASPFRLTKG YSAEGGVSTP TIVRLPGQTQ QLPTLRAFTH VTDNTATFLA VAGVTPPSQP APPLVNTLTG VDQNKGKVVY NNRYVYPVTG QSLLPVLTGT ATGEVHTAPF GDEAYGRAYL RSADGRWKAL WTEPPLGPPD GHWQLYDLAS DRGETTDVSA QNPSVISTLV DQWKTYMSNV GGVEPLRPRG YY
|
| |