Gene Information |
Locus tag | Bcen2424_5072 |
Symbol | |
ID | 4452956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia HI2424 |
Kingdom | Bacteria |
Replicon accession | NC_008543 |
Strand | - |
Start bp | 2096014 |
End bp | 2097564 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639697128 |
Product | sulfatase |
Protein accession | YP_838698 |
Protein GI | 116693165 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
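The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2096014
end_bp = 2097564
reported_gene_length = 1551      # bp
reported_protein_length = 516    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, leftover = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1551
print(leftover)        # 0, the CDS is a whole number of codons
print(protein_length)  # 516
```

Under these assumptions the computed values agree with the reported gene length of 1551 bp and protein length of 516 aa.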
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.263154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGAACC CGACACCCAA CATCCTGATC CTGATGGCCG ACCAGCTCAC GCCGTTCGCG CTGCCGGCGT ACGGCAACCG CGTCGCCCGT ACGCCGACGC TCGACCGGCT CGCCGCCGAA GGCGTGGTGT TCGACGCCGC GTACTGCGCG AGCCCTTTGT GCGCGCCGTC GCGCTTCTCG CTGCTGACCG GCAAGCTGCC GTCGGGGATC GGCGCCTACG ATAACGCCGC CGAATTGCCG GCGCAAACGC TGACGTTCGC ACACTACCTG CGCGCGGGCG GCTACCGGAC GATGCTGTCC GGCAAGATGC ATTTCTGCGG GCCCGATCAG TTGCACGGCT TCGAGGAGCG CCTCACGACC GACATCTATC CGGCCGATTT CGGCTGGGTG CCGGACTGGG ATCAACCGAC CGAGCGGCCG AGCTGGTATC ACAACATGAG CTCGGTGCTC GATGCCGGCC CGTGCGTGCG TACGAACCAG CTCGACTTCG ACGACGAAGT GACGTTCGCC GCGAAACAGA AGCTGTACGA CGTCGCGCGC GAACGTGCGG CCGGACACGA TACGCGGCCG TTCTGCATGG TCGTGTCGCT GACCCATCCG CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACCGCGACGA AGACATCGAC ATGCCGGCCG TGCGGCTCGA TGCGGCCGAA AGCGATCCGC ATTCGCAGCG GCTGCGCTTC GTCTGCGAGA ACGACCGCAC GCCGCCGACC GACGCGCAGA TCCGCGCCGC CCGCCGCGCG TATTACGGCG CGACGTCCTA CGTCGACACG CAGTTCGGCA GCGTGCTGGC CGCCCTCGAG CAATGCGGGT TCGCCGACGA CACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC GGCGAACGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG ATCGTGCATG CGCCGGGCCG CTTCGGCGCT GCGCGGGTGC GCGGGCCCGT GTCGCACGTC GACCTGCTGC CGACGCTCGT CGAGCTGGCC GGCGCCACGC CCGCCGGCGG CTGGCCGGAC GCCGGATGGC CGGACCCGGT CGACGGTGCG AGCCTCGTGC CGCACCTGCA CGGCACGCCC GCGCACGATG TCGCGCTCGG CGAATACCTC GCGGAAGGCG CGCTCGCGCC GGTCGTGATG ATCCGCCGCG GCGACTGGAA ATACGTGCAT TGCCTGGCCG ATCCCGACCA GCTCTACCAC CTGTCGGACG ACCCGCGCGA GCTGACGAAC CTGGCCGGGC AGCCGGAAGC CGCCGACGTG CTCGCCGCGT TCCGCGTGGA GGCCGCACAG CGCTGGAACC TGCCCGAGCT GGACCGGCAG GTGCGCGCGA GCCAGCGGCG CCGGCGCTTC CATTACGCGG CGACGACGCA GGGCCGCATC CACGCGTGGG ACTGGCAGCC GTTCACCGAC GCGAGCCAGC GCTACATGCG CAATCACATC GAACTCGACA CGCTCGAGGC GATGGCGCGT TTTCCGCGCG TCGGGCGCTG A
Protein sequence | MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAE GVVFDAAYCA SPLCAPSRFS LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT DIYPADFGWV PDWDQPTERP SWYHNMSSVL DAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR ERAAGHDTRP FCMVVSLTHP HDPYAITREY WDLYRDEDID MPAVRLDAAE SDPHSQRLRF VCENDRTPPT DAQIRAARRA YYGATSYVDT QFGSVLAALE QCGFADDTIV IVTSDHGDML GERGLWYKMT FFEGGCRVPL IVHAPGRFGA ARVRGPVSHV DLLPTLVELA GATPAGGWPD AGWPDPVDGA SLVPHLHGTP AHDVALGEYL AEGALAPVVM IRRGDWKYVH CLADPDQLYH LSDDPRELTN LAGQPEAADV LAAFRVEAAQ RWNLPELDRQ VRASQRRRRF HYAATTQGRI HAWDWQPFTD ASQRYMRNHI ELDTLEAMAR FPRVGR
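The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks, compute a GC fraction, and translate the nucleotides into amino acids. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which start codons are allowed), and it uses only the first three blocks of the gene sequence as a demonstration. The helper names `join_blocks`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between ten-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stops appear as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACGAACC CGACACCCAA CATCCTGATC"
seq = join_blocks(first_blocks)

print(translate(seq))              # MTNPTPNILI
print(round(gc_fraction(seq), 3))  # 0.533 for these 30 bases only
```

The translated fragment matches the first block of the protein sequence, which suggests the listed gene sequence is already in coding orientation even though the gene lies on the minus strand. Pasting the full gene sequence into `first_blocks` would allow the reported GC content and protein sequence to be checked the same way.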