Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bmul_5130 |
Symbol | |
ID | 5769737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia multivorans ATCC 17616 |
Kingdom | Bacteria |
Replicon accession | NC_010086 |
Strand | - |
Start bp | 2273473 |
End bp | 2275422 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641319421 |
Product | sulfatase |
Protein accession | YP_001585092 |
Protein GI | 161521665 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.173338 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGT CCGCGTCACG TTCGGTCCCG TTCCGTCTTC GCGTCGTATG CGCGATCGCC GCCGGTGCGC TGTCGCTCGC GTCGTGCGGC GGCGTCGACG GCAACCCGCC TCCGCAGGCC GACGCGACGC CATCGGCGAA GCGCCCGAAC ATCCTCTACA TCATGGCCGA CGATCTCGGC TATTCCGACA TCCACGCGTT CGGCGGCGAG ATCAACACGC CGAACCTCGA CGCGCTCGTC GCGTCGGGCC GCATCCTGTC GAACCACCAT ACGGGCACCG TATGCGCGAT CACGCGCGCG ATGCTGATTT CGGGCACGGA TCACCATCTC GTCGGCGAAG GCACGATGGG CGTGCCGACC GACGAGCGGC GCGGGCTGCC GGGCTACGAG GGCTATCTGA ACGATCGCGC GCTGTCGTTC GCGCAGCTGC TGAAGGATGC CGGCTATCAC ACGTATATCG CGGGCAAGTG GCACATCGGC TCGGGCATCG TCGGCAGCGC GACGGGCAGC GGGCAGACGC CCGATCAATG GGGCTTCGAG CGCAGCTACG TGCTGCTCGG CGGCGCGGCG ACGAACCACT TCGCGCACGA GCCGGCCGGC TCGTCGAACT ACACGGAGGA CGGCCGCTAC GTGCAGCCGG GCCAGCCCGG GCAGCCGGGC GGCACGGGCG GCAATCCGGC GGTGTTCTAT TCGACGAATT TCTATACGCA GAAGCTGATC CAGTACATCG ATTCGAATCA CAGCGACGGC AAGCCGTTCT TCGCCTATGC GGCATACACG TCGCCACACT GGCCGCTGCA GGTGCCCGAT CCGTGGCTGC ACAAGTACGC GGGCGTCTAC GACGCCGGCT ACGATGCGAT CCGCAACGCG CGAATCGCGC GGCAGAAGGC GCTCGGCCTG ATTCCGGCCG ACTTCAAGCC GTTCGACGGA TTGCCCGAGA CGACGGTTGC GTCGCCAGCG ACGGCGAACG ACGGCACCGC CAACGCGAAA TACGTCAGCG CCGTGCATTC GGCGGCCGAC GGCTACCGCG ACTACGGCGC GGGCAAGGTC GACAAGCTGT GGTCGAGCCT GAGCCCGGCC GAACGCAAGG CGCAGGCGCG CTACATGGAG ATCTACGCGG GGATGGTCGA GAACCTCGAC TACAACATCG GCCTGCTGAT CCAGCACCTG AAGGACATCG GCGAATACGA CAACACGTTC ATCATGTTCC AGTCGGACAA CGGCGCGGAA GGCTGGCCGA TCGATTCGGG CGCGGACCCG ACCGCGACCG ATACCGCGAA CGCGCAGGAA CCGACCTATT CGGCGCTCGG CACCGACAAC GGCAAGCAGA ACGCGCAGCG GCTGCAGTAC GGGCTGCGCT GGGCCGAGGT GAGCGCGACG CCGTTCCGGC TCACGAAGGG CTATTCGGCC GAAGGCGGCG TGTCGACGCC GACGATCGTT CATCTGCCGG GCCAGACGCA GCAGCTGCCG ACGCTGCGCG CGTTCACGCA CGTGACCGAC AACACGGCGA CGTTCCTCGC GGTAGCGGGC GTGACGCCGC CGTCGCAGCC GGCGCCGCCG CTGATCAACA CGCTGACGGG CGTCGATCAG AACAAGGGCA AGGTCGTATA CGGCAACCGC TACGTCTATC CCGTCACCGG CCAGTCGCTG CTGCCGGTGC TGACCGGCGC CGCGAACGGC GAAGTGCACA CCGCGCCGTT CGGCGACGAA GCCTACGGCC GCGCGTATCT GCGCAGTGCC GACGGCCGCT GGAAAGCGTT GTGGACGGAG CCGCCGCTCG GGCCGCTCGA CGGTCACTGG CAGCTGTACG ACCTCACGAC GGACCGCGGC GAGACGATCG ACGTGTCCGC GCAGAATCCG TCGGTGGTCA GCACGCTGAT CGATCAGTGG AAGGCGTACA TGAGCAACGT CGGCGGCGTC GAGCCGCTGC GTCCGCGCGG CTACTACTGA
|
Protein sequence | MSMSASRSVP FRLRVVCAIA AGALSLASCG GVDGNPPPQA DATPSAKRPN ILYIMADDLG YSDIHAFGGE INTPNLDALV ASGRILSNHH TGTVCAITRA MLISGTDHHL VGEGTMGVPT DERRGLPGYE GYLNDRALSF AQLLKDAGYH TYIAGKWHIG SGIVGSATGS GQTPDQWGFE RSYVLLGGAA TNHFAHEPAG SSNYTEDGRY VQPGQPGQPG GTGGNPAVFY STNFYTQKLI QYIDSNHSDG KPFFAYAAYT SPHWPLQVPD PWLHKYAGVY DAGYDAIRNA RIARQKALGL IPADFKPFDG LPETTVASPA TANDGTANAK YVSAVHSAAD GYRDYGAGKV DKLWSSLSPA ERKAQARYME IYAGMVENLD YNIGLLIQHL KDIGEYDNTF IMFQSDNGAE GWPIDSGADP TATDTANAQE PTYSALGTDN GKQNAQRLQY GLRWAEVSAT PFRLTKGYSA EGGVSTPTIV HLPGQTQQLP TLRAFTHVTD NTATFLAVAG VTPPSQPAPP LINTLTGVDQ NKGKVVYGNR YVYPVTGQSL LPVLTGAANG EVHTAPFGDE AYGRAYLRSA DGRWKALWTE PPLGPLDGHW QLYDLTTDRG ETIDVSAQNP SVVSTLIDQW KAYMSNVGGV EPLRPRGYY
|
| |