Gene Information |
Locus tag | Bamb_4484 |
Symbol | |
ID | 4313376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria AMMD |
Kingdom | Bacteria |
Replicon accession | NC_008391 |
Strand | - |
Start bp | 1428308 |
End bp | 1429843 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638152323 |
Product | sulfatase |
Protein accession | YP_776368 |
Protein GI | 115359230 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
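The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 1428308
end_bp = 1429843
gene_length_bp = 1536
protein_length_aa = 511


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions both flags evaluate to `True`, which suggests the reported lengths follow directly from the coordinates.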
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGAACC CGACTCCCAA CATCCTGATC CTGATGGCCG ACCAGCTGAC GCCGTTCGCG CTGCCGGCCT ACGGCAACCG CGTCGCGCGC ACGCCGACGC TCGACCGTCT CGCCGCGCAA GGCGTGGTGT TCGACGCCGC GTATTGCGCG AGCCCGCTGT GCGCGCCGTC GCGCTTCTCG CTGCTGACCG GCAAGCTGCC GTCGGGTATC GGCGCCTACG ATAACGCCGC CGAATTGCCG GCGCAAACGC TGACGTTCGC GCACTACCTG CGCGCCGGCG GCTACCGCAC GATGCTGTCC GGCAAGATGC ATTTCTGCGG ACCCGACCAG CTGCACGGTT TCGAGGAGCG GCTCACGACC GACATCTATC CGGCCGACTT CGGCTGGGTG CCCGACTGGG ACCACCCGAC CGAGCGGCCG AGCTGGTATC ACAACATGAG CTCGGTGCTC GAAGCCGGCC CGTGCGTGCG CACCAACCAG CTCGACTTCG ACGACGAGGT CACGTTCGCG GCCAAACAGA AGCTGTACGA CGTCGCGCGC GAGCGTGCGG CCGGGCACGA CGCGCGGCCG TTCTGCATGG TCGTGTCGCT GACCCATCCG CACGATCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACAGCGACGA CGAGATCGAC ATGCCGGCCG TGCGGCTCGA TGCGGAGCAG AGCGATCCGC ATTCGCAGCG GCTGCGCTTC GTCTGCGAAA ACGACCGCAC GCCGCCGACC GACGCGCAGA TCCGCGCCGC GCGCCGCGCA TACTACGGCG CGACGTCGTA CGTCGACGCG CAGTTCGGCA GCGTGCTGGC CGCGCTCGAG CAATGCGGAT TCGCCGACGA CACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC GGCGAGCGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG ATCGTCCACG CGCCGGGCCG CTTCGACGCC GCGCGCGTGC GCGGGCCCGT GTCGCATCTC GACCTGCTGC CGACGCTCGT CGACCTGACC GGCGCGGCGC CCGCCGGCGG CTGGCCGGAC CCGGTCGATG GCGCGAGCCT CGTGCCGCAT CTGCAAGGCA CGCCCGCGCA CGACGTGGCA CTGGGCGAAT ACCTCGCGGA AGGGGCGGTC GCGCCGGTCG TGATGATCCG CCGCGGCGAC TGGAAGTACG TGCATTGCCC GGCCGACCCC GACCAGCTCT ACCATCTCGC CGACGATCCG CGCGAGCGCA CGAACCTGGC CGGCCTGCCC GAAGCCGCCG ACGTGCTCGC CGCGTTCCGC GCGGAGGCCG CGCGGCGCTG GAACCTGCCG GAACTCGACG CGCAGGTGCG CGCGAGCCAG CGGCGGCGGC GCTTCCATTA CGCGGCGACG ACGCAGGGCC GCATCCAGGC GTGGGACTGG CAGCCGTTCA CCGATGCGAG CCAGCGCTAC ATGCGCAATC ACATCGAACT CGATACGCTC GAAGCGATGG CGCGCTTTCC GCGCGTCGGG CGCTGA
Protein sequence | MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAQ GVVFDAAYCA SPLCAPSRFS LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT DIYPADFGWV PDWDHPTERP SWYHNMSSVL EAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR ERAAGHDARP FCMVVSLTHP HDPYAITREY WDLYSDDEID MPAVRLDAEQ SDPHSQRLRF VCENDRTPPT DAQIRAARRA YYGATSYVDA QFGSVLAALE QCGFADDTIV IVTSDHGDML GERGLWYKMT FFEGGCRVPL IVHAPGRFDA ARVRGPVSHL DLLPTLVDLT GAAPAGGWPD PVDGASLVPH LQGTPAHDVA LGEYLAEGAV APVVMIRRGD WKYVHCPADP DQLYHLADDP RERTNLAGLP EAADVLAAFR AEAARRWNLP ELDAQVRASQ RRRRFHYAAT TQGRIQAWDW QPFTDASQRY MRNHIELDTL EAMARFPRVG R
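The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method the database uses: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
n_terminus = translate("ATGACGAACC CGACTCCCAA CATCCTGATC")
# Last 15 bases of the gene sequence above, including the stop codon.
c_terminus = translate("CGCGTCGGGCGCTGA")
```

Here `n_terminus` is `MTNPTPNILI` and `c_terminus` is `RVGR`, matching the first ten and last four residues of the protein sequence listed above.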