Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_2053 |
Symbol | |
ID | 6282260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | + |
Start bp | 2319705 |
End bp | 2321276 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642621618 |
Product | sulfatase |
Protein accession | YP_001895684 |
Protein GI | 187924042 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000240603 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGCCA TCCGAAACGT CCTTTTCATC ATGTGTGACC AGTTGCGGCG GGATCACCTC GCCTGCTACG GCCACCCATA CATGCGCACA CGCAACATCG ACACGCTCGC CGCGCGCGGC GTGCGCTTCG ACAACGCCTA TGTCAGCTCC GGCGTCTGCG GGCCGTCGCG CATGTCGTAT TACACCGGCC GCACGATGAC GAGCCACGGC GCGAACTGGA ATCGCGTGCC GATGTCCATC GGCGAGATCA CGCTTGGGGA GTATCTGCTC CGCCATGGAC GCTCGCTCGC CCTCGCCGGA AAGACCCATG TTCAGGCGGA CGCGTCGGGC ATGGAGCGGC TGGAAATCGG TAGGGACAGC CCGCTTGGGC TGCGCCTTTC TGCGGGCTCG TTCGTCGAAC TGGACCGTTA CGACGGGCAT CACGAACCGG GCCGCGAAAG CGGTTATCCC GCGTATCTGC GCGCACACGG GTACGCGAGC GACCGGCCGT GGAGCGACTA TGTGATCAGC GTCGAAGACG ACCGCGGCGA GGTCCGCTCG GGCTGGCAGA TGCGCAATGT GCGCTGGCCG TCGCGCGTCG CGGAGCCGCA CTCCGAAACA GCGTATATGA CCGATCAGGC GATCCGCCAT ATCGAACAAC AAGGCGACGA GCCCTGGGCG TTACACCTGT CGTACGTGAA GCCGCATTGG CCGTATGTCG CGCCGCATCC CTATCACGAC ATGTACTCGC TCGATCAGTG CCTGCCGCTC GTGCGCCACA CCGCGGAGCT GGACAATGCC CACCCGGTGA CGGCCGCTTA TCGCCAGCAT GAAGAGAGCG TCAATTTTTC GCGCGACGAA GTGTCCAATA CGGTGCGGCC CATTTACCAG GGATTGATCC AGCAGATCGA CGATCATCTC GGGCGTCTCT GGGACGTGCT CGACCGGCTG AATCGCTGGG ACGACACGCT GATCGTCCTG ACCGCCGATC ACGGCGATTT CCTCGGCGAT CACTGGCTCG GCGAGAAAGA GCAGTTCTAC GATACCGTGC AGCGCGTGCC GATGGTCGTG TACGATCCAT CGCCCGAAGC GAACGCCACG CGCGGTACGG CCGAGGCGCG CTTCACGTCC TGCATCGACG TGGTGCCGAC CGTGCTCGAA GCGCTCGGCC TGCCTTCGTG TGAGGAGCGG ATCGAAGGAA AATCGTTGCT GCCGTTGCTG CGCGACACAC TCACGCAGGA AAACGGATGG CGTGACTATG TGGTCTCCGA ACTCGACTAC AGTTTTCGCG GAGCGCGCCT GACGCTCGGC CGCGCGCCGC ACGAATGCCG CGGCTGGATG GTGCGCGACG CGCGCTGGAA ATACGTGCAC TGGCTCGGCT ACCGGCCGCA ACTGTTCGAC CTCGAAGCGG ACCCGAACGA GTTTATCGAT CTCGGCGGCG AGCGCACGCA TGAAGCGACG CGTACGCAAA TGCATGCCAA GCTGGTCGAC TGGCACGCGA CGCTCAAGCA GCGCGTGACG ATCGACGATG CGGGCGTCGC GAGCCGCACA AATACGCACA GGGACTGGGG CGTGTTCTTC GGCGAGTGGT AA
|
Protein sequence | MTAIRNVLFI MCDQLRRDHL ACYGHPYMRT RNIDTLAARG VRFDNAYVSS GVCGPSRMSY YTGRTMTSHG ANWNRVPMSI GEITLGEYLL RHGRSLALAG KTHVQADASG MERLEIGRDS PLGLRLSAGS FVELDRYDGH HEPGRESGYP AYLRAHGYAS DRPWSDYVIS VEDDRGEVRS GWQMRNVRWP SRVAEPHSET AYMTDQAIRH IEQQGDEPWA LHLSYVKPHW PYVAPHPYHD MYSLDQCLPL VRHTAELDNA HPVTAAYRQH EESVNFSRDE VSNTVRPIYQ GLIQQIDDHL GRLWDVLDRL NRWDDTLIVL TADHGDFLGD HWLGEKEQFY DTVQRVPMVV YDPSPEANAT RGTAEARFTS CIDVVPTVLE ALGLPSCEER IEGKSLLPLL RDTLTQENGW RDYVVSELDY SFRGARLTLG RAPHECRGWM VRDARWKYVH WLGYRPQLFD LEADPNEFID LGGERTHEAT RTQMHAKLVD WHATLKQRVT IDDAGVASRT NTHRDWGVFF GEW
|
| |