Gene Information |
Locus tag | Bphyt_6647 |
Symbol | |
ID | 6278274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | + |
Start bp | 2960051 |
End bp | 2961481 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642617679 |
Product | sulfatase |
Protein accession | YP_001890316 |
Protein GI | 187921284 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
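The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive coordinates (an assumption, though it agrees with the reported Gene Length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2960051
END_BP = 2961481


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1431 476
```

Both values match the Gene Length (1431 bp) and Protein Length (476 aa) rows of the table.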
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.707956 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCA CGAACGTCCT CTTCATCCTG TCTGACGAGC ACCAGCACAA CCTGATGGGC TGCGCCGGGC ACCCGGTCAT CAAGACGCCC TCGCTCGACG CGCTCGCGCA GCGCGGCACG CGCTTCGAGA ACGCCTACAC GCCGTCGCCG ATCTGCGTGC CGGCGCGGGC AAGCCTCGCG ACGGGGCGCT ATGTCCACGA CATCCGCTGC TGGGACAACG CGATCGCCTA CGACGGCAGC ACGCCGGGCT GGGCGCAGCA CCTGTCGGCG AGCGGCGTGC TGACGGAATC GATCGGCAAG CTGCACTACA AGTCGGACGC GTCGCCCGTC GGATTTCGCC GGCAGCAACA CGCGGTGCAT ATCCTCGATG GGATTGGCCA GGTGTGGGGG TCGGTGCGCA ATCCGATGCC CGAAACCATG GGCCGCTCGC CGCTATACGA CAAGATCGGC CCCGGCACGT CCGACTACAA CCGCTTCGAC ATGCGCGTCG CCGATACGGC ATGTGGCTGG CTCGGTGAGC ATGCCGCCGA CGACAAGCCC TGGGTGCTGT TCGTCGGGCT CGTCGCACCG CACTTTCCGC TCGTCGTGCC GCAGGATTTT CTCGATCTCT ACGATCCACG CGAAATCGAC CTGCCGCTAC TGCATCCGTC GACGGGTTAT GTACGGCATC CGTGGGTGGA GCGTCAGGCG CGGCATATGG ATCACGATGC GGCGATCGGC AGCGACGAAC GCAGGCGTCT TGCCGTCGCA TGCTATTACG CGCTGGTGTC GTTCCTCGAC GCGCAAGTCG GCAAAGTGCT TGCCGCACTG CGGGCAAGCG GACTCGACGA TTCGACGACG ATCATCTACA GCAGCGATCA CGGCGATAAT CTCGGCAAGC GCGGCATGTG GAACAAGTGT CTGATGTACC GCGAATCGAC AGGTGTGCCG ATGATCGTCG CGGGTCCTGG CATTCCGGCG AGCAAGGTGA GTGAAACGCC TGTGTCGCTG ATCGATATCC AGAACACGCT GCTCGAATGC ACGGGCTGCG AAGCAGCGCT GATCGATGGT CCAGGAAAGT CTCTCGTCGA ACTCGCGTGT GCGGAAGACG ACGTCGGGCG TCTCGCGTTC AGCGAATATC ACGCTGTCGG ATCGGAAAGC GCAGCGTATA TGCTCGCGGA TAGCCACTAC AAGTATCACC ATTATCTCGG CATGAAGCCG GAACTGTTCG ATGTGAAGAA CGACCCGGAA GAGATGCGCG ATCTTGCGTC GCTGCCCGAA TACGCCGACG TGCTCGCGCA TTTCGAACGA CAGCTTCGCG CGCTGCTCGA TCCCGAAACC ACCGATGCTG CCGCGAAAGC CGATCAGGAC AGACTGGTCC AAGCATTCGG CGGCAGGGAA GCGGCGCTGC GAACGGGCAC ACCCGCAGCG ACGCCCGTGC CGGTTGAATA G
|
Protein sequence | MKPTNVLFIL SDEHQHNLMG CAGHPVIKTP SLDALAQRGT RFENAYTPSP ICVPARASLA TGRYVHDIRC WDNAIAYDGS TPGWAQHLSA SGVLTESIGK LHYKSDASPV GFRRQQHAVH ILDGIGQVWG SVRNPMPETM GRSPLYDKIG PGTSDYNRFD MRVADTACGW LGEHAADDKP WVLFVGLVAP HFPLVVPQDF LDLYDPREID LPLLHPSTGY VRHPWVERQA RHMDHDAAIG SDERRRLAVA CYYALVSFLD AQVGKVLAAL RASGLDDSTT IIYSSDHGDN LGKRGMWNKC LMYRESTGVP MIVAGPGIPA SKVSETPVSL IDIQNTLLEC TGCEAALIDG PGKSLVELAC AEDDVGRLAF SEYHAVGSES AAYMLADSHY KYHHYLGMKP ELFDVKNDPE EMRDLASLPE YADVLAHFER QLRALLDPET TDAAAKADQD RLVQAFGGRE AALRTGTPAA TPVPVE
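The Protein sequence row should follow from the Gene sequence row under translation table 11. The following is a minimal sketch for checking that, assuming the sequence is pasted as shown, with spaces between 10-base groups. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard one, which has the same codon-to-residue assignments as table 11; table 11 differs in which codons may act as start codons.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-residue assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three groups of the Gene sequence row above.
first_groups = "ATGAAACCCA CGAACGTCCT CTTCATCCTG"
print(translate(first_groups))  # MKPTNVLFIL
```

Passing the full Gene sequence string to `translate` and comparing the result with `clean` applied to the Protein sequence row checks the whole record; `gc_fraction` on the same string can be compared with the GC content row.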