Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_6681 |
Symbol | |
ID | 6280298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | + |
Start bp | 2995296 |
End bp | 2996981 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642617713 |
Product | sulfatase |
Protein accession | YP_001890350 |
Protein GI | 187921318 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTG AAACGTCCTT CCCGGCACCT GCCAGGCCCT GCAAACGCGC GCCTTTCGCT GTGGTTGCGG CGGCATTGCT GAGCGTTGCG ATAAGCGGCC AGGCACAAAC GGATTCTCAA CAGCCCATGC GGCCTAACAT CGTGTTGATT GTGGGCGACG ACGTGGGTTG GGGCGATCTC GGAGCCTATG GCGGCGGTGA GGGGCGCGGC ATTCCCACTC CGAACCTTGA CAGGCTGGCC GACGAAGGCA TGACATTTTT CGACTTCTAC GGCCAGCCGA GTTGTACGCC AGGCCGGGCG GCTCTTCAGA CCGGACGCAA TCCGAACCGC AGCGGTATGA CGACCGTAGC GTTCCAGGGG CAAGGCGGCG GCCTGCCTCA CGCGGAATGG ACGCTGGCGT CCATGCTGAA ACTCGCGCAC TACAACACAT ATTTCACCGG CAAGTGGCAC CTCGGCGAAG CGGACTACGC CCTACCGAAC GCACAGGGCT ATGACGACAT GAAGTACGTT GGCCTGTATC ACCTGAATGC ATACACGTAC TCCGATCCGA AATGGTTTCC CGACATGGAC CAGCAAACTC GGGACCTGTT CACCAAGGTA ACGAGAGGCA TGCTCAGCGG CAAGGCCGGC GAAAAGGCTC ATGAAGATTT CAAGCTGAAC GGTCAGTATC AAAACGAGCC CGAAAACCGG ATCGTCGGCA TTCCGTTTGT CGATCGTTAC ATCGAGAAAG CTGCGCTCGA CGATATCGAC GATGCATCCC AACGCGGCCA GCCATTCTTC ATCAACGTAA ACTTCATGAA GGTTCATCAG CCGAACATGC CCGATCCCGA CTACATCGGA AAGTCTCTGT CGAAGTCCAA ATACGCAGAC TCACTGGTGG AACTGGATGC ACGTGTCGGC CACATCATGG ACAAGCTTCG CGAGAAAGGA CTCGACAAGA ACACGCTTGT GTTCTTCACG ACGGACAACG GCGCGTGGCA GGATGTTTAT CCAGATGCGG GCTATACGCC CTTCCGCGGT ACCAAGGGAA CCGACCGCGA GGGCGGAGCG CGTGTTCCCG CCATCGCGTG GTGGCCGGGC AAGATCAAGC CTCACACGAG GAACTTCGAC ATTCTGGGTG GCCTCGACTG TATGGCAACC TTCGCGGCAT TGGCAGGCGT GGATTTACCC AAAAACGATC GCGAAGGCAA ACCCATCGTT TTCGACAGCT ACGATATGTC TCCCGTGCTG CTCGGCACGG GCAAGAGCAA GCGCAATGCC TGGTTCTATT TCACCGAAAA CGAGCTGACG CCGGGTGCCG TGCGGGTCGG GCAGTTCAAG GCGGTGTTCA ACCTGCGCGG CGACGCAGGC AAGGATACGG GCGGTCTAGC GGTGGACACG AATCTCGGCT GGAAAGGACC TGAGAGCTAC GTCGCAACCG TTCCGCAGGT CTTCGATCTG TATCAGGATC CTCAGGAGCG CTACGACATC TTCATGAACA ACTATACGGA ACACACTTGG ACACTTGTGG CGTTTAACGC TGCTGTCAAG GACCAGATGC AAACGTATGT GAAGTATCCG CCGCGCAAGT TGCAAAGCGA AGGATACGCC GGACCGATTA CCCTCACCCA GTATCAACGA TTCAAGTACA TCCGGGATCA ACTGCAGGAG AACGGTTTCA ACATTCCGAT GCCAACCGGG AACTGA
|
Protein sequence | MKPETSFPAP ARPCKRAPFA VVAAALLSVA ISGQAQTDSQ QPMRPNIVLI VGDDVGWGDL GAYGGGEGRG IPTPNLDRLA DEGMTFFDFY GQPSCTPGRA ALQTGRNPNR SGMTTVAFQG QGGGLPHAEW TLASMLKLAH YNTYFTGKWH LGEADYALPN AQGYDDMKYV GLYHLNAYTY SDPKWFPDMD QQTRDLFTKV TRGMLSGKAG EKAHEDFKLN GQYQNEPENR IVGIPFVDRY IEKAALDDID DASQRGQPFF INVNFMKVHQ PNMPDPDYIG KSLSKSKYAD SLVELDARVG HIMDKLREKG LDKNTLVFFT TDNGAWQDVY PDAGYTPFRG TKGTDREGGA RVPAIAWWPG KIKPHTRNFD ILGGLDCMAT FAALAGVDLP KNDREGKPIV FDSYDMSPVL LGTGKSKRNA WFYFTENELT PGAVRVGQFK AVFNLRGDAG KDTGGLAVDT NLGWKGPESY VATVPQVFDL YQDPQERYDI FMNNYTEHTW TLVAFNAAVK DQMQTYVKYP PRKLQSEGYA GPITLTQYQR FKYIRDQLQE NGFNIPMPTG N
|
| |