Gene Information |
Locus tag | Bphyt_4474 |
Symbol | |
ID | 6279540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | - |
Start bp | 542836 |
End bp | 544782 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642615567 |
Product | sulfatase |
Protein accession | YP_001888220 |
Protein GI | 187919189 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
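The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminating stop codon. Both are assumptions about this record's conventions, and the variable names are made up for the example.

```python
# Values copied from the Gene Information table.
start_bp = 542836
end_bp = 544782
gene_length_bp = 1947
protein_length_aa = 648

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)
```

Under those assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.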
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAC CACGATCGCA GCGCCCTTCT CTTCGTCTTG GCATGATCGG CGCCGCCGTC GCCAGTCTCG TCGCTCTCGC CTCCTGCGGC GACGACAATC TGCCCGGCCC GACTCCCACG CCCGTAGTCG CCCAGAAACG CCCGAACATT CTGTACATCA TGGCCGACGA CCTCGGCTAT TCCGACATCC ACGCGTTCGG CGGCGAAATC AACACGCCCA ACCTCGACGC GCTGGTGCAA TCGGGCCGCA TCCTGACGAA TCACCATACG GGCACCGTCT GCGCGATCAC GCGCTCCATG TTGATCTCCG GCACCGATCA CCATCTGGTC GGCGAAGGCA CCATGGGTGT GCCGACCGAC GAGCGCAAAG GACTGCCGGG TTATGAAGGC TATCTGAACG ACCGCGCGCT CTCCGTTGCG CAACTGCTGA AGGACGGCGG TTATCACACG TATATGGCCG GCAAATGGCA TATCGGCTCG GGCATTGTCG GCAGTACGAC AGGCGGCGGA CAGACGCCCG ACCAATGGGG CTTCGAACAT AGCTATGCCT TGCTCGGCGG CGCCGCGACC AATCACTTCG CGCACGAACC GGCGAACTCG CACAACTACA CCGAAGACGG CAAATACGTG CAGCCCGGTC AGCCGGGACA ACCGGGTGGC GCGGGCGGCA GCCCCGCAGT GTTCTATTCG ACCGACTTTT ATACGCAGCG CCTGATCTCG TACATCGATT CGAACAAGGG CGACGGCAAA CCGTTCTTCG CTTACGCGGC CTACACGTCG CCGCATTGGC CGCTGCAGGT ACCCGAACCT TATCTGCACA ACTACGCCGG CAAATACGAC GCCGGTTACG ACGCCATCCG CAACGCGCGC ATTGCGCGGC AGAAAGCGCT CGGCATCATT CCGAACGACT TCGTGCCGTA CGGCGGCGCA TCGGAAACGC TCGTCGCCAC CGCGGCCACG GCGAACAACG GCACGGTGAA CGCGAAGTAC GTGAGCGCCG TGCATAGCGC GGCGCAAGGC TATACCGACT ACGGCCCGGG CACGGTGAAC AAGACGTGGG CGAGTCTCTC GCCCGCCGAG AAGAAAGCCC AGGCGCGTTA TATGGAAATC TACGCGGGCA TGGTCGAAAA CCTCGATCAC AACATCGGGT TGCTGATCCA GCATCTGAAG GATATCGGCG AGTACGACAA TACCTTCATC ATGTTCCAGT CGGACAACGG CGCGGAAGGC TGGCCGATCG ACTCGGGCGC CGACCCGACC GCGACCGACA CCGCCAATGC CGCCGACCCG GTTTATTCGG CACTCGGCAC CGACAACGGC AAGCAGAACG CGCAACGTCT GCAATACGGT TTGCGCTGGG CCGAAGTCAG CGCCACGCCG TTCCGTCTCA CCAAGGGCTA TTCGGGCGAG GGCGGCGTCT CCACGCCGTT GATCGTCCAT CTGCCCGGTC AGAGCACGCA GAAACCGACA CTGCGCGACT TCACGCACGT GACCGACAAT ACCGCGACGT TCCTTGCTGT CGCACAGATT TCGCCGCCCA CGCAGGCCGC GCCGCCGCTG ATCAATTCTC TGACCGGCGT CGATCAGAAC AAGGGCAAGG TGGTCTACAA CAATCGCTAC GTCTATCCGG TCACCGGGCA GTCGCTGCTG CCGCTGTTGA ACGATCAGGC GACGAGCGCG GTGCACAGCG CATCGTTCGG CGACGAAGCC TATGGCCGCG GCTATCTGCG CAGCGCCGAC GGCCGCTGGA AAGCGTTGTG GACGGAACCG CCGCTCGGTC CCGTGGACGG TCACTGGCAA 
CTGTTCGACA TGAGCGCCGA CCGTGGCGAA ACGCAGGACG TGTCGACGCA GAATCCCTCG GTGATCGACG GTCTCGTGCA GCAGTGGAAC AACTATATGA GCAGCGTCGG CGGCGTCGAA CCGTTGCGTC CGCGCGGTTA CTACTGA
|
Protein sequence | MTTPRSQRPS LRLGMIGAAV ASLVALASCG DDNLPGPTPT PVVAQKRPNI LYIMADDLGY SDIHAFGGEI NTPNLDALVQ SGRILTNHHT GTVCAITRSM LISGTDHHLV GEGTMGVPTD ERKGLPGYEG YLNDRALSVA QLLKDGGYHT YMAGKWHIGS GIVGSTTGGG QTPDQWGFEH SYALLGGAAT NHFAHEPANS HNYTEDGKYV QPGQPGQPGG AGGSPAVFYS TDFYTQRLIS YIDSNKGDGK PFFAYAAYTS PHWPLQVPEP YLHNYAGKYD AGYDAIRNAR IARQKALGII PNDFVPYGGA SETLVATAAT ANNGTVNAKY VSAVHSAAQG YTDYGPGTVN KTWASLSPAE KKAQARYMEI YAGMVENLDH NIGLLIQHLK DIGEYDNTFI MFQSDNGAEG WPIDSGADPT ATDTANAADP VYSALGTDNG KQNAQRLQYG LRWAEVSATP FRLTKGYSGE GGVSTPLIVH LPGQSTQKPT LRDFTHVTDN TATFLAVAQI SPPTQAAPPL INSLTGVDQN KGKVVYNNRY VYPVTGQSLL PLLNDQATSA VHSASFGDEA YGRGYLRSAD GRWKALWTEP PLGPVDGHWQ LFDMSADRGE TQDVSTQNPS VIDGLVQQWN NYMSSVGGVE PLRPRGYY
|
| |