Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0753 |
Symbol | |
ID | 4905957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 745448 |
End bp | 747001 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640143859 |
Product | sulfatase family protein |
Protein accession | YP_001074789 |
Protein GI | 126457339 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.1985 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGATA CCGCCGAACC CACCGATATC CAGCCGAACA TTCTCGTCCT GATGGCCGAC CAGCTCACGC CCTTCGCGTT GCGCGCGTAC GGCCATCGCG CGACGCGTAC GCCGACGATC GACCGGCTCG CCGCCGAGGG CGTCGTCTTC GACGCCGCTT ATTGCGCGAG CCCGCTCTGC GCGCCGTCGC GCTTCGCGCT GATGGCGGGC AAGCTGCCGT CGGCGCTCGG CGCTTACGAT AACGCCGCCG AATTGCCGGC GCAAACGCTG ACGTTCGCGC ACTACCTGCG CGCGGCCGGT TACCGGACGA TGCTGTCGGG CAAGATGCAC TTCTGCGGGC CCGATCAGTT GCACGGCTTC GAGGAACGGC TCACGACCGA CATCTATCCG GCCGATTTCG GCTGGGTGCC GGACTGGACG CGTCCCGCCG AGCGGCCGAG CTGGTATCAC AACATGAGCT CGGTGCTCGA CGCCGGGCCT TGCGTGCGGA CCAACCAGCT CGATTTCGAC GACGATGCGA CGTTCGCCGC GCGCCAGAAG ATCTTCGACG TCGCGCGCGA GCGCGCGGCC GGCCGGGACA CGCGGCCGTT CTGCATGGTC GTGTCGCTCA CGCATCCGCA TGATCCGTAT GCGATCACGC GCGAATACTG GGATCTGTAC CGGGACGAGG ACATCGATCT GCCCGCCGTG CAGATGGATT TCGACGCGAG CGACCCGCAT TCGCGGCGGC TGCGCGCCGT ATGCGAGGTC GATCGCACGC CGCCGGAGGA CCTGCAGATC CGGCGCGCGC GGCGCGCGTA CTACGGCGCG ACGTCCTATG TCGACGCGCA GTTCGGCGCG CTGCTCGCGA CGCTCGAGCA ATGCGGGCTC GCCGACGACA CGATCGTGAT CGTCACCGCC GATCACGGCG ACATGCTCGG CGAGCGCGGC CTCTGGTACA AGATGACGTT CTTCGAAGGC GCATGCCGCG TGCCGCTCAT CGTGCACGCG CCGCGCCGGT TTCCGGCCGC GCGCGTGCCG GCGGCCGTGT CGCACGTCGA TCTGCTGCCG ACGCTCGTCG AGCTCGCGAC GGGCGAGCGC CGCGCCGACT GGCCCGACGC CGTCGACGGC CGCAGCCTCG TTCCCCATCT GCGCGGCGAA GGCGGCCATG ACGAGGCGTT CGGCGAATAT CTGGCCGAAG GCGCGATCGC GCCGATCGTG ATGATGCGCC GCGGCAGCCA CAAGTACATC CATTCGCCCG CGGATCCGGA TCAGCTCTTC GATCTGAGGA ATGATCCGCG CGAGCTCGAC AATCTCGCGA ACACGCCCGC CGCGGCAAAG CACGTCGCCG CGTTTCGCAT GGAGCGCGTC GCGCGCTGGG ATCTCGATGC GCTGCATCAG CAGGTGCTCG CGAGCCAGCG CAGGCGGCGC TTCCATTTCG AGGCGACGAC CCAGGGGCGA ATCCGGTCGT GGGACTGGCA GCCGTTCCAG GATGCGAGCC AGCGTTACAT GCGCAATCAC CTCGAACTCG ACGCGCTCGA GGCAGCCGCG CGTTTTCCTC GTCCGCACGC ATGA
|
Protein sequence | MPDTAEPTDI QPNILVLMAD QLTPFALRAY GHRATRTPTI DRLAAEGVVF DAAYCASPLC APSRFALMAG KLPSALGAYD NAAELPAQTL TFAHYLRAAG YRTMLSGKMH FCGPDQLHGF EERLTTDIYP ADFGWVPDWT RPAERPSWYH NMSSVLDAGP CVRTNQLDFD DDATFAARQK IFDVARERAA GRDTRPFCMV VSLTHPHDPY AITREYWDLY RDEDIDLPAV QMDFDASDPH SRRLRAVCEV DRTPPEDLQI RRARRAYYGA TSYVDAQFGA LLATLEQCGL ADDTIVIVTA DHGDMLGERG LWYKMTFFEG ACRVPLIVHA PRRFPAARVP AAVSHVDLLP TLVELATGER RADWPDAVDG RSLVPHLRGE GGHDEAFGEY LAEGAIAPIV MMRRGSHKYI HSPADPDQLF DLRNDPRELD NLANTPAAAK HVAAFRMERV ARWDLDALHQ QVLASQRRRR FHFEATTQGR IRSWDWQPFQ DASQRYMRNH LELDALEAAA RFPRPHA
|
| |