Gene Information |
Locus tag | BURPS1106A_3946 |
Symbol | |
ID | 4901237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3852362 |
End bp | 3853183 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640137172 |
Product | HAD family hydrolase |
Protein accession | YP_001068166 |
Protein GI | 126455443 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
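
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the usual convention that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3852362, 3853183)
print(length)                  # 822
print(protein_length(length))  # 273
```

Under those assumptions the record is internally consistent: 822 bp is 274 codons, one of which is the stop codon, leaving 273 residues.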
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.292051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAAAG CCATCGCCAC CGATCTCGAC GGAACCCTGC TCAATAGCGA CCACCAGCTC GATCCTTATA CGATCGACAC CGTGCGGCGG CTCGCGGACG GCGGCGTGCC CTTCGTGATC GCGACGGGGC GCCACTATGC GGATGTCGCG GGTATTCGCG ACGTGCTCGG CATCCGTCCT TACCTGATCA CGTCGAACGG CGCGCGCGTG CACGGGCCGG ACGACACGCG GATCCACGCG CAGGACGTGC CCGCCGACGC GGTGCGGCAA CTGGTGCGGC CCGAGCTCGT CGGCACGCAT GGCCGGGTGA TCGTCAATCT GTTCACGAAC GACGGCTGGC TGATCGATCG CGATGCGCCG CAACTGCTCG CATTCCATCA GGATTCCGGA TTTCGCTACG AAATCGTCGA TATGCTGGCG CACGACGGCG CGGACATTGC GAAAGTGCTG TACATCGGCG AGCCCGAGGA TCTGGCCGTC GTCTCGGGCA ATCTCGCGCG CCGGTTCGGC GACGCGCTGT ACGTCACGTA TTCGCTGCCC GATTGCCTCG AGGTGATGAC GGCGAATGTA TCGAAGGGGC GCGCGCTGCG CGTCGTGCTC GAGCGGCTCG GCGTCGATCC CGCCCACTGC GTGGCGTTCG GCGACAACAT GAACGATATC GACCTGCTCG AGACGGCCGG GTATCCGTTC ATGATGAACA ACGCGAATCC CGACCTCGCC TTGCGCCTGC CGAAGGTGCC GCGGATCGGC AACAACTTCG AAGCGGGCGT CGCGCGCCAT CTGCGCGCGC TGTTCGCGCT CGAGGATTCC ATCGCGCACT GA
Protein sequence | MYKAIATDLD GTLLNSDHQL DPYTIDTVRR LADGGVPFVI ATGRHYADVA GIRDVLGIRP YLITSNGARV HGPDDTRIHA QDVPADAVRQ LVRPELVGTH GRVIVNLFTN DGWLIDRDAP QLLAFHQDSG FRYEIVDMLA HDGADIAKVL YIGEPEDLAV VSGNLARRFG DALYVTYSLP DCLEVMTANV SKGRALRVVL ERLGVDPAHC VAFGDNMNDI DLLETAGYPF MMNNANPDLA LRLPKVPRIG NNFEAGVARH LRALFALEDS IAH
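
The relationship between the two sequence fields can be sketched in code. The gene sequence as listed starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The example below is a minimal sketch, not the database's own pipeline: it strips the display spacing, translates codon by codon, and computes a GC fraction. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon assignments in TCAG order. Translation table 11 shares these
# amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the whitespace used to display 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First six codons of the gene sequence listed above.
print(translate("ATGTATAAAG CCATCGCC"))  # MYKAIA
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared with the reported GC content.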