Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_1589 |
Symbol | |
ID | 4892050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009080 |
Strand | + |
Start bp | 1577437 |
End bp | 1578405 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640150246 |
Product | endo/excinuclease domain-containing protein |
Protein accession | YP_001081133 |
Protein GI | 126451215 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2827] Predicted endonuclease containing a URI domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.942614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTGGT ATCTGTATCT GATCGAGTGC GCGGACGGCA GCGTCTACAC CGGCATCACG ACCGACGTCG CCGCCCGCTT CGACAAGCAC GCGCAAGGCG CGGGCGCCCG CTACACGCGC GCGCGCAAGC CGCGCGCGGT GCTCGCGTCG TTCGAGCTCG CCGATCGCTC GAGCGCGTCG CGCGCCGAGT ATTGGGTGAA GCGGCTCACC GCCGTGCAGA AGCGCGAGCT GGCAGCGGGC ACACGCACGC TCGAGTCGGT GCTGCCGGCG GCGGCCGGCG TGCGAAGCGA TGCGGCGGCG GGGCGCGATG CGGGGGCCCA GGTAGCTGCT GCGGCGGCGG CACAGAGGGA GGGCGCGCCC GTCACGGGGC GCGTGAAGCG CGTCGACGCC GAACGGGCGT CGGGCACGCC GACGCCGCGA TCTCCTCGCC GCGCCACTCG GGCGACGGCC GAGGCGCTCG ATGCGGGCAC GCCGGCTGAC GAGGGTGAGG CGACGCAAGC GCCAAAGGAA AGGAAGGCGA CGGAGGCGGC GAAAACTTCG AAGACGGTGA GATTGTCGAA GGCACCGAAG CCGTCGAAGC CGTCGAAGCC GTCGAAGCCG TCGAAGCCGT CGAAGCCGTC GAAGCCGTCG AAGCCGTCGA AGCCGTCGAA GCCGTCGAAG CCGTCGAAGC CGTCGAAGCC GTCGAAGCCG TCGAAGCCGT CGAAGCCGTC GAAGCCGTCG AAGCCGTCGA AGCCGTCGAA GCCGTCGAAG CCGTCGAAGC CGTCGAAGCC GTCGACACCA CCGAAGCCAC CGAAGCCGTC GACGCAGATC GCGGCGACAG CCGCCTCCCG ACGAACGAAA ACCGTCCGCG CGCCTGCGGC GAGCACGACC GCGCATGCGC ATGCGAACGC CGCCCCGAAC GCGGGCACCG CCGTATCCGC CGCGCCGCCG CGCGGCGCGC GCGCCAAACA AAACCGCGCG GCCTCGTGA
|
Protein sequence | MSWYLYLIEC ADGSVYTGIT TDVAARFDKH AQGAGARYTR ARKPRAVLAS FELADRSSAS RAEYWVKRLT AVQKRELAAG TRTLESVLPA AAGVRSDAAA GRDAGAQVAA AAAAQREGAP VTGRVKRVDA ERASGTPTPR SPRRATRATA EALDAGTPAD EGEATQAPKE RKATEAAKTS KTVRLSKAPK PSKPSKPSKP SKPSKPSKPS KPSKPSKPSK PSKPSKPSKP SKPSKPSKPS KPSKPSKPSK PSKPSKPSTP PKPPKPSTQI AATAASRRTK TVRAPAASTT AHAHANAAPN AGTAVSAAPP RGARAKQNRA AS
|
| |