Gene Information |
Locus tag | BURPS1106A_A1397 |
Symbol | |
ID | 4903780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1362536 |
End bp | 1364194 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640144503 |
Product | putative halogenase |
Protein accession | YP_001075431 |
Protein GI | 126456710 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.666039 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACA ATCAGGTCAG GAAATACGAC GTCGTCATCA TCGGGACGGG CATCGGCGGC ACGACGCTCG GCGCGATCCT CGCGCGGCAC GGGCTGCGGG TCGCGATGAT CGATTCCGGC ACGCATCCGC GCTTTGCCGT CGGCGAATCG ACGATCGCCA CGACCACGCT GACGCTCGAG CTGATGGCGA TGCGCTTCGA TGTGCCGGAG CTCAAGCACA TCACGTCGAT CGCCGAAGTG AGCGAGAACG TGATGCCGTC GTGCGGCGTG AAGCGCAACT TCGGCTTCGT GTATCACCGC GAGCACACCG AGCAGAATCC GCAGGAGGTC AATCAGGCGC TCGTCGTCAA CGAGGTGCAC TATTTCCGGC AGGACATCGA CGCGTACATG CTGCACGTGG CCATTCGCTA CGGCTGCGAC GCGTATCAGA ACACCGTCGT CGACGATATC CGGATCGACG CCGGCGGCGT GACGGTGACG ACGCGCGGCG GCCTCACGTT CGAGGCGGAT TTCGTCGCCG ACGGCGCGGG GTACCGCTCG GTGCTGGCCG ACAAGCTCGG CCTGCGCGAG ACGCCGTGCC GCGCGAAGAC GCATGCGCGC GGCCTGTTCA CGCACATGAT CGACGTGAAG CCGTTCGACG CCTGCCGCGA GGTGCCCAAG GCGCTGCAGC AGCCGGTGCC GTGGCATCAG GGGACGCTGC ACCACCTGTT CGACGGCGGC TGGATGTGGG TGATTCCGTT CAACAACACG CCGGAATCGA AGAACCCGCT CGTGAGCGTC GGCCTGATGC TCGATCCGCG CAAGCATCCG AAGCCGGACG TGCGGCCCGA GCAGGAATTC GCCGACTTCA TCGCGAAGCA TCCGGACATG GCGCGGCAGT TCGCCGATGC GCGCGCGGTG CGCGAATGGG TGTCCTCGGG CCGCATCCAG TACAGCGCGA GCGCATGCAC GGGCGACCGG TTCTGCCTGC TCTCGCATGC GACGGGCTTC ATCGATCCGC TGTTCTCGCG CGGCCTGTTC AACACGATGC AGACGACCAA CGCGCTCGCG GGGCTGCTGA TCGAAGCCGC AAAGGACCGC GATTTCAGCA AGGCGCGCTT CGCGCCGGTC GAGAAGCTCC AGCAGGGCCT GATCGATTTC AACGATCGGC TCGTCAACTG CTCGTACCTC TCGTGGGGCC ACTATCCGCT CTGGAACGCG TGGTTCCGCC TGTGGCTGCT CACCGGCAAC TACGGCCAGC TTCACCTGCA GCGCGTGATG ATGAAGTACC GGCAAACCGG CGACGCGCGC TGGCTCGAGC CGGCCGACGC GCTGTTGCCG GGCGCGTTCA CCACGCTCGA GCCGATCATG CGGCTGTTCG AGGAGGCGGC GGTGTGCGTC GAGCGGTACG GCGCGGGCGA ACTCTCGGGC GAGGCGGCCG AGCGGGCGAT CTACGCGCTG CTCGAGGAGA ACGCCGCGCT GCTGCCGCCG TTCTTCGATT TCGTTTCGCC CGCCGAGCGG ATCACCTGGC CGAGCACGCC CGAGAAGATC GCCGCGCTGC TGCTCGAGTG GGTCGAGCGG CTGCCGGAGG ACGTGCGGGC GGAATACTTC GACTACGACG TGCGGGCGCT GCTCCAGCAG CCGGTCGTCA AGGACACGAT CACCGCGGAC GTCGCGTGA
|
Protein sequence | MSNNQVRKYD VVIIGTGIGG TTLGAILARH GLRVAMIDSG THPRFAVGES TIATTTLTLE LMAMRFDVPE LKHITSIAEV SENVMPSCGV KRNFGFVYHR EHTEQNPQEV NQALVVNEVH YFRQDIDAYM LHVAIRYGCD AYQNTVVDDI RIDAGGVTVT TRGGLTFEAD FVADGAGYRS VLADKLGLRE TPCRAKTHAR GLFTHMIDVK PFDACREVPK ALQQPVPWHQ GTLHHLFDGG WMWVIPFNNT PESKNPLVSV GLMLDPRKHP KPDVRPEQEF ADFIAKHPDM ARQFADARAV REWVSSGRIQ YSASACTGDR FCLLSHATGF IDPLFSRGLF NTMQTTNALA GLLIEAAKDR DFSKARFAPV EKLQQGLIDF NDRLVNCSYL SWGHYPLWNA WFRLWLLTGN YGQLHLQRVM MKYRQTGDAR WLEPADALLP GAFTTLEPIM RLFEEAAVCV ERYGAGELSG EAAERAIYAL LEENAALLPP FFDFVSPAER ITWPSTPEKI AALLLEWVER LPEDVRAEYF DYDVRALLQQ PVVKDTITAD VA
|
| |
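The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length; both assumptions agree with the values reported above. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the record above.
START_BP = 1362536
END_BP = 1364194
REPORTED_GENE_LENGTH = 1659
REPORTED_PROTEIN_LENGTH = 552


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print("gene length matches record:", length == REPORTED_GENE_LENGTH)
print("protein length matches record:",
      protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

To compare against the GC content field, pass the full Gene sequence string to `gc_percent`; the spaces between the ten-base blocks are stripped before counting.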