Gene Information |
Locus tag | BURPS668_A2887 |
Symbol | |
ID | 4886030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2741524 |
End bp | 2743224 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640132823 |
Product | halogenase PrnC |
Protein accession | YP_001063879 |
Protein GI | 126443758 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
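The coordinate and length fields above are related by simple arithmetic, which the following sketch checks. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2741524, 2743224)
print(length, protein_length_aa(length))  # 1701 566
```

Both results agree with the Gene Length and Protein Length fields of the record.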
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.939862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAGA AGAGCCCCGC GAACGGGCGC GATAGCCATC ACTTCGACGT GATCATCCTC GGCTCGGGCA TGTCCGGCAC CCAGATGGGA GCCATCCTGG CCAAGCAACA GTTTCGCGTG CTGATCATCG AGGAGTCGTC GCACCCGCGG TTCACGATCG GCGAATCGTC GATCCCCGAG ACGTCTCTCA TGAACCGCAT CATCGCCGAT CGCTACGGCA TTCCGGAGCT CGACCACATC ACGTCGTTCT ACGCGACGCA GCGTTACGTC GCGTCGAGCA CGGGCATCAA GCGCAACTTC GGCTTCGTGT TCCATAAACC CGGCCAGGAG CACGACCCGA GAGAGTTCAC GCAGTGCGTC ATTCCCGAGC TGCCGTGGGG TCCGGAGAGC CACTATTACC GGCAAGACGT CGACGCCTAC CTGTTGCAAG CCGCCATCAA ATACGGCTGC ACGGTTCGCC AGAAGACGAA CGTGACCGAA TACCACGCCG ACAAAAACGG CGTCGCGGTG ACCACCGCCC AGGGCGAACG GTTCACCGGC CGGTACATGA TCGACTGCGG CGGCCCCCGC GCGCCGCTCG CGATCCAGTT CAAGCTCCGC GAAGAGCCGT GTCGCTTCAA GACGCACTCG CGCAGCCTCT ACACGCACAT GCTCGGGGTC AAGCCGTTCG ACGACATCTT CAAGGTCAAG GGGCAGCGCT GGCGCTGGCA CGAGGGGACC TTGCACCACA TGTTCGCGGG CGGCTGGCTC TGGGTGATTC CGTTCAACAA CCACCCGCGC TCGACCAACA ATCTGGTGAG CGTCGGCCTG CAGCTCGACA CGCGTGTCCA CCCGAAAACG GACATCCCCG CTCAGCAGGA ATTCGACGAG TTCCTCGCGC GCTTCCCGAG TATCGGGGCG CAGTTCCGGG ACGCCGTGCC GGTGCGTGAC TGGGTCAAGA CCGACCGCCT GCAATTCTCG TCGCGCGCCT GCGTCGGCGA CCGCTACTGC CTGATGCTGC ACGCGAACGG ATTCATCGAC CCGCTCTTCT CCCGAGGGCT TGAGAACACC GCGGTGACCG TCCACGCGCT GGCAGCGCGC CTGATCAAGG CGCTGCGCGA CGACGACTTC TCCCCCGAGC GCTTCGAGTA CATCGAGCGC CTGCAGCAGA AGCTTCTGGA CCACAACGAC GACTTCGTCA GCTGCTGCTA CACGGCGTTC ACGGACTTCC GCCTGTGGGA CGCGTTCCAC AGGCTGTGGG CGGTCGGCAC GATCCTCGGG CAGTTCCGGC TCGTGCAAGC CCACGCGAGG TTTCGCGCGT CGCGTGACGA GAGCGCGCTC GATCACCTCG ACAACAACCC CCCGTACCTC GGGTACCTGT GCGCGGACAT GGAGGAGTAC TACCAGTTGT TCAACGACGC CAAGGCGGAG GTCGAGGCCG TGAGCGCCGG GCGCAAGCCG GCCGAGGAGG CCGCCGCGCG GATTCACGCC CTCATCGACG AACGAGACTT CGCCCGGCCG ATGTTCAGCT TCGGGTACTG CATCACCGGG GCCAAGCCAC AGCTCAACAA CTCGAAGTAC AGCCTACTGC CGGCGATGAA GCTGTTGCAT TGGACGCAAA CCAGCGCGCC GGCAGAGGTG AAAAAGTACT TCGACTACAA CCCGATGTTC GCGCTGCTCA AGGCGTACGT CACGACCCGC ATCGGCTTGG CGCTGAAATA G
|
Protein sequence | MTQKSPANGR DSHHFDVIIL GSGMSGTQMG AILAKQQFRV LIIEESSHPR FTIGESSIPE TSLMNRIIAD RYGIPELDHI TSFYATQRYV ASSTGIKRNF GFVFHKPGQE HDPREFTQCV IPELPWGPES HYYRQDVDAY LLQAAIKYGC TVRQKTNVTE YHADKNGVAV TTAQGERFTG RYMIDCGGPR APLAIQFKLR EEPCRFKTHS RSLYTHMLGV KPFDDIFKVK GQRWRWHEGT LHHMFAGGWL WVIPFNNHPR STNNLVSVGL QLDTRVHPKT DIPAQQEFDE FLARFPSIGA QFRDAVPVRD WVKTDRLQFS SRACVGDRYC LMLHANGFID PLFSRGLENT AVTVHALAAR LIKALRDDDF SPERFEYIER LQQKLLDHND DFVSCCYTAF TDFRLWDAFH RLWAVGTILG QFRLVQAHAR FRASRDESAL DHLDNNPPYL GYLCADMEEY YQLFNDAKAE VEAVSAGRKP AEEAAARIHA LIDERDFARP MFSFGYCITG AKPQLNNSKY SLLPAMKLLH WTQTSAPAEV KKYFDYNPMF ALLKAYVTTR IGLALK
|
| |
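The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be cleaned, its GC fraction computed, and its reading frame translated is given below. The helper names are hypothetical. The codon table is built from the standard NCBI amino-acid ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = clean_sequence("ATGACTCAGA AGAGCCCCGC GAACGGGCGC")
print(translate(prefix))  # MTQKSPANGR
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to the same functions would allow the reported GC content and protein length to be checked in the same way.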