Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2735 |
Symbol | |
ID | 4904436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 2668218 |
End bp | 2669918 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640145838 |
Product | halogenase PrnC |
Protein accession | YP_001076765 |
Protein GI | 126457167 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAGA AGAGCCCCGC GAACGGGCGC GATAGCCATC ACTTCGACGT GATCATCCTC GGCTCGGGCA TGTCCGGCAC CCAGATGGGA GCCATCCTGG CCAAGCAACA GTTTCGCGTG CTGATCATCG AGGAGTCGTC GCACCCGCGG TTCACGATCG GCGAATCGTC GATCCCCGAG ACGTCTCTCA TGAACCGCAT CATCGCCGAT CGCTACGGCA TTCCGGAGCT CGACCACATC ACGTCGTTCT ACGCGACGCA GCGTTACGTC GCGTCGAGCA CGGGCATCAA GCGCAACTTC GGCTTCGTGT TCCATAAACC CGGCCAGGAG CACGACCCGA AAGAGTTCAC GCAGTGCGTC ATTCCCGAGC TGCCGTGGGG TCCGGAGAGC CACTATTACC GGCAAGACGT CGACGCCTAC CTGTTGCAAG CCGCCATCAA ATACGGCTGC ACGGTTCGCC AGAAGACGAA CGTGACCGAA TACCACGCCG ACAAAAACGG CGTCGCGGTG ACCACCGCCC AGGGCGAACG GTTCACCGGC CGGTACATGA TCGACTGCGG CGGCCCCCGC GCGCCGCTCG CGATCCAGTT CAAGCTCCGC GAAGAGCCGT GTCGCTTCAA GACGCACTCG CGCAGCCTCT ACACGCACAT GCTCGGGGTC AAGCCGTTCG ACGACATCTT CAAGGTCAAG GGGCAGCGCT GGCGCTGGCA CGAGGGGACC TTGCACCACA TGTTCGCGGG CGGCTGGCTC TGGGTGATTC CGTTCAACAA CCACCCGCGC TCGACCAACA ACCTGGTGAG CGTCGGCCTG CAGCTCGACA CGCGTGTTCA CCCGAAAACG GACATCCCCG CGCAGCAGGA ATTCGACGAG TTCCTCGCGC GCTTCCCGAG TATCGGGGCG CAGTTCCGGG ACGCCGTGCC GGTGCGTGAC TGGGTCAAGA CCGACCGCCT GCAATTCTCG TCGCGCGCCT GCGTCGGCGA CCGCTACTGC CTGATGCTGC ACGCGAACGG ATTCATCGAC CCGCTCTTCT CCCGAGGGCT CGAGAACACC GCGGTGACCA TCCACGCGCT GGCGGCGCGC CTGATCAAGG CGCTGCGCGA CGACGACTTC TCCCCCGAGC GCTTCGAGTA CATCGAGCGC CTGCAGCAGA AGCTTCTGGA CCACAACGAC GACTTCGTCA GCTGCTGCTA CACGGCGTTC ACGGACTTCC GCCTGTGGGA CGCGTTCCAC AGGCTGTGGG CGGTCGGCAC GATCCTCGGG CAGTTCCGGC TCGTGCAAGC CCACGCGAGG TTTCGCGCGT CGCGTGACGA GAGCGCGCTC GATCACCTCG ACAACAACCC CCCGTACCTC GGGTACCTGT GCGCGGACAT GGAGGAGTAC TACCAGTTGT TCAACGACGC CAAGGCGGAG GTCGAGGCCG TGAGCGCCGG GCGCAAGCCG GCCGAGGAGG CCGCCGCGCG GATTCACGCC CTCATCGACG AACGAGACTT CGCCCGGCCG ATGTTCAGCT TCGGGTACTG CATCACCGGG GCCAAGCCAC AGCTCAACAA CTCGAAGTAC AGCCTACTGC CGGCGATGAA GCTGTTGCAT TGGACGCAAA CCAGCGCGCC GGCAGAGGTG AAAAAGTACT TCGACTACAA CCCGATGTTC GCGCTGCTCA AGGCGTACGT CACCACCCGC ATCGGCTTGG CGCTGAAATA G
|
Protein sequence | MTQKSPANGR DSHHFDVIIL GSGMSGTQMG AILAKQQFRV LIIEESSHPR FTIGESSIPE TSLMNRIIAD RYGIPELDHI TSFYATQRYV ASSTGIKRNF GFVFHKPGQE HDPKEFTQCV IPELPWGPES HYYRQDVDAY LLQAAIKYGC TVRQKTNVTE YHADKNGVAV TTAQGERFTG RYMIDCGGPR APLAIQFKLR EEPCRFKTHS RSLYTHMLGV KPFDDIFKVK GQRWRWHEGT LHHMFAGGWL WVIPFNNHPR STNNLVSVGL QLDTRVHPKT DIPAQQEFDE FLARFPSIGA QFRDAVPVRD WVKTDRLQFS SRACVGDRYC LMLHANGFID PLFSRGLENT AVTIHALAAR LIKALRDDDF SPERFEYIER LQQKLLDHND DFVSCCYTAF TDFRLWDAFH RLWAVGTILG QFRLVQAHAR FRASRDESAL DHLDNNPPYL GYLCADMEEY YQLFNDAKAE VEAVSAGRKP AEEAAARIHA LIDERDFARP MFSFGYCITG AKPQLNNSKY SLLPAMKLLH WTQTSAPAEV KKYFDYNPMF ALLKAYVTTR IGLALK
|
| |