Gene Information |
Locus tag | Bcep18194_A3951 |
Symbol | |
ID | 3749136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 872774 |
End bp | 874084 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637762230 |
Product | Dyp-type peroxidase |
Protein accession | YP_368194 |
Protein GI | 78065425 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01412] Tat-translocated enzyme; [TIGR01413] Dyp-type peroxidase family |
|
|
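The length fields in the record above are consistent with the listed coordinates. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative, not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 872774
end_bp = 874084

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon not counted.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1311
print(protein_length)  # 436
```

Both results match the Gene Length (1311 bp) and Protein Length (436 aa) fields.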
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0116282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0479981 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGACG ATTCCAACCA GCCACCCCGC CCGACGCGGC GCGGCTTCCT GAAGGCGGGC GGCGCCGCGG TGGCCGCCGG CCTCGCGGCG GCGTCGATCC CGGCCGCGAA GGCCGCGGAT GCACCCGCCG CCGCACCCGC ACCCGCGCCC GCTTCCGCGC ATGACGGCGT CGAGCCGTTC TACGGCAAAC ACCAGAGCGG CATCGCGACG CCGCAGCAGC GCCACGCGTA TTTCGCGGCG CTCGACCTGA CGACGGCCCA GCGCGCGGAC GTCATCGCGC TGCTGAAGAC CTGGACCGAC GCGGCCGCGC GGATGGCGCG CGGCGACACC GCGCTGCCGC TCGCCACGAC GGGTAACGAC GAGGTCGCGC CGGCCGACGG GGGCGACGCG CTCGGCCTCG GCCCTGCGCG CCTGACGATC ACGTTCGGCT TCGGCCCCGG CATGTTCGCG CTCGCCGGCA AGGACCGCTT CGGCCTCGCG AAGCATCGCC CCGCCGCGCT CGTCGACCTG CCGCGCTTCA ATGGCGACCA GTTGCTGCCC GAAAAGACCG GCGGCGACCT GTTCATCCAG GCGTGCGCCG ACGACGCGCA GGTCGCGTTC CACGCGGTGC GCCAGCTCGT GCGCCTCGGC GCGAAGGCGA CGCAGATGCG CTGGGGCCAG GCCGGCTTCA CGTCGGGCAA GCCCGGCGAG ACGCCGCGCA ACCTGATGGG CTTCAAGGAC GGCACGATGA ATCCGCCGAT GTCCGATCCG GCCGCAATGG ATGAATTCGT GTGGGCCGGC AGCGAAGGCC CGGCATGGAT GAACGGCGGC ACCTACACGG TCGTGCGCCG GATCCGCATC ACGCTCGAGC ACTGGGACAA CACGGAGCTC GGCTTCCAGG AGCAGGTGGT CGGCCGTCAC AAGTACAGCG GCGCACCCCT CGGCCAGAAG CACGAGTTCG AGGCGCTCGA TCTCGACGCG GCCGACAAGG ACGGCAACCC GGTGATCCCC GACAACGCGC ACGCGCGCCT CGCATCGCCG CAGTTGAACA ACGGCGCGCA GATCCTGCGC CGCGCGTACT CGTACAACGA CAGCACGAGC TTTTACATCG AGCGCTGGCC GCCGTGGCGC CAGCAGACCG AGTACGACGC GGGGCTGATG TTCGTCGCGC ACCAGCGCGA CCCGCGCAAG GGCTTCATTC CGATCAACGA GAAGCTCGCA AAGATGGACA TCATGAACCA GTTCACCACG CACGTCGGCA GCGCGATCTT CGCGTGCCCG CCGGGCGCGC AACCGGGTTC GTACATCGGC GCCGCGCTGT TCGAGGCATG A
|
Protein sequence | MADDSNQPPR PTRRGFLKAG GAAVAAGLAA ASIPAAKAAD APAAAPAPAP ASAHDGVEPF YGKHQSGIAT PQQRHAYFAA LDLTTAQRAD VIALLKTWTD AAARMARGDT ALPLATTGND EVAPADGGDA LGLGPARLTI TFGFGPGMFA LAGKDRFGLA KHRPAALVDL PRFNGDQLLP EKTGGDLFIQ ACADDAQVAF HAVRQLVRLG AKATQMRWGQ AGFTSGKPGE TPRNLMGFKD GTMNPPMSDP AAMDEFVWAG SEGPAWMNGG TYTVVRRIRI TLEHWDNTEL GFQEQVVGRH KYSGAPLGQK HEFEALDLDA ADKDGNPVIP DNAHARLASP QLNNGAQILR RAYSYNDSTS FYIERWPPWR QQTEYDAGLM FVAHQRDPRK GFIPINEKLA KMDIMNQFTT HVGSAIFACP PGAQPGSYIG AALFEA
|
| |
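The gene and protein sequences can be checked against each other by translating codons in frame. The sketch below is illustrative only: it uses a deliberately partial codon table covering just the codons in the first 30 and last 21 bases of the gene sequence above, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 assigns these codons the same amino acids as the standard code, so no table-specific handling is needed for the excerpt.

```python
def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()

# Partial codon table: only the codons that occur in the two excerpts below.
CODONS = {
    "ATG": "M", "GCA": "A", "GAC": "D", "GAT": "D", "TCC": "S",
    "AAC": "N", "CAG": "Q", "CCA": "P", "CCC": "P", "CGC": "R",
    "GCC": "A", "GCG": "A", "CTG": "L", "TTC": "F", "GAG": "E",
    "TGA": "*",  # stop codon
}

def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODONS[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three and last three blocks of the gene sequence in the record.
head = clean("ATGGCAGACG ATTCCAACCA GCCACCCCGC")
tail = clean("GCCGCGCTGT TCGAGGCATG A")

print(translate(head))  # MADDSNQPPR
print(translate(tail))  # AALFEA
```

The two outputs match the first ten and last six residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give the value to compare with the GC content field; translating the full gene would require a complete codon table.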