Gene Information |
Locus tag | BTH_II1637 |
Symbol | |
ID | 3846673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1920321 |
End bp | 1921898 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637838938 |
Product | di-haem cytochrome c peroxidase family protein |
Protein accession | YP_439831 |
Protein GI | 83717380 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
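The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1920321
END_BP = 1921898
GENE_LENGTH_BP = 1578
PROTEIN_LENGTH_AA = 525


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons are True for this record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1921898 - 1920321 + 1 gives 1578 bp, and 1578 / 3 - 1 gives 525 aa, which agrees with the Gene Length and Protein Length fields.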
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.249586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCACCACG CCGGCCGTCG CTCCGGCGCG GCGCGCGCCG GATTTTTCTT TTGCACGATC ACGACCGAAC GCTCCAGCAT GGCCGAACCG CTTGCCGCGC AATCCGCACC GCCCTCACGA TCCGACGATT CCGTCCGGCC TCCCCGCTCG GCCCGAGTCG ACGCATCCGC GTCGACGGCG CCTCCCGCCG CGCCCCCTGA TTCGCCTCAT TCGCCCCGCC GCCGCGCGCG CGCGGCGCGC CACGTGCTGA AGACCGCCGC ATTCGGCGCG CTCGGCTTCG CCGCATTCGC GCTCGCGTTT CCCGAGCACG CGCCGAATGC GCTCGGCGCG ATCGTCGAGG ATCTGACGGG CGCGAATCCG CATCCGGTCG CGCTGCGCCG CCCGGCCGCC GAGCCGTTGA GCGCGGTCGC GCAGCTCGGC CGGGCGCTGT TCTTCGATCC GTCGCTGTCC GCGTCGGGCC GACAGTCATG CGCGTCGTGC CACAGCCCCG ACCGGGCATA CGGCCCGCCG AACGATCTCG ACGTGCAGTT GGGCGGTGCC GCGCTGACGC GGCCCGGCTA CCGGCCGCCG CCGTCGCTGA TGTATCTGTA CCGCCAGCCG AACTTCAGCA TCGGCCCCGA CTCGTCGGAG AACGACGATG CGGCGAGCGT CGCGCAGCAG GCGGCGTCGG CCGCGGGCGT CGTGCGCGCG CGGAAGACCG CCGGCGCGGC CGCCGCACCG CAGCTCGTGC CGCAAGGCGG AATGTTCTGG GACGGCCGCG CGGACACGCT GCAGCAGCAG GCGTTCGGCC CGCTGATGAA TCCGGTCGAG ATGGCGAACG CGAGCACGGC CGACGTCGCG CGCAAGCTCG AGAACGCGCG GTACGCGCCG CAGTTCCGGC AGTTGTTCGG CCCGCGCATC TTCGACGACG CGCGTCTTGC GGTGTCCGAA GCGATGTTCG CGATCGCGCG CTACCAGGTC GAGGACCCGT CGTTCCATCC GTATTCGAGC AAGTACGACC GCTGGCTCGA AGGCGACGCG CGGCTCACGC GGGCCGAGCT GCGCGGCATG CGGCTCTTCA ACGATCCGGC GAAGGCGAAT TGCGCGGGCT GCCATCTGTC GAAGCCGAGC CCGGACGGCC TGCCGCCGAT GTTCACCGAT TTCCAGTACG AAGCGCTCGG CGTGCCCCGC AACCGCGCGC TCGCGCAGAA CCGCAATCCG GCGTTCCACG ATCTCGGCAT TTGCGGGCCG TTGCGCGACG ACCTGAAAAC GCAGACGCAA TACTGCGGGA TGTTCGCGAC GCCGTCGCTG CGCAACGTCG CGACGCGTCA CGTGTTCTTC CACAACGGCG TCTATCACTC GCTCGATCAG GTGCTCGCGT TCTACAACCT GCGCAGCGTC GATCCGGGCA AGATCTACCC GCGCGACGCG AGCGGCCGGG TGCGGCAGTA CGACGACATC CCGAGCGCGT ATCGCGCGAA CGTCGACGTC GCCGACGCGC CGTTCGACCG CAAGCCGGGC GACGCGCCGG CGATGACCGC GCAGGACATG CGCGACATCG TCGCGTTCCT GAACACGCTG ACCGACGAGA AGCGCTGA
|
Protein sequence | MHHAGRRSGA ARAGFFFCTI TTERSSMAEP LAAQSAPPSR SDDSVRPPRS ARVDASASTA PPAAPPDSPH SPRRRARAAR HVLKTAAFGA LGFAAFALAF PEHAPNALGA IVEDLTGANP HPVALRRPAA EPLSAVAQLG RALFFDPSLS ASGRQSCASC HSPDRAYGPP NDLDVQLGGA ALTRPGYRPP PSLMYLYRQP NFSIGPDSSE NDDAASVAQQ AASAAGVVRA RKTAGAAAAP QLVPQGGMFW DGRADTLQQQ AFGPLMNPVE MANASTADVA RKLENARYAP QFRQLFGPRI FDDARLAVSE AMFAIARYQV EDPSFHPYSS KYDRWLEGDA RLTRAELRGM RLFNDPAKAN CAGCHLSKPS PDGLPPMFTD FQYEALGVPR NRALAQNRNP AFHDLGICGP LRDDLKTQTQ YCGMFATPSL RNVATRHVFF HNGVYHSLDQ VLAFYNLRSV DPGKIYPRDA SGRVRQYDDI PSAYRANVDV ADAPFDRKPG DAPAMTAQDM RDIVAFLNTL TDEKR
|
| |
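The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first ten and last six codons of the gene sequence. The codon dictionary is deliberately partial (only the codons that occur in these two fragments), the set of start codons is a subset of those table 11 allows, and the helper name `translate_fragment` is hypothetical.

```python
# First 30 and last 18 nucleotides of the gene sequence, spaces removed.
FIRST_CODONS = "GTGCACCACGCCGGCCGTCGCTCCGGCGCG"
LAST_CODONS = "ACCGACGAGAAGCGCTGA"

# Partial codon table: only the codons present in the two fragments.
CODON_TABLE = {
    "GTG": "V", "CAC": "H", "GCC": "A", "GGC": "G", "CGT": "R",
    "CGC": "R", "TCC": "S", "GCG": "A", "ACC": "T", "GAC": "D",
    "GAG": "E", "AAG": "K", "TGA": "*",
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_fragment(dna: str, is_cds_start: bool = False) -> str:
    """Translate a DNA fragment codon by codon, stopping at a stop codon.

    If is_cds_start is True and the first codon is a permitted start
    codon, it is read as M regardless of its usual meaning.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


head = translate_fragment(FIRST_CODONS, is_cds_start=True)
tail = translate_fragment(LAST_CODONS)
print(head)  # MHHAGRRSGA
print(tail)  # TDEKR
```

The two results match the first ten and last five residues of the protein sequence listed above. A full translation would need the complete table 11 codon set, for example from a library such as Biopython.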