Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1092 |
Symbol | |
ID | 3844417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1269685 |
End bp | 1271316 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637838395 |
Product | di-haem cytochrome c peroxidase family protein |
Protein accession | YP_439289 |
Protein GI | 83717163 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.684942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGGGCC GCAGGCGAAG CAGCGCGGCG CGGCGCGCGG TGCCCGCCGC AGCGCTCACG GCGCGCGCGT CCGTCACCGC CTTCGCGCTC GGCCCGGCCG CGCTCGGCAC GACGAGGACG CGCCGCACCG TGCTGACCCT CTCGACCATC GTGCGGGCGG CATGCGGCGC GCTCGCCGCG TGCGCATCGG CGATCGCGCT CGCGTCCGGC CCGGCCGCGC CGGACGCGAC ACACTCGACA CACGCGGCTC GCCCGGCGAG CGCCGCAAGC TCAACGAGTC CCATGAACCC GACGAGCACG CCGGGCGCGT CCGGCCCGGC CCATGCGAAA GCCGCGCTAG ACGCCGCCCG CGCCAAGGCC GCCCCGCCCT CGCCGCCGAC GACGGTCCTG CTCCCCGGCG CGCCGCCTGA GCGCGTCGTC GACACGATCG GCCGCGGCAC GCCGCAGGTC GCGTCGAAAG TCGACCCGAC GGCGGCCGTG TTCCGCCCGG ACCCGACGCT CGCCGCGCTC GGCAAGCGCG TGTTTTTCGA TCCGGCATTG TCGGAGCCGC GCGGCATGTC GTGCGCAAGC TGCCACGATC CGGGCCGCGC GTTCGCGCCG ACGCTCTCTC CCGCGGCGCT CGCCGGCCCG CGCGTGCCGC AGGGCAGCCG GCCCGGCCAT TTCAGCCGCC GCAACGCGCC GTCGCTGCTG TACGTGCGCT ACGTGCCGCG CCGCCACTTC TATCAGGACG ACGACGCGCT CGCGCCCGCG CCGTTCGGCG GCCTGTTCTC GGACGGCCGC GCCGACACGC TCGCCGAGCA GTTGCGCGGC CCGCTCTTCG ATCCGGACGA GATGAACAAC GCGTCGGCCG CGGCGCTGAT GCGCAAGATC GGCCGCACCG GGCTCGGCGC GGCGCTCGCC GGGCGCTTCG GCCCATCGGT GCGCCGCGAT CCCGAGCGGA TGGTGCGCGT GCTCGGCGAA GCCATGCAGG CGTACCTGCA AAGCGACGAA ATGGCGCCCT TCTCGTCGCG CTACGACGCG TACGTGACGA AGCGCGCACC GCTCACGCCG CAGGAGATGC GCGGGCTCGC GCTCTTCAGG AATCCGGACA AGGGCAACTG CATGAGCTGC CACACGCTGT CGGACACCGC GAGCCGGCCC GAGCGCTCGC TCTTCACCGA CTTCGGCTAC GACGCGATCG CGGTGCCGCG CAATCGCGCG CTGCCCGCGA ACCGCGACCC GCGCCACTTC GACAACGGCC TGTGCGACAC CGCCGCGAAA CTGCGCTGGC CCGAGCCGAC ACAATGGTGC GCTTACCTGC GCACGCCGGG GCTGCGCAAC GTCGCGATCA AGGAATCGTT CATGCACAAC GGCGTGTTCG ACACGCTGCG CGACGCGGTC GCGTTCTACA ACACGCGCTC GACCGATCCG GCACGCTGGT ACCACGGCCG CGACACGTTC GACGACGTGC CGCGTGCGTA TCGCGGCAAC GTCAACGTGA ACTCGACGCC GATGAACCGC CGCCCCGGCA CGCCGCCCGC GATGACGGAC GCCGACGTCG ACGATCTCGT CGCCTTCTTG CGCACGCTGA CGGACGCGCG CTACGTCGGG CTGATGCCCA CGGCGCCGGA CGGCAAGGCG GCGCGGCCGT GA
|
Protein sequence | MTGRRRSSAA RRAVPAAALT ARASVTAFAL GPAALGTTRT RRTVLTLSTI VRAACGALAA CASAIALASG PAAPDATHST HAARPASAAS STSPMNPTST PGASGPAHAK AALDAARAKA APPSPPTTVL LPGAPPERVV DTIGRGTPQV ASKVDPTAAV FRPDPTLAAL GKRVFFDPAL SEPRGMSCAS CHDPGRAFAP TLSPAALAGP RVPQGSRPGH FSRRNAPSLL YVRYVPRRHF YQDDDALAPA PFGGLFSDGR ADTLAEQLRG PLFDPDEMNN ASAAALMRKI GRTGLGAALA GRFGPSVRRD PERMVRVLGE AMQAYLQSDE MAPFSSRYDA YVTKRAPLTP QEMRGLALFR NPDKGNCMSC HTLSDTASRP ERSLFTDFGY DAIAVPRNRA LPANRDPRHF DNGLCDTAAK LRWPEPTQWC AYLRTPGLRN VAIKESFMHN GVFDTLRDAV AFYNTRSTDP ARWYHGRDTF DDVPRAYRGN VNVNSTPMNR RPGTPPAMTD ADVDDLVAFL RTLTDARYVG LMPTAPDGKA ARP
|
| |