Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3820 |
Symbol | |
ID | 4883307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 3724605 |
End bp | 3725492 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640129748 |
Product | tetrapyrrole methylase family protein |
Protein accession | YP_001060814 |
Protein GI | 126439138 |
COG category | [R] General function prediction only |
COG ID | [COG0313] Predicted methyltransferases |
TIGRFAM ID | [TIGR00096] probable S-adenosylmethionine-dependent methyltransferase, YraL family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.119622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTCCC TCCTCGATCT CGCGCAGGCG CAGCACTATC CGGCAGGCGC CCTCTACATC GTCGCGACGC CGATCGGCAA CGCCGCCGAC ATCACACTGC GCGCGCTGCA CGTGCTCACG CTCGCCGATC GCATCGCCGC CGAGGACACC CGCAACACGG GCCAACTGCT CGTGCGCTAC GGAATCTCGA AGCCGCTCGT CGCCGTGCAC GAGCACAACG AACGCGCGGC CGCCGCGAAG CTGATCGATC ACCTGCGCGC GGGCGAGCGG ATCGCCTATG TGTCGGACGC CGGCACACCC GGCATTTCCG ATCCGGGCGC AAAACTCGTC GACGCCGTGC GCGCCGCCGG CTTCGGCGTG ATCCCGCTGC CGGGCGCGAA CGCGGCCGCG GCGGCGGTCA GCGTCGCGGG CGACTGGGCG GGCGCCTTCA CGTTCGCGGG TTTCCTGCCG CCGAAGCCGA AGCAGCGCGA CGCCGCGCTC CAGCCGCTGA AGACGCATCC GTACGCGCTC GTGTTCTACG AGGCGCCGCA CCGGATCGTC GAGACGGTCG AAGCGCTCGC GGCGGCGCTC GGCGGCGAAC GCCGGCTGTT GATCGCGCGC GAGCTCACCA AGCTACACGA GGAGTTGTTC GAGGGCACGC TGGCTGATGG GCCAACGTGG CTGCGCGCGG ACGCGAACCG GCAGCGCGGC GAATTCGTGC TTGTCGTCGA GGGCGCGCCG CAGGGCGCGC AGGACGAAGA CGAACGCGCG CACGACGCGC TGCTCGAGCT GCTGCTCGAC GAAGTGCCGG TGAAGAGCGC GGCGAAGCTC GCGGCGGCGC TAACGGGGGC GTCGCGCAAC GCGCTCTACG CGCGCGCACT CGCGCTGAAG AAAGAGGAAG AAGAGTAA
|
Protein sequence | MTSLLDLAQA QHYPAGALYI VATPIGNAAD ITLRALHVLT LADRIAAEDT RNTGQLLVRY GISKPLVAVH EHNERAAAAK LIDHLRAGER IAYVSDAGTP GISDPGAKLV DAVRAAGFGV IPLPGANAAA AAVSVAGDWA GAFTFAGFLP PKPKQRDAAL QPLKTHPYAL VFYEAPHRIV ETVEALAAAL GGERRLLIAR ELTKLHEELF EGTLADGPTW LRADANRQRG EFVLVVEGAP QGAQDEDERA HDALLELLLD EVPVKSAAKL AAALTGASRN ALYARALALK KEEEE
|
| |