Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bxe_B2441 |
Symbol | orf20 |
ID | 4007063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia xenovorans LB400 |
Kingdom | Bacteria |
Replicon accession | NC_007952 |
Strand | - |
Start bp | 686188 |
End bp | 687570 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637950270 |
Product | tetrahydromethanopterin biosynthesis protein |
Protein accession | YP_552900 |
Protein GI | 91777692 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR00284] dihydropteroate synthase-related protein |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.366822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCACA TCGTCTTCCT GACTGGCCGG CTCGCCGAAA AGAGCCTGCT GCGCGTGCTC GAAAGCATGG CGCCCACGCC GTTCACTTGG GAAATCCGCG AAATCGGCCT GCAGGTCGCC GCGCTCATGA CAGCGGATCT GATCCGCCGC CGGGTGCCCG CGCCACTCGC CGCGCAGCGC GTGATCGTGC CCGGCCGCTG CCGCGGCGAT CTCGCCGCGC TGACCGAACA CTACGGTGTG CCGTTCGAGC GCGGTCCGGA AGAGGTCAAG GACCTGCCGC AATTCTTCGG CCGCGAGGCG CAGCCGTTCG ATCTAACGCG TTACGAGACC GACATTTTTG CCGAGATCAT CGATGCACCG CGGCTCGATC TGGACGGCAT CGCCGCGCGG GCGCGCGAGT ATGCCGCGCA GGGTGCCGAT GTGATCGACA TTGGCTGCCT GCCCGAAACG CCGTTTCCGC ATCTCGAAGA CGCGGTGCGC CTGCTGAAAG ACGAAGGGTA TCGCGTAAGC GTCGATTCGA TGAGCGGCGA TGAACTGCTG CGCGGCGGCC GCGCGGGCGC GGATTATCTG CTGAGCCTGA ACCTCGATAC GCTATGGATC GCGGACGAGG TGCCGTCCAC GCCGGTCCTC GTTGCCCGCG AACCGGGCGA CCCCGCCTCG CTCGACGCCG CGATCGACCT CCTCGCCGCG CGCGGGCGCG CGTTTCTCGC CGACCCGATT CTCGATCCGA TTCCGTTCGG CCTCGCCGCA TCGATTGCGC GTTATGTCCG GTTGCGCGAG CGCTACCCGG ACGTCGCCAT CATGATGGGC ATCGGCAACG TGACCGAGTT GACCGAGGCC GACACCAGCG GCATCAACGC GGTCCTGCTC GGCATCGCAG CGGAGCTTCG CGTGAGCGCG GTGCTCACCA CGTCCGTCAG CCTGCATGCG CGGCGCGCGG TGCGCGAAGC GGACGTGGCT CGCCGGGTGA TGCACGCCGC GCGCGAGGCG CAGGTGCTGC CCAAGGGCAT CGACAGCGAT CTCGCGACGG TTCACGCCAA ACGCCCATTT CCATACAGCG CCGCCGAAAT CGACGAGTTC GCTCGCGATA TACGGGATCC TAACTTTCGC GTCCAGATCA GTACCGACGG CATTCATGTC TACAACCGGG ATGGACATCG CAAAGGCCAT GATCCGTTTG CGCTTTACCC GGCGTTGCAT CTGGAAGCCG ATGGCGGACA CGCGTTTTAT ATGGGTGTTC AGTTGGCGCG TGCGGAGATT GCGTGGCGGT TGGGCAAGCG GTTCGATCAG GATCAACCGC TTGACTGGGG GTGCGCGCTC GATCGGAATG AGCAGGATTT GATGGTGTGG CGTGAACCGG GAGAGACCAG GAAGAAGGGG TAA
|
Protein sequence | MDHIVFLTGR LAEKSLLRVL ESMAPTPFTW EIREIGLQVA ALMTADLIRR RVPAPLAAQR VIVPGRCRGD LAALTEHYGV PFERGPEEVK DLPQFFGREA QPFDLTRYET DIFAEIIDAP RLDLDGIAAR AREYAAQGAD VIDIGCLPET PFPHLEDAVR LLKDEGYRVS VDSMSGDELL RGGRAGADYL LSLNLDTLWI ADEVPSTPVL VAREPGDPAS LDAAIDLLAA RGRAFLADPI LDPIPFGLAA SIARYVRLRE RYPDVAIMMG IGNVTELTEA DTSGINAVLL GIAAELRVSA VLTTSVSLHA RRAVREADVA RRVMHAAREA QVLPKGIDSD LATVHAKRPF PYSAAEIDEF ARDIRDPNFR VQISTDGIHV YNRDGHRKGH DPFALYPALH LEADGGHAFY MGVQLARAEI AWRLGKRFDQ DQPLDWGCAL DRNEQDLMVW REPGETRKKG
|
| |