Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3149 |
Symbol | |
ID | 4885453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 3086294 |
End bp | 3088465 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640129077 |
Product | hypothetical protein |
Protein accession | YP_001060161 |
Protein GI | 126440350 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG2358] TRAP-type uncharacterized transport system, periplasmic component [COG3917] 2-hydroxychromene-2-carboxylate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCAG AACCCACCCG CCGCCGGCCG CACCGCATCG TCGCCCGCTT CGTCGCGGTG TCGTGGCGCG ATCTCGCGAT GTCGATCGGG CCGACCGTCG TGCTGTCGAT CGCCGCGGTC TGGCTCGCGA TCGCGCTGAT CCAGCCCGCG CCGCCGACGT CGCTCACGAT CTCCGCCGGC CCGCCCGGCA GCACGAACTG GCGCTCGGCG CAGCGCTACA AGCAGATCCT GTCGAAAAAC GGCGTGACGC TGCGCGTACT CGAATCCGAG GGCTCGGCCG AAAATCTCGC ACGGCTGTCG GACCCCGCGC AGAAGGTCGA TGTCGGCTTC GTGCAAAGCG GCATCGAGCA GAAGGGAAAG CACGAGGATC TCGTGTCGCT CGGCAGCGTC GGCTACGTGC CGCTCGCGAT CCTGTATCGC GGGCCCGTGA TCGAGCGGCT GTCGCAGTTC AAGGGCAAGC GGCTCGCGCT CGGCGCCGAG GGCGCGGGCG CGCACGAGCT CGGCCTCGCG CTGCTGAAGA TGAACGGCAT CGTGCCGGGC GGCCCGACCC CGCTGCTGCC GCTGTCCGGC GAGGACGCCG CGCGCGCGCT GACGGAAGGC CGCATCGACG CCGCGTTCCT GTCCGGCGAT TCGACGCAGA TTCCGGTGAT GGCCAAGCTG TTTCGCACGC CCGGCGTGCA CTTCTATTCG TTCACGCAGG CCGAGGCGTA CACGCGGCGC GTTGCATACC TGACCGACAT CACGCTGCCG ATGGGCGTCT ACGATCCGGG CACGAACCTG CCGCCGTCGG ACATCCACAC GCTGTCGCCC ACCGTCGAGC TGATCGCGCG CGACACGCTC CACCCGGCGC TGTCGGACCT GCTCATCGAG GCGGCGCGCG ACGTGCACGG CCGTGCGACG ATCCTGCAGC GCGCGGGCGA ATTTCCGTCG CCCGTCACGC ACAGCAGCTT CCTGTTGTCC GACGACGCCG CACGCTACTA CAAGTCGGGC AAGACCTTCC TGTACCGGAA GCTGCCGTTC TGGGTGGCGA GCCTCGTCGA CCGGCTGCTC TTCATCGTCG TGCCGCTCGT CGTCGTGCTG ATTCCGGGGC TGCGGCTCGT GCCGACGCTG TACGGCTGGC GCGTGCGCTC GCGGATCTAC CGGTGGTACG GCGCGCTCAT CGCGCTCGAG CGCAGCGCGC TCGGCGAACA TACCGCGCAA GAGCGCGTCG TGCTGCTCGA CAAGCTCGAC GACGTCGAGG AATCAGTCAA CCGGATGAAG ATGCCGCTCG CGTACGCCGG ACAGTTCTAC GTGCTGCGCG AGCATATCGG CTTCGTTCGC GGGCGGCTGC TCGCGCGCGA TTACGAGACG CCGCAGCCCG CCGCGGCGAC ACCGCCCGCC GCGCCCCCCC CCTGGGGGCG CCGCCCGCCG GTTCGCGTCA GCCGGGCGAC GCTTGAGCCG AAAATCCGTC GCGGCGGCTG CATTTGCCCG CGCCGCCGCG TAACATTCAA CAAAGAGGCC GCGCGTTGCG TCCTATTCCT CAGGAGACTC TCCATGACCG TCGGCCTCGA CGCCTCCCAG CCGATATGGT TCTACGACTT CCTGTCGCCG TTCTCGTATC TGCTGCTGGA GCAACACGAC AAATGGCCCG GCATCGCGTT CGCGCTCGCG CCGGTGGCGC TCGCCGACCT GCATCGCCAC TGGGGCCAGC GCTACGCGTA CGGCGTACCC GCCAAGCGCG TGTTCACCTA CCGGCACGCG CTCTTTCGCG CCGAACAGCT CGGCATTCCG TTCAGGATGC CGCCCGCGCA TCCGTTCGAT TCGACGCGCG CGCTGCTGCT CGCGATCGCG CTCGATTCGG ACGTCCAGGC GATCCGCGAG ATCTTCCGCT TCATCTGGCG CGAGGGGCGC GACCCGTCGG CGCCCGACAA TTTCGCCGAG CTGTGCGGGC GCGTGGGCAT CGCGCACGAC GACGGCCGGC TCACGTCGGA CGAAACGCTC GCGCAGTTGC GCCGCAACAC CGACGACGCG ATCAGCCTGG GCGTATTCGG CGTGCCGACG TTCTGGCTGA ACCGCCAGCT GTTCTGGGGC GAGGACGCGC TGCCGATGGT GCTCTACTGC GCGCGCACGC CGAGCTGGCT CGAATCGAGC GAAGTCAGGC GCATCAGCAC GCTGCCGTCG GGCCTCGCAT GA
|
Protein sequence | MKPEPTRRRP HRIVARFVAV SWRDLAMSIG PTVVLSIAAV WLAIALIQPA PPTSLTISAG PPGSTNWRSA QRYKQILSKN GVTLRVLESE GSAENLARLS DPAQKVDVGF VQSGIEQKGK HEDLVSLGSV GYVPLAILYR GPVIERLSQF KGKRLALGAE GAGAHELGLA LLKMNGIVPG GPTPLLPLSG EDAARALTEG RIDAAFLSGD STQIPVMAKL FRTPGVHFYS FTQAEAYTRR VAYLTDITLP MGVYDPGTNL PPSDIHTLSP TVELIARDTL HPALSDLLIE AARDVHGRAT ILQRAGEFPS PVTHSSFLLS DDAARYYKSG KTFLYRKLPF WVASLVDRLL FIVVPLVVVL IPGLRLVPTL YGWRVRSRIY RWYGALIALE RSALGEHTAQ ERVVLLDKLD DVEESVNRMK MPLAYAGQFY VLREHIGFVR GRLLARDYET PQPAAATPPA APPPWGRRPP VRVSRATLEP KIRRGGCICP RRRVTFNKEA ARCVLFLRRL SMTVGLDASQ PIWFYDFLSP FSYLLLEQHD KWPGIAFALA PVALADLHRH WGQRYAYGVP AKRVFTYRHA LFRAEQLGIP FRMPPAHPFD STRALLLAIA LDSDVQAIRE IFRFIWREGR DPSAPDNFAE LCGRVGIAHD DGRLTSDETL AQLRRNTDDA ISLGVFGVPT FWLNRQLFWG EDALPMVLYC ARTPSWLESS EVRRISTLPS GLA
|
| |