Gene BURPS668_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3149 
Symbol 
ID4885453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3086294 
End bp3088465 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content69% 
IMG OID640129077 
Producthypothetical protein 
Protein accessionYP_001060161 
Protein GI126440350 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component
[COG3917] 2-hydroxychromene-2-carboxylate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCAG AACCCACCCG CCGCCGGCCG CACCGCATCG TCGCCCGCTT CGTCGCGGTG 
TCGTGGCGCG ATCTCGCGAT GTCGATCGGG CCGACCGTCG TGCTGTCGAT CGCCGCGGTC
TGGCTCGCGA TCGCGCTGAT CCAGCCCGCG CCGCCGACGT CGCTCACGAT CTCCGCCGGC
CCGCCCGGCA GCACGAACTG GCGCTCGGCG CAGCGCTACA AGCAGATCCT GTCGAAAAAC
GGCGTGACGC TGCGCGTACT CGAATCCGAG GGCTCGGCCG AAAATCTCGC ACGGCTGTCG
GACCCCGCGC AGAAGGTCGA TGTCGGCTTC GTGCAAAGCG GCATCGAGCA GAAGGGAAAG
CACGAGGATC TCGTGTCGCT CGGCAGCGTC GGCTACGTGC CGCTCGCGAT CCTGTATCGC
GGGCCCGTGA TCGAGCGGCT GTCGCAGTTC AAGGGCAAGC GGCTCGCGCT CGGCGCCGAG
GGCGCGGGCG CGCACGAGCT CGGCCTCGCG CTGCTGAAGA TGAACGGCAT CGTGCCGGGC
GGCCCGACCC CGCTGCTGCC GCTGTCCGGC GAGGACGCCG CGCGCGCGCT GACGGAAGGC
CGCATCGACG CCGCGTTCCT GTCCGGCGAT TCGACGCAGA TTCCGGTGAT GGCCAAGCTG
TTTCGCACGC CCGGCGTGCA CTTCTATTCG TTCACGCAGG CCGAGGCGTA CACGCGGCGC
GTTGCATACC TGACCGACAT CACGCTGCCG ATGGGCGTCT ACGATCCGGG CACGAACCTG
CCGCCGTCGG ACATCCACAC GCTGTCGCCC ACCGTCGAGC TGATCGCGCG CGACACGCTC
CACCCGGCGC TGTCGGACCT GCTCATCGAG GCGGCGCGCG ACGTGCACGG CCGTGCGACG
ATCCTGCAGC GCGCGGGCGA ATTTCCGTCG CCCGTCACGC ACAGCAGCTT CCTGTTGTCC
GACGACGCCG CACGCTACTA CAAGTCGGGC AAGACCTTCC TGTACCGGAA GCTGCCGTTC
TGGGTGGCGA GCCTCGTCGA CCGGCTGCTC TTCATCGTCG TGCCGCTCGT CGTCGTGCTG
ATTCCGGGGC TGCGGCTCGT GCCGACGCTG TACGGCTGGC GCGTGCGCTC GCGGATCTAC
CGGTGGTACG GCGCGCTCAT CGCGCTCGAG CGCAGCGCGC TCGGCGAACA TACCGCGCAA
GAGCGCGTCG TGCTGCTCGA CAAGCTCGAC GACGTCGAGG AATCAGTCAA CCGGATGAAG
ATGCCGCTCG CGTACGCCGG ACAGTTCTAC GTGCTGCGCG AGCATATCGG CTTCGTTCGC
GGGCGGCTGC TCGCGCGCGA TTACGAGACG CCGCAGCCCG CCGCGGCGAC ACCGCCCGCC
GCGCCCCCCC CCTGGGGGCG CCGCCCGCCG GTTCGCGTCA GCCGGGCGAC GCTTGAGCCG
AAAATCCGTC GCGGCGGCTG CATTTGCCCG CGCCGCCGCG TAACATTCAA CAAAGAGGCC
GCGCGTTGCG TCCTATTCCT CAGGAGACTC TCCATGACCG TCGGCCTCGA CGCCTCCCAG
CCGATATGGT TCTACGACTT CCTGTCGCCG TTCTCGTATC TGCTGCTGGA GCAACACGAC
AAATGGCCCG GCATCGCGTT CGCGCTCGCG CCGGTGGCGC TCGCCGACCT GCATCGCCAC
TGGGGCCAGC GCTACGCGTA CGGCGTACCC GCCAAGCGCG TGTTCACCTA CCGGCACGCG
CTCTTTCGCG CCGAACAGCT CGGCATTCCG TTCAGGATGC CGCCCGCGCA TCCGTTCGAT
TCGACGCGCG CGCTGCTGCT CGCGATCGCG CTCGATTCGG ACGTCCAGGC GATCCGCGAG
ATCTTCCGCT TCATCTGGCG CGAGGGGCGC GACCCGTCGG CGCCCGACAA TTTCGCCGAG
CTGTGCGGGC GCGTGGGCAT CGCGCACGAC GACGGCCGGC TCACGTCGGA CGAAACGCTC
GCGCAGTTGC GCCGCAACAC CGACGACGCG ATCAGCCTGG GCGTATTCGG CGTGCCGACG
TTCTGGCTGA ACCGCCAGCT GTTCTGGGGC GAGGACGCGC TGCCGATGGT GCTCTACTGC
GCGCGCACGC CGAGCTGGCT CGAATCGAGC GAAGTCAGGC GCATCAGCAC GCTGCCGTCG
GGCCTCGCAT GA
 
Protein sequence
MKPEPTRRRP HRIVARFVAV SWRDLAMSIG PTVVLSIAAV WLAIALIQPA PPTSLTISAG 
PPGSTNWRSA QRYKQILSKN GVTLRVLESE GSAENLARLS DPAQKVDVGF VQSGIEQKGK
HEDLVSLGSV GYVPLAILYR GPVIERLSQF KGKRLALGAE GAGAHELGLA LLKMNGIVPG
GPTPLLPLSG EDAARALTEG RIDAAFLSGD STQIPVMAKL FRTPGVHFYS FTQAEAYTRR
VAYLTDITLP MGVYDPGTNL PPSDIHTLSP TVELIARDTL HPALSDLLIE AARDVHGRAT
ILQRAGEFPS PVTHSSFLLS DDAARYYKSG KTFLYRKLPF WVASLVDRLL FIVVPLVVVL
IPGLRLVPTL YGWRVRSRIY RWYGALIALE RSALGEHTAQ ERVVLLDKLD DVEESVNRMK
MPLAYAGQFY VLREHIGFVR GRLLARDYET PQPAAATPPA APPPWGRRPP VRVSRATLEP
KIRRGGCICP RRRVTFNKEA ARCVLFLRRL SMTVGLDASQ PIWFYDFLSP FSYLLLEQHD
KWPGIAFALA PVALADLHRH WGQRYAYGVP AKRVFTYRHA LFRAEQLGIP FRMPPAHPFD
STRALLLAIA LDSDVQAIRE IFRFIWREGR DPSAPDNFAE LCGRVGIAHD DGRLTSDETL
AQLRRNTDDA ISLGVFGVPT FWLNRQLFWG EDALPMVLYC ARTPSWLESS EVRRISTLPS
GLA