Gene BURPS1106A_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2106 
Symbol 
ID4901868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2097954 
End bp2099003 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID640135336 
Producthypothetical protein 
Protein accessionYP_001066371 
Protein GI126453392 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.588734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACCG TTGACGAAGA CGACATCGGC ACGGCGAGCG GCCGCGACGA AGGCGACTGG 
GTGCCCAACC GGTTTTGCTT GCGCAACGCC TGGTTTCCCC TCGCGCATAC GTTCGAAATC
GGCGAGCGCG CGTCGCGCTG GCAGATCTAC TCGCAGCCGT GCTATCTGTG GCGCGCACGC
GGGCGCATCC ATGCATCGCG CCGGCATCCG GACCTGCCCG CCGCCCCCGC CACGCCCGCC
ATGCCCGCCG CGCCGGACTC GCCGTTCGAG CCGCCCGAGC GCTATCCGGT GGTCGAGCGA
TTCGGCTACG TATGGATCTG GTACGGCGAC CCGGAGCACG CGAGCGACGC GCTCGTGCCC
GACGTGCCGT TCCTGCCGCG CGAAGGGGGG CTGCCCGAGC GCATGCAGGG CAACATCCGG
CTCGACTGCT GCACGCCGCT GCTCGTCGAG AACCTGCTCG ACCTGACGCA CGCGGACTAT
CTGCACGCGA ACCTGCTCGG CGACGAGCAA TCCGAAGAGG ATCGCGTCGA CGTGCGGTTC
ACCTCCGAGA CGGTGACGAT GATCCGGCAG TGCACGAACA AATCGATCGC GCCGATCATG
CGCTGGTTCG GCGGCGTGCG CGCGAAGTAT CAGGACGTTC ACGTCGTGAT CCACGTGCAT
GTGCGCAGCT CCGTCGCGGT CGCGTACGGA CGCTACATGC CGGGCATCGA TCTGCCGATC
TTCCACCCGT GCGTGCCGGA ATCGCGCGAC CGGTGCCGGC TCAGCTTCGC GTTGAACATG
ACGCGAACGC CGTGGCTGCT GCGCGCGCTG ATGCCGCTCA CGCCTTACAT CGTGCTGCCG
CAGGACAATC GCATGATCGG CCCGCAAAGC ACCCGCTACC GGGATGCCGG CGAGCGCCGC
GATCTGTATT CGCGCTTCGA CCGCGCGGGG CTGCGGTATC GGCTCCTGCT GCAGCAGCTC
GCCCGGCGGC AGCGCGACGG CGATTTCTCG TACGCCCCCG ATGCGCTGCC CGGCCAGGAC
GCGCGCGGCA TTCTCGGCAT GCCGGACTAG
 
Protein sequence
MATVDEDDIG TASGRDEGDW VPNRFCLRNA WFPLAHTFEI GERASRWQIY SQPCYLWRAR 
GRIHASRRHP DLPAAPATPA MPAAPDSPFE PPERYPVVER FGYVWIWYGD PEHASDALVP
DVPFLPREGG LPERMQGNIR LDCCTPLLVE NLLDLTHADY LHANLLGDEQ SEEDRVDVRF
TSETVTMIRQ CTNKSIAPIM RWFGGVRAKY QDVHVVIHVH VRSSVAVAYG RYMPGIDLPI
FHPCVPESRD RCRLSFALNM TRTPWLLRAL MPLTPYIVLP QDNRMIGPQS TRYRDAGERR
DLYSRFDRAG LRYRLLLQQL ARRQRDGDFS YAPDALPGQD ARGILGMPD