Gene BURPS668_2051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2051 
Symbol 
ID4883765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2041597 
End bp2042640 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content68% 
IMG OID640127979 
Producthypothetical protein 
Protein accessionYP_001059086 
Protein GI126440868 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCGA CCGTTGACGA AGACGACATC GGCACGGCGA GCGGCCGCGA CGAAGGCGAC 
TGGGTGCCCA ACCGGTTTTG CTTGCGCAAC GCCTGGTTTC CCCTCGCGCA TACCTTCGAA
ATCGGCGAGC GCGCGTCGCG CTGGCAGATC TACTCGCAGC CGTGCTATCT GTGGCGCGCA
CGCGGGCGCA TCCATGCATC GCGCCGGCAT CCGGACCTGC CCGCCGCCCC CGCCATGCCC
GCCGCGCCGG ACTCGCCGTT CGAGCCGCCC GAACGCTATC CGGTGGTCGA GCGATTCGGC
TACGTATGGA TCTGGTACGG CGACCCGGAG CGCGCGAGCG ACGCGCTCGT GCCCGACGTG
CCGTTCCTGC CGCGCGAAGG GGGGCTGCCC GAGCGCATGC AGGGCAACAT CCGGCTCGAC
TGCTGCACGC CGCTGCTCGT CGAGAACCTG CTCGACCTGA CGCACGCGGA CTATCTGCAC
GCGAACCTGC TCGGCGACGA GCAATCCGAA GAGGATCGCG TCGACGTGCG GTTCACGTCC
GAGACGGTGA CGATGATCCG GCAGTGCACG AACAAATCGA TCGCGCCGAT CATGCGCTGG
TTCGGCGGCG TGCGCGCGAA GTATCAGGAC GTTCACGTCG TGATCCACGT GCATGTGCGC
AGCTCCGTCG CGGTCGCGTA CGGACGCTAC ATGCCGGGCA TCGATCTGCC GATCTTCCAC
CCGTGCGTGC CGGAATCGCG CGACCGGTGC CGGCTCAGCT TCGCGTTGAA CATGACGCGA
ACGCCGTGGC TGCTGCGCGC GCTGATGCCG CTCACGCCTT ACATCGTGCT GCCGCAGGAC
AATCGCATGA TCGGCCCGCA AAGCACCCGC TACCGGGATG CCGGCGAGCG CCGCGATCTG
TATTCGCGCT TCGACCGCGC GGGGCTGCGG TATCGGCTCC TGCTGCAGCA GCTCGCCCGG
CGGCAGCGCG ACGGCGATTT CTCGTACGCC CCCGATGCGC TGCCCGGCCA GGACGCGCGC
GGCATTCTCG GCATGCCGGA CTAG
 
Protein sequence
MMATVDEDDI GTASGRDEGD WVPNRFCLRN AWFPLAHTFE IGERASRWQI YSQPCYLWRA 
RGRIHASRRH PDLPAAPAMP AAPDSPFEPP ERYPVVERFG YVWIWYGDPE RASDALVPDV
PFLPREGGLP ERMQGNIRLD CCTPLLVENL LDLTHADYLH ANLLGDEQSE EDRVDVRFTS
ETVTMIRQCT NKSIAPIMRW FGGVRAKYQD VHVVIHVHVR SSVAVAYGRY MPGIDLPIFH
PCVPESRDRC RLSFALNMTR TPWLLRALMP LTPYIVLPQD NRMIGPQSTR YRDAGERRDL
YSRFDRAGLR YRLLLQQLAR RQRDGDFSYA PDALPGQDAR GILGMPD