Gene BURPS668_2694 details


Gene Information

Locus tag: BURPS668_2694
Symbol:
ID: 4882067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2666104
End bp: 2667348
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 69%
IMG OID: 640128622
Product: 2-nitropropane dioxygenase family oxidoreductase
Protein accession: YP_001059718
Protein GI: 126439626
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
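
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function name `lengths_from_coords` is hypothetical and used only for illustration.

```python
def lengths_from_coords(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes inclusive coordinates and a terminal stop codon that is
    not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record above.
print(lengths_from_coords(2666104, 2667348))  # (1245, 414)
```

Under those assumptions the result agrees with the Gene Length (1245 bp) and Protein Length (414 aa) fields.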


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCCGCC CCTTTCAGGA ATTTGCCGTG TCCGTGTCCG TCGTCAAAAC CGCATTCAAG 
AATCTCGTGA TCAAGGGCAG ATCGCTGCTG CCGATTGTGC AGGGCGGGAT GGGCGTCGGC
GTGTCCGCGC ACCGGCTCGC CGGCACGGTC GCGTCGCTCG GCGCGTGCGG GACGATCTCG
AGCGTCGACC TGCGTCGGCA TCATCCCGAC CTGATGGCGC GCACCGGCCG CTCGCGCGAT
CGCGCGCTCA TCGACGCGGC GAACCTCGAA GCGCTCGATC GCGAGATCCG CGCGGCGAAG
TCGCTCGCGA ACGGCCGCGG GCTCGTCGCC GTCAACGTGA TGCGCGCGCT GTCCGAATAC
GCTTCGTATG TGCGCCAGTC GTGCGAGAGC GGCGCGCACG CGGTCGTCGT CGGCGCCGGG
CTGCCGCTCG ACTTGCCCGA GCTGACCGCC GATTTTCCCG ACGTCGCGCT GATTCCGATC
CTGTCGGACG CACGCGGAAT CGGGCTCGTG CTGAAGAAGT GGATGCGCAA GAACCGTCTG
CCCGACGCCG TCGTCATCGA GAACCCACGC TACGCGGCGG GCCACCTCGG CGCGCCGACG
ACCGACAGCC TGAACAACCC GAATTTCGCG TTCCCCACGG TGCTCGAAGG CACGTTCGCG
CTGCTCAAGG AGCTCGGCAT CGAGCGCGAG CGGATTCCGC TGATCGCGGC GGGCGGCATT
CACAGCCACG AGCAGGTGCG TCAACTGTTC GCGCTCGGCG CGAGCGCCGT GCAGCTCGGC
ACGCCGTTCG CGGTGACCGA AGAGGGCGAC GCGCATCCGA ACTTCAAGAA AGTGCTCGTC
GAGGCGCAGC CGGACGACAT CGTCACGTTC ATGAGCGTCG CGGGGCTGCC GGCGCGCGCG
GTGCGCACGC CGTGGCTCAC GAACTATCTG GAACGGGAAC GGAAGCTGCA GCGTGCGGCG
AAGCCGCGCA AATGCCTCGT CGGCTTCGAT TGCCTGCAGC AATGCGGGCT GCGCGACGGC
ATCGAGAAGC ACGGCCAGTT CTGCATCGAC ACCCGGCTCG CGTTCGCGCT CGCGGGCGAC
ATCAAGCGCG GGCTGTTCTT CCGCGGCTCG GAAACCTTGC CGTTCGGTCA CGAGATCCGC
TGCGTGCGCG AGCTGATCGA CTATCTGCTC ACGGGCGTCA AGCGTGCGGC CGCCGCGGCG
ATCGCCCCCG CGACGGCGTG CGCGCCCATG CCCGCGCTGG GCTGA
 
Protein sequence
MSRPFQEFAV SVSVVKTAFK NLVIKGRSLL PIVQGGMGVG VSAHRLAGTV ASLGACGTIS 
SVDLRRHHPD LMARTGRSRD RALIDAANLE ALDREIRAAK SLANGRGLVA VNVMRALSEY
ASYVRQSCES GAHAVVVGAG LPLDLPELTA DFPDVALIPI LSDARGIGLV LKKWMRKNRL
PDAVVIENPR YAAGHLGAPT TDSLNNPNFA FPTVLEGTFA LLKELGIERE RIPLIAAGGI
HSHEQVRQLF ALGASAVQLG TPFAVTEEGD AHPNFKKVLV EAQPDDIVTF MSVAGLPARA
VRTPWLTNYL ERERKLQRAA KPRKCLVGFD CLQQCGLRDG IEKHGQFCID TRLAFALAGD
IKRGLFFRGS ETLPFGHEIR CVRELIDYLL TGVKRAAAAA IAPATACAPM PALG
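
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 (as listed in the record), in which codon-to-amino-acid assignments match the standard code and an alternative start codon such as GTG is read as methionine at the first position. The function name `translate` and the constants are hypothetical; this is not the database's own pipeline.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by the
# standard code and bacterial table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons recognised by table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAGCCGCC CCTTTCAGGA ATTTGCCGTG"))  # MSRPFQEFAV
```

The leading GTG is read as M because it is the initiator codon, while the GTG at the tenth codon is read as V, which matches the first ten residues of the protein sequence.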