Gene BURPS1106A_0357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0357 
Symbol 
ID4902515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp329440 
End bp330399 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content68% 
IMG OID640133587 
Product2-nitropropane dioxygenase family oxidoreductase 
Protein accessionYP_001064640 
Protein GI126454748 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCTGC CCGCCGTCCT GCAAAACCTC GCGCTGCCCG TCATCGCATC GCCGATGTTC 
ATCGTCAGCT ATCCGGAGCT CGTGCTCGCG CAATGCAAGG CGGGCATCGT CGGCTCGTTT
CCCGCGCTCA ACGCGCGCCC GGCCGAATTG CTCGACGAAT GGCTCACGCA GTTGCAGACG
CAGCTCGCCG AGCACAAGGC CGCGAACCCG GACGCCGTGA TCGGGCCGAT CGCCGTGAAT
CAGATCGTCC ATCAGTCGAA CGTGCGGCTC GAGCAGGACA TCCGCGTATG CGTCGAGCAC
AAGGTGCCGA TCTTCATCAC GAGCCTGCGC GCGCCGGCGC GCGAGATCGT CGATGCGGTG
CACGGCTACG GCGGCATCGT GCTGCATGAC GTGATCAACC TGCGGCACGC GCAGAAAGCG
CTCGAAGCGG GCGTCGACGG CCTCATCCTC GTCGCCGCGG GCGCGGGCGG CCACGCGGGC
ACGACCTCGC CGTTCGCGCT CGTCGGCGAA GTGCGCAGGA TCTTCGACGG CCCGATCGTG
CTGTCCGGCT CGATCGCGAA CGGCGGCTCG ATCCTTGCCG CGCAGGCGAT GGGCGCCGAT
CTCGCCTACA TGGGCACGCG CTTCATCGCG ACGCAGGAAG CGCACGCGGT GGACGCGTAC
AAGCGCGCGA TCCTCGAGGC GAAATCGGCC GACATCATCT ACACGAACCT CTTCACCGGC
GTGCACGGCA ACTACATCCG CGAGAGCATC GTGAACGCGG GGCTCGACCC GGACGCGCTG
CCCGAATCCG ACAAGACGAA GATGAATTTC GGCGGCGACA AGGCGAAGGC GTGGAAGGAC
ATCTGGGGCG CGGGCCAGGG CGTCGGGCTG ATGGATGACG TTCCGAGCGT TGCGGAGCTC
GTCGCGCGGC TCAAGCGCGA GTACGACGAC GCGAAGGCGC GCCTGGGGAT TCGCGCGTAG
 
Protein sequence
MALPAVLQNL ALPVIASPMF IVSYPELVLA QCKAGIVGSF PALNARPAEL LDEWLTQLQT 
QLAEHKAANP DAVIGPIAVN QIVHQSNVRL EQDIRVCVEH KVPIFITSLR APAREIVDAV
HGYGGIVLHD VINLRHAQKA LEAGVDGLIL VAAGAGGHAG TTSPFALVGE VRRIFDGPIV
LSGSIANGGS ILAAQAMGAD LAYMGTRFIA TQEAHAVDAY KRAILEAKSA DIIYTNLFTG
VHGNYIRESI VNAGLDPDAL PESDKTKMNF GGDKAKAWKD IWGAGQGVGL MDDVPSVAEL
VARLKREYDD AKARLGIRA