Gene BURPS1106A_3913 details

Gene Information

Locus tag: BURPS1106A_3913
Symbol:
ID: 4901754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3819397
End bp: 3820497
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 73%
IMG OID: 640137139
Product: 2-nitropropane dioxygenase family oxidoreductase
Protein accession: YP_001068133
Protein GI: 126453588
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
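
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length matches that convention). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3819397
end_bp = 3820497

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1101
print(gene_length % 3)  # 0
print(protein_length)   # 366
```

Under that assumption the result agrees with the reported gene length of 1101 bp and protein length of 366 aa.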


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.841992
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTCCC GTATCGCCCC GACTCCGTTC GCCGCCCGGT TCGACTTGCG CCTGCCGCTC 
GTGCAGGCGC CGATGGTCGG CGCGACGACG CCCGCGCTCG TCGCGGCCGC CTCCAACGCC
GGTGCGCTCG GCAGCCTCGG CGGCGCGTCG TTCGCGCCGG AGAAGCTTGC CGCCGAAATC
GCCGCGGTGC GTGCCGCGAC GCGCCGCGCG TTCGCCGTGA ACCTGTTCGT GCTGCCCGAC
GCGCAGCCGG ACGACGCGGC CGTGCGTCGC GCGCTCGACG CGATCGATCC GCTGCGCGCG
CGGTTCGGGT TGCCGCCCGG CGCGCCGCTG CCGCGCTACG CGCCGGATTT CCGCGCGCAA
CTCGATGCGC TCGTCGACGC GCGCGTGCCG GTCGCGAGCT TCACGTTCGG CGTGCTCGAC
AAGAAAGATG TCGTCCGGCT GCAGGCGGCG GGCACGTATG TGATCGGCAC GGCAACGCAT
GTCGCCGAGG GCCTCGCGTG GCAGGCGGCG GGCGCCGACG CGATCTGCGC GCAAGGCGCG
GAAGCGGGCG GCCATCGCGG CACGTTCATC GGTTCGGCCG AAGACGCGCT CGTCGGCACG
ATCGCGCTCG TGCCGCAGCT CGTCGACGCG ACGAATCTGC CGGTGCTCGC GGCGGGCGGC
ATCATGGACG GGCGTGGGAT CGCCGCCGCG CTCGCGCTCG GCGCGCAAGC CGCGCAGCTC
GGCACCGCGT TTCTCACGTG CGCGGAAAGC GCGATTCCCG CGTGCTGGAA AGCGCGTCTG
CTCGCGAGCG ACGATACGTC GACGTCCGTC ACGCGCGCGA TCACGGGCCG CCACGCGCGC
GGCATCCGCA ATGCGCTGAT GGCGCAGCTG GCCGGACGGC CCGATTCGGT CGCGCCGTAT
CCGGTGCAAA ACGCGCTGAC GCAGGAGCTG CGGCAAACCG CCGCGCGAGC GGGCGACGCC
GAGTACTTGT CGTTGTGGTC CGGGCAAGGC GCGCCGCTCG GCAAGCACCG CGATGGCGCG
CAAACCACCG CGCAATTGAT CGACGCGCTC GACGCCGAAT GGCGCGCTGC GCTTTCGCGC
TCCGTTATTT CCCTGGTCTG A
 
Protein sequence
MTSRIAPTPF AARFDLRLPL VQAPMVGATT PALVAAASNA GALGSLGGAS FAPEKLAAEI 
AAVRAATRRA FAVNLFVLPD AQPDDAAVRR ALDAIDPLRA RFGLPPGAPL PRYAPDFRAQ
LDALVDARVP VASFTFGVLD KKDVVRLQAA GTYVIGTATH VAEGLAWQAA GADAICAQGA
EAGGHRGTFI GSAEDALVGT IALVPQLVDA TNLPVLAAGG IMDGRGIAAA LALGAQAAQL
GTAFLTCAES AIPACWKARL LASDDTSTSV TRAITGRHAR GIRNALMAQL AGRPDSVAPY
PVQNALTQEL RQTAARAGDA EYLSLWSGQG APLGKHRDGA QTTAQLIDAL DAEWRAALSR
SVISLV
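
The gene and protein sequences above can be cross-checked against each other with a short script. The following is a minimal sketch in Python, not the method used by the database. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch ignores). Only the first and last blocks of the gene sequence are used here for brevity.

```python
# Illustrative helpers; the names are hypothetical, not from any database tool.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid map (standard assignments, base order T, C, A, G).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last three blocks of the gene sequence listed above.
first_block = "ATGACTTCCC GTATCGCCCC GACTCCGTTC"
last_block = "TCCGTTATTT CCCTGGTCTG A"

print(translate(first_block))  # MTSRIAPTPF
print(translate(last_block))   # SVISLV
```

The two outputs match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content reported in the record.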