Gene BURPS1106A_3343 details

Gene Information

Locus tag: BURPS1106A_3343
Symbol: phnI
ID: 4902061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3266087
End bp: 3267295
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 71%
IMG OID: 640136569
Product: phosphonate metabolism protein PhnI
Protein accession: YP_001067580
Protein GI: 126453653
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3626] Uncharacterized enzyme of phosphonate metabolism
TIGRFAM ID:
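
The start, end, and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that matches the reported 1209 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record above.
START_BP = 3266087
END_BP = 3267295


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1209
print(protein_length(length))  # 402
```

Both values agree with the Gene Length and Protein Length fields listed in the record.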


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.304009
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACGTGG CAGTCAAGGG CGGCGAGCGG GCGATCGAGG CGTCGTGGCG TTTGCTCGAC 
CAAGCGCGGC GCGGCGACCC CGCCGTGCCC GAGCTGAGCG TCGCGCAGAT TCGCGAGCAG
CAGCGCCTCG CGGTCGCGCG CGTGATGACG GAAGGCTCGC TGTACGACGA GGCGCTGGCG
GCGCTCGCGA TCAAGCAGGC GGCGGGCGAT CTCGTCGAGG CGATCTTCCT GCTGCGCGCA
TACCGGACGA CGCTGCCGCG CTTCGGCTAC ACCGCGCCGA TCGATACCGG CGCGATGCGG
CTCGAGCGAC GGATCTCGGC AACGTTCAAG GACATCCCGG GCGGCCAGTT GCTCGGCCCG
ACCTACGATT ACACGCAGCG GCTGCTCGAT TTCGCGCTGC TCGCCGAAGG CGACGCAGCG
CGGCACGAGC CGCACGAGGC CGAATCGGCG CGCGCCGCGG CCGACGCAGG CTCGCCGCCG
CCGCATGCGA CGCATGCGCC GCCGCCCGCG GCTGCGCGCG TGATCGCGCT CCTGAACGAC
GAAGGCCTGA TCGAAGAGGA GCGGCCGACG GCGGTCGGCG CCGAGCCGGG CGACCTGTCG
CGCGAGCCGC TCGCGTTTCC GGCCGATCGC GCGACGCGCC TGCAGAATCT CGCACGCGGC
GACGAAGGTT TTCTGCTCGC GATGGGCTAC GCGACGCAGC GCGGCTACGG TCATTCGCAT
CCGTTCGCGG GCGAGCTGCG CTTTGGCGCC GTCGCCGTCG AAATGGCGCT CGACGAGCTC
GACGGCGAGA CGATCGAGAT CGGCGAGCTC GACGTGACCG AGTGCCAGAT GATCAACCAT
TTTTCCGGCG GCGACGGCGA GCCGCCTCGC TTCACGCAGG GCTACGGCCT CGCGTTCGGC
CATTCGGAGC GCAAGGCGAT CGCGATGGCG CTCGTCGATC GCGCGCTGCG CGCGTCCGAG
CTCGGCGAAG CCGCGCACTC GCCGCCGCAG GATCAGGAGT TCGTGCTGTC GCACAGCGAC
AACGTCGAGG CGTCCGGCTT CGTCCAACAC CTGAAGCTGC CGCACTACGT CGATTTCCAG
TCCGAGCTCG AGCTCGTGCG CCGCCTGCGC GCCGGGCATG CGGCGCAGGC GGGAGCGGGC
GCGAACGCGC ACGAGAACGC GCCCGCCCGC GCCGAACGCG CCGACACGCA CACCGAGGAG
TCCCGATGA
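
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. As a minimal sketch, the helper names below (hypothetical, not from the source database) clean one line of the listing and compute its GC fraction. Only the first line is used here to keep the example short, so the result differs from the 71% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGTACGTGG CAGTCAAGGG CGGCGAGCGG GCGATCGAGG CGTCGTGGCG TTTGCTCGAC"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # 0.667
```

Passing the full listing through the same two functions would give the gene-wide value to compare against the GC content field.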
 
Protein sequence
MYVAVKGGER AIEASWRLLD QARRGDPAVP ELSVAQIREQ QRLAVARVMT EGSLYDEALA 
ALAIKQAAGD LVEAIFLLRA YRTTLPRFGY TAPIDTGAMR LERRISATFK DIPGGQLLGP
TYDYTQRLLD FALLAEGDAA RHEPHEAESA RAAADAGSPP PHATHAPPPA AARVIALLND
EGLIEEERPT AVGAEPGDLS REPLAFPADR ATRLQNLARG DEGFLLAMGY ATQRGYGHSH
PFAGELRFGA VAVEMALDEL DGETIEIGEL DVTECQMINH FSGGDGEPPR FTQGYGLAFG
HSERKAIAMA LVDRALRASE LGEAAHSPPQ DQEFVLSHSD NVEASGFVQH LKLPHYVDFQ
SELELVRRLR AGHAAQAGAG ANAHENAPAR AERADTHTEE SR
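
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is a hand-rolled translator, not the database's own pipeline. It builds the codon table from the compact 64-letter string used for the standard genetic code, on the assumption that translation table 11 shares those codon assignments and differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listing above.
print(translate("ATGTACGTGG CAGTCAAGGG CGGCGAGCGG GCGATCGAGG CGTCGTGGCG TTTGCTCGAC"))
# MYVAVKGGERAIEASWRLLD
print(translate("TCCCGATGA"))
# SR
```

The two outputs match the first twenty residues and the last two residues of the listed protein sequence.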