Gene BURPS668_A3106 details


Gene Information

Locus tag: BURPS668_A3106
Symbol:
ID: 4887718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2945510
End bp: 2947636
Gene Length: 2127 bp
Protein Length: 708 aa
Translation table: 11
GC content: 69%
IMG OID: 640133042
Product: hypothetical protein
Protein accession: YP_001064097
Protein GI: 126444622
COG category:
COG ID:
TIGRFAM ID:
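
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2945510
end_bp = 2947636

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is a whole number of codons and the final codon is a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.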


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.929812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CGGACAAGCG TGAAAAAAAT GCGGCGGCGA AGCCCGATTC GCCGTCCAAC 
ATCGGCGAGT GCATTCGGCT GCCGTGCTCG GGGCAGTTTT ACGTGCTCGA ACGCGCGCTG
AGCAAGCCGC GCGTGAGCGG CGCGTTCGAC GGCATCGAGG CGGCGCTCGA CGCGCAAACG
GCGAAGTTCG CCGAACGCAA CGCGGCGGCC GCGCGGGCGT TCGCGCAACT GCGCGTGCGC
TGCGTGCGCG AGCAGCGCGC CGCTCTGCAG ATCGACGGCG TCACGGTCGG CGCGACGCTG
TCGTCGCTGA CGCGATCGTT TCTCCGGCCT CCGTTGCTCG CGCCCGAAGC GGACGTGGCG
GAAGCGCGCT TCGCCCACGT GCTGCTTGTC GAATTGACGC TGCGCCCGGT CGGCAAGGAC
GAAGTCGACG CATACCTGTA CGTTTATCGC GAGCTGGCGG ACGATCCGCT CGACCACGGC
CTGCGCGAGC ATTGCACCGA GATCGAGACG CCGGCGTTCG TCGAGCCGTT CGTGCAGCCC
GACGCGACGC GCATCGAGCG CATGAGCATG CGGATGATGG CGGCTTCGCG CGGCGAGGTC
CGGCGCAAGA CGATCGATGC GTACGACGTC GGCGCGACGA CGTCGAGCCT CGGCCTGCAT
CGGACCATCG CCGGCACGAT GACGCTCGCG GTGCCGCACG CTGGCGGCAC GCGGCGGTTC
GACGTGTCGC CGCACCGGCA GCGCGTGCGC GCCGGCACGT CGCGGCTGCC GCTCGACAAG
GTGATCCATT GGGCGTCGGA GCGCGCGGTC GGGTTCAGCC GATCGAGCAC CGCCACCGCG
ACGTCCTCGG CGTTTCTCTC GCAGTTCGCG CGGCCGATCG CGCGCCTGCT CGAGAAGACG
CCGTCGAGCG TGCTGATCGA GCGGCGCGCG TTCGCCGAGG CGATCGACGA ATACGCGAGG
CTTACCGGCC TCGCGTGGGT GGCCGGACGT GGCGCGCCCC AGGCATGGAA GACGGTCGAC
GACGTGCTCG ACGCCCTCGG CGACACGTTC ATCGTCGACG GGCAGGCGCT CGACGGCAGC
GGCGCGCCGC TTGACCGGAG CCGGCCATTT GCCGAAGTGT GCTATCGCTC GAGGGCGCCG
TCGATTCCGG GCCTGAAGTC GACCGATGCC GTCAGGCTGA AAGTCGGCAG CCGGACCTGC
AAGATCGTGC TGCCGTCGGC GGTCGGTCAC ATGGGCGGCG CAAAGGACGC GCAGCGCCGC
GGGCTGGGCG AGATCCTGAA CCGGAGCCGC GCGTTCCGGG CCGTGTTCGA CGGCGGGCGC
GTGCTGTTCT GCTCGGAAGG CGCTTACGGA AGCGACGACC TGGCGCTCGC GACCACGCAG
CTCGCGAGCA TTTTCCGCAG CGTCGCCGCG CTCGGCGACG TGCATTCGGA GAAGGGCAAG
ACGCACGAGC ACGCGACATC GTTTCAAGCG CAGAGCTGTT TTCACGTCAT CGAGACCGAA
CGCGTGCTCG CCCATCCCGA TTCGGCGCTG ATCTGCGACG ACTCGACCGT CGAATGGGCC
GACTATATCG AGCTGGACAG CGCCGCGCCG CGCATTCGCT GGATGCACGC GAAAGTGCAG
AAGGTCGCGT CGGAGGCGGC GAAGCGGGCC AGGCAAGACA AGACGACGTT GCCGCCTCAC
GTCAGCCCGA CGGTCGCCGT GCGCCATAGC CCGTCGCTCA GCGCATCGGA TCTGGAGGAG
GTGGTCGGGC AGGCGATCAA GAATCTGGCG CGGCTGCGCC TGCGCACGAG CGACCCGGAA
TTTTCCGGGC GGCACAACAC ATGGATGTCG GAAACCTGCG CGCTGCCGAG CAAGAGCCGC
ATCGCGCGTT TGCGCCGCCT CGGGGGGCTG CGGCGCAAGG CCGACATCGA GGCGCGCTTC
GACGCCGCGG CGATCGATCC GCACGCGGTC TACGAAGTGG CGATCGTCGT GCCGAACTAT
TCGAAGACGC AAGTCGAAAG CGAGCTCGCG AAGATCGCGG CGGGCGGCGC GCAGCCTTCG
GTGTTGCAGA TGTTCTGGCT GCTGAGCGGC TTCATGCACG CGTGCCTCGA AGTGGGCGCG
AAGCCGCTCG TGTTCATGCA CGCCTAG
 
Protein sequence
MKKTDKREKN AAAKPDSPSN IGECIRLPCS GQFYVLERAL SKPRVSGAFD GIEAALDAQT 
AKFAERNAAA ARAFAQLRVR CVREQRAALQ IDGVTVGATL SSLTRSFLRP PLLAPEADVA
EARFAHVLLV ELTLRPVGKD EVDAYLYVYR ELADDPLDHG LREHCTEIET PAFVEPFVQP
DATRIERMSM RMMAASRGEV RRKTIDAYDV GATTSSLGLH RTIAGTMTLA VPHAGGTRRF
DVSPHRQRVR AGTSRLPLDK VIHWASERAV GFSRSSTATA TSSAFLSQFA RPIARLLEKT
PSSVLIERRA FAEAIDEYAR LTGLAWVAGR GAPQAWKTVD DVLDALGDTF IVDGQALDGS
GAPLDRSRPF AEVCYRSRAP SIPGLKSTDA VRLKVGSRTC KIVLPSAVGH MGGAKDAQRR
GLGEILNRSR AFRAVFDGGR VLFCSEGAYG SDDLALATTQ LASIFRSVAA LGDVHSEKGK
THEHATSFQA QSCFHVIETE RVLAHPDSAL ICDDSTVEWA DYIELDSAAP RIRWMHAKVQ
KVASEAAKRA RQDKTTLPPH VSPTVAVRHS PSLSASDLEE VVGQAIKNLA RLRLRTSDPE
FSGRHNTWMS ETCALPSKSR IARLRRLGGL RRKADIEARF DAAAIDPHAV YEVAIVVPNY
SKTQVESELA KIAAGGAQPS VLQMFWLLSG FMHACLEVGA KPLVFMHA
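
The gene and protein sequences above can be related to each other with a short script. This is a minimal sketch, not the database's own method: it builds the standard codon table (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons), strips the whitespace used in the 10-character display blocks, and translates. The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers, and only the first 60 bases of the gene are used here for brevity.

```python
# Standard codon table, built in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence as displayed above.
fragment = "ATGAAAAAAA CGGACAAGCG TGAAAAAAAT GCGGCGGCGA AGCCCGATTC GCCGTCCAAC"
peptide = translate(fragment)
print(peptide)
print(round(gc_fraction(fragment), 3))
```

Pasting the full gene sequence into `fragment` should reproduce the full protein sequence and allow the GC content field to be recomputed; the GC fraction of a 60-base fragment will generally differ from the whole-gene figure.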