Gene BURPS1106A_A2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2981 
Symbol 
ID4904250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2903847 
End bp2905973 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content68% 
IMG OID640146084 
Producthypothetical protein 
Protein accessionYP_001077010 
Protein GI126457419 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CGGACACGCG TGAAAAAAAT GCGGCGGCGA AGCCCGATTC GCCGTCCAAC 
ATCGGCGAGT GCATTCGGCT GCCGTGCTCG GGGCAGTTTT ACGTGCTCGA ACGCGCGCTG
AGCAAGCCGC GCGTGAGCGG CGCGTTCGAC GGCATCGAGG CGGCGCTCGA CGCGCAAATG
GCGAAGTTCG CCGAACGCAA CGCGGCGGCC GCACGGGCGT TCGCGCAACT GCGCGTGCGC
TGCGTGCGCG AGCAGCGCGC CGCTCTGCAG ATCGACGGCG TCACGGTCGG CGCGACGCTG
TCGTCGCTGA CGCGATCGTT TCTCCGGCCT CCGTTGCTCG CGCCCGAAGC GGACGTGGCG
GAAGCGCGCT TCGCCCACGT GCTGCTTGTC GAATTGACGC TGCGCCCGGT CGGCAAGGAC
GAAGTCGACG CATACCTGTA CGTTTATCGC GAGCTGGCGG ACGATCCGCT CGACCACGGC
CTGCGCGAGC ATTGCACCGA GATCGAGACG CCGGCGTTCG TCGAGCCGTT CGTGCAGCCC
GACGCGACGC GCATCGAGCG CATGAGCATG CGGATGATGG CGGCTTCGCG CGGCGAGGTC
CGGCGCAAGA CGATCGATGC GTACGACGTC GGCGCGACGA CGTCGAGCCT CGGCCTGCAT
CGGACCATCG CCGGCACGAT GACGCTCGCG GTGCCGCACG GCGGCGGCAC GCGGCGGTTC
GACGTGTCGC CGCACCGGCA GCGCGTGCGC GCCGGCACGT CGCGGCTGCC GCTCGACAAG
GTGATCCATT GGGCGTCGGA GCGCGCGGTC GGGTTCAGCC GATCGAGCAC CGCCACCGCG
ACGTCCTCGG CGTTTCTCTC GCAGTTCGCG CGGCCGATCG CGCGCCTGCT CGACAAGACG
CCGTCGAGCG TGCTGATCGA GCGGCGCGCG TTCGCCGAGG CGATCGACGA ATACGCGAGG
CTCACCGGCC TCGCGTGGGT GGCCGGACGT GGCGCGCCCC AGGCATGGAA GACGGTCGAC
GACGTGCTCG ACGCCCTCGG CGACACGTTC GTCGTCGATG GGCAGGCGCT CGACGGCAGC
GGCGCGCCGC TTGACCGGAG CCGGCCATTT GCCGAAGTGT GCTATCGCTC GAGGGCGCCG
TCGATTCCGG GCTTGAAGTC GACCGATGCC GTCAGGCTGA AAGTCGGCAG CCGGACCTGC
AAGATCGTGC TGCCGTCGGC GGTCGGTCAC ATGGGCGGCG CAAAGGACGC GCAGCGCCGC
GGGCTGGGCG AGATCCTGAA CCGGAGCCGC GCGTTCCGGG CCGTGTTCGA CGGCGGGCGC
GTGCTGTTCT GCTCGGAAGG CGCTTACGGA AGCGACGACC TGGCGCTCGC GACCACGCAG
CTCGCGAGCA TTTTCCGCAG CGTCGCCGCG CTCGGCGACG TGCATTCGGA GAAGGGCAAG
ACGCACGAGC ACGCGACATC GTTTCAAGCG CAGAGCTGTT TTCACGTCAT CGAGACCGAA
CGCGCGCTCG CCCATCCCGA TTCGGCGCTG ATCTGCGACG ACTCGACCGT CGAATGGGCC
GACTATATCG AGCTGGACAG CGCCGCGCCG CGCATTCGCT GGATGCACGC GAAAGTGCAG
AAGGTCGCGT CGGAGGCGGC GAAGCGGGCC AGGCAAGACA AGACGACGTT GCCGCCTCAC
GTCAGCCCGA CGGTCGCCGT GCGCCATAGC CCGTCGCTCA GCGCATCGGA TCTGGAGGAG
GTGGTCGGGC AGGCGATCAA GAATCTGGCG CGGCTGCGCC TGCGCACGAG CGACCCGGAA
TTTTCCGGGC GGCACAATAC ATGGATGTCG GAAACCTGCG CGCTGCCGAG CAAGAGCCGC
ATCGCGCGTT TGCGCCGCCT CGGAGGGCTG CGGCGCAAGG CCGACATCGA GGCGCGCTTC
GACGCCGCGG CGATCGATCC GCACGCGGTC TACGAAGTGG CGATCGTCGT GCCGAACTAT
TCGAAGACGC AAGTCGAAAG CGAGCTCGCG AAGATCGCGA CGGGCGACGC GCAGCCTTCG
GTGTTGCAGA TGTTCTGGCT GCTGAGCGGC TTCATGCACG CGTGCCTCGA AGTGGGCGCG
AAGCCGCTCG TGTTCATGCA CGCCTAG
 
Protein sequence
MKKTDTREKN AAAKPDSPSN IGECIRLPCS GQFYVLERAL SKPRVSGAFD GIEAALDAQM 
AKFAERNAAA ARAFAQLRVR CVREQRAALQ IDGVTVGATL SSLTRSFLRP PLLAPEADVA
EARFAHVLLV ELTLRPVGKD EVDAYLYVYR ELADDPLDHG LREHCTEIET PAFVEPFVQP
DATRIERMSM RMMAASRGEV RRKTIDAYDV GATTSSLGLH RTIAGTMTLA VPHGGGTRRF
DVSPHRQRVR AGTSRLPLDK VIHWASERAV GFSRSSTATA TSSAFLSQFA RPIARLLDKT
PSSVLIERRA FAEAIDEYAR LTGLAWVAGR GAPQAWKTVD DVLDALGDTF VVDGQALDGS
GAPLDRSRPF AEVCYRSRAP SIPGLKSTDA VRLKVGSRTC KIVLPSAVGH MGGAKDAQRR
GLGEILNRSR AFRAVFDGGR VLFCSEGAYG SDDLALATTQ LASIFRSVAA LGDVHSEKGK
THEHATSFQA QSCFHVIETE RALAHPDSAL ICDDSTVEWA DYIELDSAAP RIRWMHAKVQ
KVASEAAKRA RQDKTTLPPH VSPTVAVRHS PSLSASDLEE VVGQAIKNLA RLRLRTSDPE
FSGRHNTWMS ETCALPSKSR IARLRRLGGL RRKADIEARF DAAAIDPHAV YEVAIVVPNY
SKTQVESELA KIATGDAQPS VLQMFWLLSG FMHACLEVGA KPLVFMHA