Gene BURPS1106A_A3098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A3098 
Symbol 
ID4904182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp3012195 
End bp3013256 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content75% 
IMG OID640146201 
Producthypothetical protein 
Protein accessionYP_001077127 
Protein GI126456682 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.677062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCGTC ATTTTCTCGT TCCCGTCGAC GACACCGATG CCGGCATCGA CACGGTTGCC 
TATGCGCTCG AGTTCGCGCG CTCGATCGGC GCGCGCATCA CGTTCGTGCA GACGCGCATC
GAATTCGAAG CAGCCGATGC GGCGCGCCGT ATCGGCGAAA CGGAACGAAA GGAACAAACG
CAGCGAACAG ACGATGCCGG CAAGTCCGGC GAGTGCGGCA GATCAGACGA AAGCGTTCGG
CCGGCGCCGA CGCCGGATGC GGCGGCGCGG CCCGCCGCCG AGGCAACGCC GGCCGCGCGC
GCGCCGGAGC TGCCGATCGC GAAAGCCGAG GCCGCCGCCC GCGCGCAGGG CGTGCCGTGC
GATTCGGTGC GCGCCGCCGG CGCGACGACG GCGGACGCGC TCGCCGGCGC GATGCTCGCG
CACGATTGCG ACCTGCTGTG CGTCGGGCCC GCGCTCGGCG ATGCGGCAGC CGCGCCGCCG
CACGCGTGCG TCGCGGACCG GCTCGCCGCG CGGGGCATCG CCGTGCTGAC CTGCGCGTTT
CGGCGCACGC CGGCCGCCGC GCGCGCGATC GCCGCGCTGT ATGCCGCGCA TCGCGAAGCG
GCCGGCGCGC TCGGCGCATG GCTCGCGCAG TTGCGCGCGG CGATCGCCGC CGGCCGCGCG
CTCGACGCCG ACGCGGCGCA CGCGATCGCC AATGGTCTGA GCCATCTGCG CGACGGGCGG
CAGCCGAAAG CGGCGCGCCG GCTCTACGCG GCGCTGCGCG GCGCGACGGG CGCGCTCGAC
GCTGAACTCG GCGAGCTCGA GCGGCAGCGG CTGCGCAATG CGCGGATGTT GTCCGGGCTG
CTCGAGGCGA TCCACGCGGG CATCGCGCGC GAAGCGCCGC CCGTGCGCCT CGAGCACGCG
CTGAGCGCAT ACGCGCAATG CGTGTGCGAG CACGCCGGCC GCGGCGAAGG CGTGATCGTG
CCGGCCGCGC AGCGCTATCT GGCCGACGAC GACTGGCGCG CGATCGACGC GTCGCTTGCC
TTGATCGCGT CGGGCCCGGC GGCCGCGGCG CGCGGCGCGT GA
 
Protein sequence
MYRHFLVPVD DTDAGIDTVA YALEFARSIG ARITFVQTRI EFEAADAARR IGETERKEQT 
QRTDDAGKSG ECGRSDESVR PAPTPDAAAR PAAEATPAAR APELPIAKAE AAARAQGVPC
DSVRAAGATT ADALAGAMLA HDCDLLCVGP ALGDAAAAPP HACVADRLAA RGIAVLTCAF
RRTPAAARAI AALYAAHREA AGALGAWLAQ LRAAIAAGRA LDADAAHAIA NGLSHLRDGR
QPKAARRLYA ALRGATGALD AELGELERQR LRNARMLSGL LEAIHAGIAR EAPPVRLEHA
LSAYAQCVCE HAGRGEGVIV PAAQRYLADD DWRAIDASLA LIASGPAAAA RGA