Gene BURPS1106A_A2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2040 
Symbol 
ID4906230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2012829 
End bp2016833 
Gene Length4005 bp 
Protein Length1334 aa 
Translation table11 
GC content75% 
IMG OID640145145 
Producthypothetical protein 
Protein accessionYP_001076073 
Protein GI126458165 
COG category[S] Function unknown 
COG ID[COG3523] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGATCG CGCGGATCGC CGCGCTGCTC GCGCTCGCCG CGCTCAGTTG GGCGAGCGCG 
CTCTATTTCG ACTGGCCGCT GTGGTGCGCG CCCGCCGTGT TCTGCGCGGC GCTCGCCGGA
TGGCTGCTGT GCCGCGTCGC GCGCCGCGCG CTGCGCGCGG TTCGTGCGCG CGCGCAGCTC
GCACGGCTCG ACGCATCCGA GCGCCTGCCC GCGGCCGACG CGCCGGAGCT GCGCGTCGCC
GCGCGCTGGC GCACCGCGCT CGCCGCGCTC GGGCGCGCGC AGCCCGCGCC GCGCCTGCGC
GGCCGGCCGC GCGACGCGCT GCCGTGGTAT CTCGTGATCG GCCCCGAGGG CGCGGGCAAG
ACAACCGCGC TCGCACGCGC GCGCCTCGTG TCGCCGCTGC GGCACGCGCG CGACGACGCG
GCCATCGCGC CGACCGAGGA CTGCGACTGG TGGTGCTTCG ACGACGCGGT CGTGCTCGAT
CTCGCGGGCC GCTTCGCCGC GCCCGGCGCG ACCGACGACG ATCGCCGCGC ATGGCACGCG
CTCCTCGAGC AGCTCGGCCG CGCGCGTGCG CGCCGGGGCG TGAACGGCGT CGTGGTCGCG
ATCGACGCGC CGCGCCTCAT CGAAGCGAGC CGCGATGCGC TGACGATCGA AGGCAGCGCC
ATTCGCGAAC GGCTCGAGCA ACTGATCCGC CTGTTCGACC GGCGCTTTCC GGTGTACGTG
CTCGTCACGC AATGCGACCG GCTCTACGGC TTCGACGAAT GGGCCGCGCA GCTCGCGCCG
GAACAACGCG AGCGCGCGTT CGGCTATCTC GGCGATCACG ACGCGGATGC GTTCGTCGCG
CGCGCGCTCG ACAGCCTCGA CGCACGGCTC GGCGCACTGC GCGTCGCGCT GGCGGCGCGC
GGCGAGCCGC CGTCGCCGCG CGCGCTGATG CTGCCGCACG AACTCGCGTG CCTGCGGCCC
GCGCTCGAGG CGTTCGCACG CGCGGCGTTC GGCCCGAACG TCTATCAGGA AACACCGTAT
CTGCGCGGGC TGCTGTTCGC GAGCGGCCGG CAGGCGGGCG GCGCGCCGTC GCTGACGCTG
CCCGACTGGC TCGACGCCGC GCCGGCGCGC ACGCCCGGCG ATGCGGGCCT GTTCCTGCAC
GACGTGTTCG CGCGCGTGCT GCCGGGCGAG CGCGACGCGT GGCTGCCCGT CGAGCGGCCG
AGCCGCTCGC GGCTCGTGCT GCGGCGCCTC GCGCTCGCCG CGTGGCTCCT CGCGAGCATC
GCGGCCGGGC TGCTGATGAG CGCATCGTTC GTCGGCGACA TGCGAACCGT CGAGCTGGTC
CGCCGCGACT ATCCCGCGCA TCCGCGCTTC ACCGGCGAAT TCGCGCACGA CACGGCGACG
CTCGAGCGGA TCGGCCGCGT GATCGCCGAC GTCGAGCGGC GCGACGAGCG GCGGCTCGTG
CGCCCGCTCG CCGGCGCGAC GCCCGTCGGC CGGCTCGAAG CGGAACTCAA GCGCCGCTAC
GTCGCGCACT ATCGGCGCTC GATCGAACCC ACCGCCGATC GGCTGTTCTT CGGCGAGCCG
GACGGCGCGA GCGACGTGCG CGCGGCCGAC GATGCGCGGC TCGCCGCGCG CATCCGCAAT
CTGGTCCGCT ACGTGAACCT GATGCAGGCG CGCCGGCGCG GCGCGGATCG CGAGACGCTC
GCGCAAATGC CCGCACCGGC GGTCGCGCGC GCGACGGATG GCGGTGGCGC TCGCGGAGTG
GACAGTGTGG GTGTTTCTGG TGTTTCTGGT GTTTCTGGTG TTTCTGGTGT TTCTGGTGTT
TCTGGTGTTT CTGGTGTTTC TGGCGGTGTA GGCGGTGTAG GCGGTGTAGG CGGTGCGCGC
GCCGCGGAAG GCGCGGGTCG CGCCGTCCGC GCGAGCGGCG GACGGCGCGC AGGCGACGCG
CGCGCCGCGC GCGGCATGCG CGGCGCCGCC GGCAACGCCG GCGCCGGCGC TGAAATCGCA
CCCGCCGCAG ACGGCGTTCG CGATCGCGAC GAAGGGTTCG CGCGCGTCAG CGCGCTCGCC
GTCGACCTGA TCGCGTGGAG CGCGCCCGAC GATCGCGCGC TCGCCGCGCG CGTCGCGGCC
GCGCAGGCAG AGCTCGAACG GCTCGCGTAC CGCGATCCGG ACGGCGCGTG GCTGCTCGCG
CTGCCGGATG CCTCGATGCC GCGCGACGTG ACGCTCGCCG ATTTCTGGCC CGCCGCCGAG
CCGCGCACGC TGCCCGACGC ACAGCCCGCC GCGCTGCGCG ACGTGCGCGT GCCCGCCGCG
CTGACGGCCG CGCATCGCCC GGCCATCGAT GCGTTCCTCG ACGAGATGGC GCAGGCGGTC
GCGAACCGGC CGAAGTTCGC GTTCCATCGC GACGCGTTCG ACACGTGGTA CCGCGCACGG
CGCATCGACG CATGGCGCGA CTTCGTCGCA CGCTTCCCGC AGGGCGAGCA AGGGCTGACG
ACGCAAGCGC AATGGCGCGC GGTGATCGAC CGGATCGCCG ATCGCCGCGA TCCGTTCGCG
GCATTGCTCG CGCGCGTCGA TCGCGAATTC GAATCCGAGC GCGACGACAC GCTGCCGCCG
TGGCTACGGT TCGTGCGCAC GACATCGCAC ATGCTCGCCC CCGCGATGCT CGCGCCCGCG
CGCCTGCCCG CGGCGAGCGG CGGCCTGGGC GCCGCGCTCG GCTCGATCTC GCGCAGCGGC
GGACGCGCGC TGCGCGAAGC GCTCGGCGGC GCGCCCGAGC AAGGCAGGCT CACGCTCGAG
CGCGATGCCG CGCTGCGCGA CGCGCTTGTC GACTACGAGC GGCGCGTCGC CGCGCTCGCC
GCCGATGCGC TTGCGGGCCC CGGCGCCGCG TACCGGCTCG CGGCCGATTT CCACGGCTTC
GGCGTCGATC CGTCCGTCCA GGCTTCCGCG ATGCGCGCGG CCGACAGCGC GCTGCGCGAC
GTCAAGCGGC TCGCGGGCGA TCGCGACGTC GGCGGCGACG TCGTCTGGAG CCTCGTCGGC
GGCCCGCTGC GCGCGCTGAT CGCGTATGTC GAGCGGCAGG CGTCGTGCGC GCTGCAGGAC
GACTGGGAGC GCGACGTGCT GTGGCCGCTC AGGCGCGCGG CGACGCGCGA CGACGTGGAG
GGTCTGCTGT ACGGCCCGCA AGGCGCGATC TGGGCGTTCG TCGACGGCCC GGCGAAGCCG
TTCGTGCGTG TCGGCGCGGC GCGTGCATCG GCGCTCGACA CGCTCGGCTA CCGGCTGCCG
TTCACCGACG CGTTCCTGCC GCTCATCGAC GACGCGGCCG CCCGGCGCGT CGCGCAAGCG
CGGCGCGACG CCGAGCGGCG CGCGCAACAG CAGGCGGCGC TCGAGCTGGA CGAGCGGATC
GCATCGCTCG GCAAGCAGAT CGACGCGCTG CGCGCGCAAA CGGTGCGCTT CGGGATCGTC
GCGCAGCCGA CCGACGTGAA CCCGGATGCG CAGGCCAAGC CGTTCGAAAC CGTGCTGACG
CTGCAATGCG CGCCGCAGGC GCGCACGCTG ACGAACTACA ACCTGCGCGT GTCCGAGCAG
ATCGACTGGC AACCCGATCG ATGCGGCGAC GCGACGCTGC GCATCTCGCT CGGCGGCGTC
ACGCTCGTGC GCCGTTACGC GGGATCGCTC GGCGTCGCGC GCTTCGTCCA GGACTTCCGC
TACGGCGTGC GCCGCTTCAC GCCCCGCGAT TTCCCGGACG CGAAAGCGCA GCTTCAACGT
CTCGGCGTGC GCCACATCGA CGTGCGTTAC GACTTCTCCG GGCATGACGC GCTGCTCGCG
CACGTCGGTC GGATCGACGC GCTCGAACGC GCGCGCCGCG ACGACCTCGC GCGGCAGCGG
CGCGCCGCGA ACCGGCAGGA CGACGACGCG GGCGGCGCGA CGGCGATCGC GCGGGCGGCG
GCGTCCGTCG CGAATGCGCC GGGCGCCGCG CTGCCCCGCC GCATCGGCGT GTGTTGGGGG
GACCCGGCGC ACGACAGGCC CGATGGCGAC GGCGCGCAAC CGTGA
 
Protein sequence
MMIARIAALL ALAALSWASA LYFDWPLWCA PAVFCAALAG WLLCRVARRA LRAVRARAQL 
ARLDASERLP AADAPELRVA ARWRTALAAL GRAQPAPRLR GRPRDALPWY LVIGPEGAGK
TTALARARLV SPLRHARDDA AIAPTEDCDW WCFDDAVVLD LAGRFAAPGA TDDDRRAWHA
LLEQLGRARA RRGVNGVVVA IDAPRLIEAS RDALTIEGSA IRERLEQLIR LFDRRFPVYV
LVTQCDRLYG FDEWAAQLAP EQRERAFGYL GDHDADAFVA RALDSLDARL GALRVALAAR
GEPPSPRALM LPHELACLRP ALEAFARAAF GPNVYQETPY LRGLLFASGR QAGGAPSLTL
PDWLDAAPAR TPGDAGLFLH DVFARVLPGE RDAWLPVERP SRSRLVLRRL ALAAWLLASI
AAGLLMSASF VGDMRTVELV RRDYPAHPRF TGEFAHDTAT LERIGRVIAD VERRDERRLV
RPLAGATPVG RLEAELKRRY VAHYRRSIEP TADRLFFGEP DGASDVRAAD DARLAARIRN
LVRYVNLMQA RRRGADRETL AQMPAPAVAR ATDGGGARGV DSVGVSGVSG VSGVSGVSGV
SGVSGVSGGV GGVGGVGGAR AAEGAGRAVR ASGGRRAGDA RAARGMRGAA GNAGAGAEIA
PAADGVRDRD EGFARVSALA VDLIAWSAPD DRALAARVAA AQAELERLAY RDPDGAWLLA
LPDASMPRDV TLADFWPAAE PRTLPDAQPA ALRDVRVPAA LTAAHRPAID AFLDEMAQAV
ANRPKFAFHR DAFDTWYRAR RIDAWRDFVA RFPQGEQGLT TQAQWRAVID RIADRRDPFA
ALLARVDREF ESERDDTLPP WLRFVRTTSH MLAPAMLAPA RLPAASGGLG AALGSISRSG
GRALREALGG APEQGRLTLE RDAALRDALV DYERRVAALA ADALAGPGAA YRLAADFHGF
GVDPSVQASA MRAADSALRD VKRLAGDRDV GGDVVWSLVG GPLRALIAYV ERQASCALQD
DWERDVLWPL RRAATRDDVE GLLYGPQGAI WAFVDGPAKP FVRVGAARAS ALDTLGYRLP
FTDAFLPLID DAAARRVAQA RRDAERRAQQ QAALELDERI ASLGKQIDAL RAQTVRFGIV
AQPTDVNPDA QAKPFETVLT LQCAPQARTL TNYNLRVSEQ IDWQPDRCGD ATLRISLGGV
TLVRRYAGSL GVARFVQDFR YGVRRFTPRD FPDAKAQLQR LGVRHIDVRY DFSGHDALLA
HVGRIDALER ARRDDLARQR RAANRQDDDA GGATAIARAA ASVANAPGAA LPRRIGVCWG
DPAHDRPDGD GAQP