Gene BURPS1710b_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2106 
Symbol 
ID3689317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2300842 
End bp2304021 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content62% 
IMG OID637728562 
Producttransposase 
Protein accessionYP_333501 
Protein GI76811057 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.818957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGAGC TCCGTTCGGA TGGGGGGATC GGAAGAATAC GGGGGATGAA CAAGGTAAGC 
ACAAATTCCT ATCATAAGCC TTGCAGTTAT GAAGATTACG CTGATCAATT ACGAATTGAA
GTAGTAACAT CAGACTTTTC TTGCGCTAAT CTGAGCGGCC CTCTGCGATC CGGACGATCC
GCAATGGCTA CCCTCGAACG CACCGCGTAC CCGCGCTTCC CCGAGGTTCT GGCACCCCGG
GAACTGCAAG CCTGCTACAC GCCGCTGTCC GACGAACTGG AATGGGCGCG CCGGTCGACG
CGGGGCGAGC GACCTCGATT GGGCCTGCTG GTGCTGCTGA AGGTCTTTCA GCAACTGCAC
TATTTCCCGC CGATCGACAC GATTTCGGTC GCCGTCATCG ATCATGTCCG GGCGGCCTCG
AACATTGCCG ACACCGTGCG ATTCGGCTAC GACACAGCGT CGTCCCCGAC ACTCTTCCGA
CATTACGCCA CGGTACGCGC CTGGTTGAAC GTGAAGCCGT ATTACGGCAC CGACGCAAAC
GCGATCGCGA CACGCGCTGC GCACACGGCG TCGCTGACGA TGGATCAGCC TGTCGACATC
ATCAACGCGA CGATCGACGA GCTGATCGCG CGCGATGTCG AGCTGCCGGC GTTCTCAACG
CTGGACCGGC TGACCGAGCA GATCCACGCC CGAGCGCAGT CCCGACTCTT CAAGCGCGTC
ACGCGACGCC TGACCGATGC GCAGAAACTG GCGCTCGACC GGCTGCTCGC ACGCGACCTG
TCGAGCCGGC AGACTGCCTT CAACCGAATC AAGCGTCATG CGAAACGCCC ATCGCGCCAG
CATCTGGACA TGCTGATCGA CCAGATTGCC TGGCTCGACG AGATCGGCGA TTTTTCACTG
GCCGTCGCAG GCATTCCCGC CTCAAAGCTG CGCTCGCTCG CGAATCAGGC GATGGCGCTC
GATGCGTCGG CGCTCAGGAA CGACACGCAA CCCGAGAAGC GCTACACATT GATCGTCGCG
CTGCTCAACC GCATGCGCGT GCGCGCCCGC GACGATCTCG CCGACATGTT CGTACGGCGA
ATGGGCGCCA TTCACAAGCG CGCGAGCGAC GAGCTGGACG TGATCCAGCG CAAACAGCGT
GATCAGGTCG AGGATCTCGT GGTCTTGTTG GACGGCGTAG CCGACATCCT CGCCGACGAA
TCCGACGAGA CGGCCATCGC CAGGGCGGTC AGGAAGCTGC TCGCGCCGAA GGGCGATCTG
GAGCCGTTGC GCGAGAGCTG CGCGGCGATC CGCGCGTTCA GCGGCCGCAA CTATCTGCCG
TTGCTCTGGA AGCACTTCAA GGCACACCGC TCGGTCATGA TGCGTGTTGC CCGTACGCTC
GAGTGGGATT CGACCAGCCC GGTTCGCTCG CTGCTCGATG CGCTCGACGT CGTGCTGGAA
AACGAGTCGC GGTACCGCGA GTGGATCGAT GCCGACGTCG ATCTGAGCTT CGCCGCACCG
CGGTGGCGTA AGCTCGCGCG GCGCTCGCAC GGCGATGGAT CGCCGACCAA TCGACGGTAT
CTCGAACTCT GCGTGTTCTC GTACATGGCT GAAGAGCTAC GGGTCGGCGA TCTGTGCGTA
TCCGGCTCTG ACGCCTACGC CGACTATCGC GATCACTTGC TGCCGTGGCG GCAATGCGAG
CAGCAGTTGC CGAACTATTG CGACAAGCTC GGCATGCCCA CGACCGGCAG TGACTTCGTT
CAGCATCTCC GTCAATGGCT GACGGACGCC ACCCGCAAGC TCGACGACGA ATTGCCACGC
AAGAGAGAGC ACGTTGTGAT CGATCGTCAG GGCGAACCGA TCCTGCGCAA AACCATTGCG
AAGGAGATCC CCGCAAGCGC GATCGCATTG CAGGAGCGGC TGACTGCCCG ATTGCCGACA
CGGAACATGC TGGACATCCT CGCGAACATT GAACACTGGA CGCATTTCAC GCGGCACTTC
GGGCCGCTGT CGGGCAGCGA TCCCCAGATT CGCAAAGCCG CCGAGCGCTA CCTGCTGACC
ATTTTCGCGA TGGGCTGCAA TCTCGGCTCG ACCCAGGCGG CCCGCCATCT CGACTCCGAG
GTCACCGCGC ACATGCTCTC GTTCGTCAAC CGTCGGCATG TGAGCCTGGA CAAGCTCGAG
ACCGCGCAGC GCGCGCTGAT CGAACTGTAC CTGCGGTTGG ACCTACCCAA ACACTGGGGC
GATGGCAAGA CGGTAGCAGC CGACGGCACG CAGTACGACT TCTACGACAA CAACCTGCTG
GCCGGCTATC ACTTCCGCTA TCGGAAAATG GGCGCCGTTG CATACCGGCA CGTCGCCGAC
AACTATATCG CGGTGTTCCA GCACTTCATT CCGCCGGGCG TCTGGGAAGC GATCTATGTG
ATCGAGGGGC TGCTCAAAGC GGGCCTGTCG GTCAAAGCCG ATACCGTCCA CGCTGACACG
CAGGGTCAGT CGGCTGCGGT GTTTGCGTTC ACGTACCTGC TCGGCATTAA CTTGATGCCA
CGGATTCGCA ACTGGAAGGA CCTCGTCCTG TACCGGCCGG ACAGCAAGGC CAAATACAAG
CACATCGACA AGCTGTTCAC GGCGACCATC GACTGGGGGT TGATCGAGCA GCACTGGCAG
GAGTTGATGC AGGTCGCGCT GTCGATTCAG GCAGGCACGA TCTCGTCGCC GTTGCTGCTG
CGCCGGCTCG GCTCAGAAAG CCGCAAGAAC CGGCTGTACC TGGCCGCGCG CGAGCTCGGC
AACGTCGTTC GCACGGTGTT CCTGCTCGAG TGGATCGGTA GCCTCGAATT GCGGCAGGAT
GTCACCTCGA ACACGAACAA GATCGAGTCG TACAATGGCT TCTCCAAATG GCTGTCGTTC
GGTGGCGATG TGATCGCAGA GAACGAGCCG GAGGAGCAAC AGAAACGGCT ACGCTACAAC
GACCTGGTCG CGTCGGCTGT GATTTTGCAG AACACGGTGG ACATGATGCA GGCGCTGCGC
GAGATGGCGG CAAACGGCGA GAAAGTGCGC GCCGAAGACA TCGAGTTTCT GAGCCCGTAC
CCGACCCACA ACATTCGGCG CTTCGGCCAC TATAAACTGC ACCTGAATCG TCGGCCGGAA
GCCTGGATCA AGGACCCGCT GTTCGGCCAT GCTGCCCGTA CCTACGCATC CCGCCCATGA
 
Protein sequence
MLELRSDGGI GRIRGMNKVS TNSYHKPCSY EDYADQLRIE VVTSDFSCAN LSGPLRSGRS 
AMATLERTAY PRFPEVLAPR ELQACYTPLS DELEWARRST RGERPRLGLL VLLKVFQQLH
YFPPIDTISV AVIDHVRAAS NIADTVRFGY DTASSPTLFR HYATVRAWLN VKPYYGTDAN
AIATRAAHTA SLTMDQPVDI INATIDELIA RDVELPAFST LDRLTEQIHA RAQSRLFKRV
TRRLTDAQKL ALDRLLARDL SSRQTAFNRI KRHAKRPSRQ HLDMLIDQIA WLDEIGDFSL
AVAGIPASKL RSLANQAMAL DASALRNDTQ PEKRYTLIVA LLNRMRVRAR DDLADMFVRR
MGAIHKRASD ELDVIQRKQR DQVEDLVVLL DGVADILADE SDETAIARAV RKLLAPKGDL
EPLRESCAAI RAFSGRNYLP LLWKHFKAHR SVMMRVARTL EWDSTSPVRS LLDALDVVLE
NESRYREWID ADVDLSFAAP RWRKLARRSH GDGSPTNRRY LELCVFSYMA EELRVGDLCV
SGSDAYADYR DHLLPWRQCE QQLPNYCDKL GMPTTGSDFV QHLRQWLTDA TRKLDDELPR
KREHVVIDRQ GEPILRKTIA KEIPASAIAL QERLTARLPT RNMLDILANI EHWTHFTRHF
GPLSGSDPQI RKAAERYLLT IFAMGCNLGS TQAARHLDSE VTAHMLSFVN RRHVSLDKLE
TAQRALIELY LRLDLPKHWG DGKTVAADGT QYDFYDNNLL AGYHFRYRKM GAVAYRHVAD
NYIAVFQHFI PPGVWEAIYV IEGLLKAGLS VKADTVHADT QGQSAAVFAF TYLLGINLMP
RIRNWKDLVL YRPDSKAKYK HIDKLFTATI DWGLIEQHWQ ELMQVALSIQ AGTISSPLLL
RRLGSESRKN RLYLAARELG NVVRTVFLLE WIGSLELRQD VTSNTNKIES YNGFSKWLSF
GGDVIAENEP EEQQKRLRYN DLVASAVILQ NTVDMMQALR EMAANGEKVR AEDIEFLSPY
PTHNIRRFGH YKLHLNRRPE AWIKDPLFGH AARTYASRP