Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_2106 |
Symbol | |
ID | 3689317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 2300842 |
End bp | 2304021 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637728562 |
Product | transposase |
Protein accession | YP_333501 |
Protein GI | 76811057 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.818957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGAGC TCCGTTCGGA TGGGGGGATC GGAAGAATAC GGGGGATGAA CAAGGTAAGC ACAAATTCCT ATCATAAGCC TTGCAGTTAT GAAGATTACG CTGATCAATT ACGAATTGAA GTAGTAACAT CAGACTTTTC TTGCGCTAAT CTGAGCGGCC CTCTGCGATC CGGACGATCC GCAATGGCTA CCCTCGAACG CACCGCGTAC CCGCGCTTCC CCGAGGTTCT GGCACCCCGG GAACTGCAAG CCTGCTACAC GCCGCTGTCC GACGAACTGG AATGGGCGCG CCGGTCGACG CGGGGCGAGC GACCTCGATT GGGCCTGCTG GTGCTGCTGA AGGTCTTTCA GCAACTGCAC TATTTCCCGC CGATCGACAC GATTTCGGTC GCCGTCATCG ATCATGTCCG GGCGGCCTCG AACATTGCCG ACACCGTGCG ATTCGGCTAC GACACAGCGT CGTCCCCGAC ACTCTTCCGA CATTACGCCA CGGTACGCGC CTGGTTGAAC GTGAAGCCGT ATTACGGCAC CGACGCAAAC GCGATCGCGA CACGCGCTGC GCACACGGCG TCGCTGACGA TGGATCAGCC TGTCGACATC ATCAACGCGA CGATCGACGA GCTGATCGCG CGCGATGTCG AGCTGCCGGC GTTCTCAACG CTGGACCGGC TGACCGAGCA GATCCACGCC CGAGCGCAGT CCCGACTCTT CAAGCGCGTC ACGCGACGCC TGACCGATGC GCAGAAACTG GCGCTCGACC GGCTGCTCGC ACGCGACCTG TCGAGCCGGC AGACTGCCTT CAACCGAATC AAGCGTCATG CGAAACGCCC ATCGCGCCAG CATCTGGACA TGCTGATCGA CCAGATTGCC TGGCTCGACG AGATCGGCGA TTTTTCACTG GCCGTCGCAG GCATTCCCGC CTCAAAGCTG CGCTCGCTCG CGAATCAGGC GATGGCGCTC GATGCGTCGG CGCTCAGGAA CGACACGCAA CCCGAGAAGC GCTACACATT GATCGTCGCG CTGCTCAACC GCATGCGCGT GCGCGCCCGC GACGATCTCG CCGACATGTT CGTACGGCGA ATGGGCGCCA TTCACAAGCG CGCGAGCGAC GAGCTGGACG TGATCCAGCG CAAACAGCGT GATCAGGTCG AGGATCTCGT GGTCTTGTTG GACGGCGTAG CCGACATCCT CGCCGACGAA TCCGACGAGA CGGCCATCGC CAGGGCGGTC AGGAAGCTGC TCGCGCCGAA GGGCGATCTG GAGCCGTTGC GCGAGAGCTG CGCGGCGATC CGCGCGTTCA GCGGCCGCAA CTATCTGCCG TTGCTCTGGA AGCACTTCAA GGCACACCGC TCGGTCATGA TGCGTGTTGC CCGTACGCTC GAGTGGGATT CGACCAGCCC GGTTCGCTCG CTGCTCGATG CGCTCGACGT CGTGCTGGAA AACGAGTCGC GGTACCGCGA GTGGATCGAT GCCGACGTCG ATCTGAGCTT CGCCGCACCG CGGTGGCGTA AGCTCGCGCG GCGCTCGCAC GGCGATGGAT CGCCGACCAA TCGACGGTAT CTCGAACTCT GCGTGTTCTC GTACATGGCT GAAGAGCTAC GGGTCGGCGA TCTGTGCGTA TCCGGCTCTG ACGCCTACGC CGACTATCGC GATCACTTGC TGCCGTGGCG GCAATGCGAG CAGCAGTTGC CGAACTATTG CGACAAGCTC GGCATGCCCA CGACCGGCAG TGACTTCGTT CAGCATCTCC GTCAATGGCT GACGGACGCC ACCCGCAAGC TCGACGACGA ATTGCCACGC AAGAGAGAGC ACGTTGTGAT CGATCGTCAG GGCGAACCGA TCCTGCGCAA AACCATTGCG AAGGAGATCC CCGCAAGCGC GATCGCATTG CAGGAGCGGC TGACTGCCCG ATTGCCGACA CGGAACATGC TGGACATCCT CGCGAACATT GAACACTGGA CGCATTTCAC GCGGCACTTC GGGCCGCTGT CGGGCAGCGA TCCCCAGATT CGCAAAGCCG CCGAGCGCTA CCTGCTGACC ATTTTCGCGA TGGGCTGCAA TCTCGGCTCG ACCCAGGCGG CCCGCCATCT CGACTCCGAG GTCACCGCGC ACATGCTCTC GTTCGTCAAC CGTCGGCATG TGAGCCTGGA CAAGCTCGAG ACCGCGCAGC GCGCGCTGAT CGAACTGTAC CTGCGGTTGG ACCTACCCAA ACACTGGGGC GATGGCAAGA CGGTAGCAGC CGACGGCACG CAGTACGACT TCTACGACAA CAACCTGCTG GCCGGCTATC ACTTCCGCTA TCGGAAAATG GGCGCCGTTG CATACCGGCA CGTCGCCGAC AACTATATCG CGGTGTTCCA GCACTTCATT CCGCCGGGCG TCTGGGAAGC GATCTATGTG ATCGAGGGGC TGCTCAAAGC GGGCCTGTCG GTCAAAGCCG ATACCGTCCA CGCTGACACG CAGGGTCAGT CGGCTGCGGT GTTTGCGTTC ACGTACCTGC TCGGCATTAA CTTGATGCCA CGGATTCGCA ACTGGAAGGA CCTCGTCCTG TACCGGCCGG ACAGCAAGGC CAAATACAAG CACATCGACA AGCTGTTCAC GGCGACCATC GACTGGGGGT TGATCGAGCA GCACTGGCAG GAGTTGATGC AGGTCGCGCT GTCGATTCAG GCAGGCACGA TCTCGTCGCC GTTGCTGCTG CGCCGGCTCG GCTCAGAAAG CCGCAAGAAC CGGCTGTACC TGGCCGCGCG CGAGCTCGGC AACGTCGTTC GCACGGTGTT CCTGCTCGAG TGGATCGGTA GCCTCGAATT GCGGCAGGAT GTCACCTCGA ACACGAACAA GATCGAGTCG TACAATGGCT TCTCCAAATG GCTGTCGTTC GGTGGCGATG TGATCGCAGA GAACGAGCCG GAGGAGCAAC AGAAACGGCT ACGCTACAAC GACCTGGTCG CGTCGGCTGT GATTTTGCAG AACACGGTGG ACATGATGCA GGCGCTGCGC GAGATGGCGG CAAACGGCGA GAAAGTGCGC GCCGAAGACA TCGAGTTTCT GAGCCCGTAC CCGACCCACA ACATTCGGCG CTTCGGCCAC TATAAACTGC ACCTGAATCG TCGGCCGGAA GCCTGGATCA AGGACCCGCT GTTCGGCCAT GCTGCCCGTA CCTACGCATC CCGCCCATGA
|
Protein sequence | MLELRSDGGI GRIRGMNKVS TNSYHKPCSY EDYADQLRIE VVTSDFSCAN LSGPLRSGRS AMATLERTAY PRFPEVLAPR ELQACYTPLS DELEWARRST RGERPRLGLL VLLKVFQQLH YFPPIDTISV AVIDHVRAAS NIADTVRFGY DTASSPTLFR HYATVRAWLN VKPYYGTDAN AIATRAAHTA SLTMDQPVDI INATIDELIA RDVELPAFST LDRLTEQIHA RAQSRLFKRV TRRLTDAQKL ALDRLLARDL SSRQTAFNRI KRHAKRPSRQ HLDMLIDQIA WLDEIGDFSL AVAGIPASKL RSLANQAMAL DASALRNDTQ PEKRYTLIVA LLNRMRVRAR DDLADMFVRR MGAIHKRASD ELDVIQRKQR DQVEDLVVLL DGVADILADE SDETAIARAV RKLLAPKGDL EPLRESCAAI RAFSGRNYLP LLWKHFKAHR SVMMRVARTL EWDSTSPVRS LLDALDVVLE NESRYREWID ADVDLSFAAP RWRKLARRSH GDGSPTNRRY LELCVFSYMA EELRVGDLCV SGSDAYADYR DHLLPWRQCE QQLPNYCDKL GMPTTGSDFV QHLRQWLTDA TRKLDDELPR KREHVVIDRQ GEPILRKTIA KEIPASAIAL QERLTARLPT RNMLDILANI EHWTHFTRHF GPLSGSDPQI RKAAERYLLT IFAMGCNLGS TQAARHLDSE VTAHMLSFVN RRHVSLDKLE TAQRALIELY LRLDLPKHWG DGKTVAADGT QYDFYDNNLL AGYHFRYRKM GAVAYRHVAD NYIAVFQHFI PPGVWEAIYV IEGLLKAGLS VKADTVHADT QGQSAAVFAF TYLLGINLMP RIRNWKDLVL YRPDSKAKYK HIDKLFTATI DWGLIEQHWQ ELMQVALSIQ AGTISSPLLL RRLGSESRKN RLYLAARELG NVVRTVFLLE WIGSLELRQD VTSNTNKIES YNGFSKWLSF GGDVIAENEP EEQQKRLRYN DLVASAVILQ NTVDMMQALR EMAANGEKVR AEDIEFLSPY PTHNIRRFGH YKLHLNRRPE AWIKDPLFGH AARTYASRP
|
| |