Gene BURPS1106A_0987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0987 
Symbol 
ID4902522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp968418 
End bp969521 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content62% 
IMG OID640134217 
Producttransposase, IS4 
Protein accessionYP_001065268 
Protein GI126452273 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGGAA TGGACGAGAT GCAAGAACCG CTGTTCACGA CGGTGAAGCT GGAAGACTTC 
GTGCCGGCCG ATCACCCGTT GCGGCCGCTT CGGCTGCTGG TCAATCAAGC GCTGAAGCGG
CTCAATGGGC TGTTTGGCAC GATCTACGCG GATAGCGGCC GTGCGTCGAT TGCGCCGGAG
AAACTGCTGC GTGCATTGCT GCTGCAGGTG CTGTACTCGG TGCGCAGCGA GCGCATGTTG
ATGGAACAGA TGCGCTACAA CCTGCTGTTC CGTTGGTTCG TCGGACTGGC GATCGAAGAC
GCCGTCTGGG ACCACTCGGT GTTCTCCAAG AACCGTGATC GTCTGCTCGA GCACGAAGTG
GTCGAAGCGT TCTTTACCGA AGTCATGAGC TTGGCCGACA AGCAAGGGCT GCTGTCCAGA
GAACACTTCT CGGTCGATGG CACGCTGATC CAGGCTTGGG CCAGCCACAA GAGCTTCCGG
CCCAAGGACG GCTCGGACGA TCCACCAGCC GGCGGTGGTC GCAATGTCGA CACCGACTGG
AAGGGCAAGC GGCGCAGCAA CGATACGCAT GAATCGAGCA CCGATCCGGA CGCGCGCCTG
TTCCGCAAGG GGCGGCAAAG CGGAGCCATC CTTTGTTATC AGGGGCACAT CCTGATGGAG
AACCGCTCGG GCTTGGTGGT CGGCGCGGTG GTCAGCCACG CTGACGGTTT CGGCGAACGC
GCAAGTGCGT TGCGCTTGCT CGATTGCGTG CCGGGCCGTC ACGCCAAGAC GCTCGGCGCC
GACAAGGGTT ACGACATGCG CGACTTTGTG CGGGACTGTC GTGCGCGCAA GGTGACGCCG
CATGTCGCGC GCAACGACGC GCATCAAGGC GGCAGCGCGA TCGATGGCCG CACCTCGCGG
CACGCCGGTT ATGGCATCAG CCAGGTGATT CGCAAGCGCA TCGAAGAGCA CTTCGGCTGG
GGCAAGACCG TCGGTCGGAT TCGACAGACC GCGTATCGCG GCATCAAGCG AGTCGACCAG
CACTTCAAAC TGACGATGCT GGCGAGCAAC CTGACTCGAA TGGCTCGAAT ACTGGCAGCG
GTGCCGCAAG GAGCGGCACG ATGA
 
Protein sequence
MRGMDEMQEP LFTTVKLEDF VPADHPLRPL RLLVNQALKR LNGLFGTIYA DSGRASIAPE 
KLLRALLLQV LYSVRSERML MEQMRYNLLF RWFVGLAIED AVWDHSVFSK NRDRLLEHEV
VEAFFTEVMS LADKQGLLSR EHFSVDGTLI QAWASHKSFR PKDGSDDPPA GGGRNVDTDW
KGKRRSNDTH ESSTDPDARL FRKGRQSGAI LCYQGHILME NRSGLVVGAV VSHADGFGER
ASALRLLDCV PGRHAKTLGA DKGYDMRDFV RDCRARKVTP HVARNDAHQG GSAIDGRTSR
HAGYGISQVI RKRIEEHFGW GKTVGRIRQT AYRGIKRVDQ HFKLTMLASN LTRMARILAA
VPQGAAR