Gene BURPS1106A_A3044 details


Gene Information

Locus tag: BURPS1106A_A3044
Symbol:
ID: 4903723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2960076
End bp: 2961236
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 62%
IMG OID: 640146147
Product: ISBma1, transposase
Protein accession: YP_001077073
Protein GI: 126456410
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
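
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: the function name `check_cds_record` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return a list of inconsistencies (empty if the fields agree).

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive
    - the gene length counts the stop codon, which is not translated
    """
    problems = []

    # An inclusive interval [start, end] covers end - start + 1 positions.
    span = end_bp - start_bp + 1
    if span != gene_length_bp:
        problems.append(f"coordinate span {span} != gene length {gene_length_bp}")

    # A complete CDS is a whole number of codons; the last codon is the stop.
    if gene_length_bp % 3 != 0:
        problems.append(f"gene length {gene_length_bp} is not a multiple of 3")
    elif gene_length_bp // 3 - 1 != protein_length_aa:
        problems.append(
            f"{gene_length_bp // 3 - 1} translated codons != "
            f"protein length {protein_length_aa}"
        )
    return problems


# Values taken from the record above.
print(check_cds_record(2960076, 2961236, 1161, 386))  # prints []
```

Under these assumptions the fields agree: 2961236 - 2960076 + 1 = 1161 bp, and 1161 / 3 = 387 codons, of which 386 are translated.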


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.932011
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGTGGC CGCAGGGCGA TAGCCGCACG CTGTCGCTCT ACCTGAAGCC GGTCAGTCAG 
ATCATGTACT GCGAGCAATG CGGTGCGCGT TGCCAGCAGA TTCATGAAAC GACCGTACGG
CGGGTACGTG ATCTGCCGTT GTTCGAGTAC CGGGTGGTGC TGCACGTGCC TCGACGCCGA
GTCTGGTGCG AACGCTGCGG CGCAGCGCGG CTGGAGAAGC TGGACTGGCT GGGCCGCTAC
CAGCGGGTGA CGGAGCGGTT TGCCAAGGCC TGCGAGAAGT TGCTGCAGGC CGCCAGCGTA
CAGGCCGTGG CGGCCTTCTA CGAACTGGGC TGGCACACGG TCAAATCGAT CGACAAGATG
CGCTTGCGCG CGCGCGTGGC CGAACCGGAC TGGTCGACGA TCCGTTATCT GGCGATGGAC
GAGTTCGCGC TCCATAAAGG CCATCGCTAC GCCACGGTGG TGGTTGATCC GATCGGCCGA
CAGGTCCTCT GGGTTGGGCC CGGACGGTCA CGCGAGACGG CGCGCGCCTT CTTCGAACAA
CTCCCCGAAG GCGTGGCCGA GCGCATCGAA GCGGTCGCAA TCGACATGAC CACGGCCTAT
GAGCTGGAGA TCAAGGAACA GTGCCCGCAG GCGGAAATCG TCTTTGACCT GTACCACGTC
GTGGCCAAGT ACGGTCGCGA GGTGATCGAT CGGGTACGGG TGGATCAGGC CAACCAACTG
CGACATGACA AGCCGGCCCG CAAGGTTCTG AAGTCCAGTC GCTGGTTGCT GCTGCGCAAC
CGTCATAACC TGAAGCCAGA ACAGGCCGTG CATCTGAAGG AACTGCTGGC GGCCAATCAG
TCGCTGTTAT GCGTCTATGT GCTGCGCGAC GAGCTCAAAC GGCTCTGGTT CTACCGCAAG
CCGGCCTGGG CGGAAAAGGC TTGGGGGCAA TGGTTCGAAC AGGCTCAGCA AAGCGGGATC
GCCGCCTTGC AAAAGTTCGC CCAGCGCTTG CAGGGTTACT GGCACGGAAT CGTGGCCCGC
TGCCGCCATC CGCTCAATAC CAGCGTCGTC GAAGGCATCA ACAACACGAT CAAGGTCATC
AAGCGCCGAG CTTACGGGTA CCGCGACGAG CAATACTTCT TCCTCAAGAT CCGCGCCGCG
TTCCCCGGGA TTCAGCGATG A
 
Protein sequence
MEWPQGDSRT LSLYLKPVSQ IMYCEQCGAR CQQIHETTVR RVRDLPLFEY RVVLHVPRRR 
VWCERCGAAR LEKLDWLGRY QRVTERFAKA CEKLLQAASV QAVAAFYELG WHTVKSIDKM
RLRARVAEPD WSTIRYLAMD EFALHKGHRY ATVVVDPIGR QVLWVGPGRS RETARAFFEQ
LPEGVAERIE AVAIDMTTAY ELEIKEQCPQ AEIVFDLYHV VAKYGREVID RVRVDQANQL
RHDKPARKVL KSSRWLLLRN RHNLKPEQAV HLKELLAANQ SLLCVYVLRD ELKRLWFYRK
PAWAEKAWGQ WFEQAQQSGI AALQKFAQRL QGYWHGIVAR CRHPLNTSVV EGINNTIKVI
KRRAYGYRDE QYFFLKIRAA FPGIQR