Gene BURPS668_3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3047 
Symbol 
ID4882292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2991895 
End bp2993040 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content56% 
IMG OID640128975 
Productphage integrase 
Protein accessionYP_001060060 
Protein GI126441280 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.182322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTCA TCCGTCGAAA CAACAGCAAG AACTGGTACT ACCAGTTCCA GATTCAGGGC 
AAGAAGTTCT TCGGTTCAAC GGGGACGCCG AACAAGACGA AGGCCGCTCA GGTCGAACGC
GAGATGCGCA ATCGAGCCCA CGCCGAGCAG TACCTCGGGG ATGCGCAGCC CATTACGCTC
AAAGATGCGC TGGAACAATA TGTCGAGGCA CGAAGCGACA CGTCCTATCA CAGGAACATG
TGCTCGATCG TTCGAAAGAC GCTCGGCTAT AAGCTTCATC CGCGGACGAA GGCCAAGCTC
CCCTGCTATG GCATGTTGCC TGATACGCTG CTGCATGAAC TGACGACGCG CGATTTCGAT
CTTCTCGCTG CGAAGCGCAA GGCCGAAGGC GACAAACCAG CGACGATCAA GCATGAGATC
GGTCTCCTTC GAGCGACCAT CAACGAGATG GCGAAGCTCG GATTCAAAGT CAGTCGTGAG
ATCGTTTTTC CAGAGCTTAG GACTTCGTAT AGGCTTCGAT ATCTGGATTC TAACGACGAG
TCGGCGTTAT TGCGCGAACT CGATCCGGAA CGGCTTAGAG CGCGTATAAA CGCGTCTAAA
CCGCAAACAC CTGAAATGAC GCGAAATATG CAGGATAACT ACGATATCAC GGTTTTCCTG
CTCGATACGG GGTGCCGTTA CTCGGAGGTC GCGAACATTC CGTGGTCGGC GATCAACCTC
GACACGTGCA CGATCAGCCT CTATCGGAGC AAGGTGCGAA ATGAAGATGT GCTGCACATG
ACCTCCCGAC TCGAAGCGAT CCTCCGTCGG CGGTGGGAAG AACGCAGGAC CGGACAGCGA
TACGTATTCG AAGACCGCAC CGGCAACGAG CGAGGCTACA GCACGAAGTC GATCAAGAAG
GCGATCGAGC GCGCTGGCCT CAACGATCCC GTTCTTGTAA AAGAACGCGG TGGGCGCGTC
ACGCTGCATA CGCTGCGACA TACGTTCGCG AGTAAGCTAG TTAAGGCGGG CGTCAGTCTC
TATGAGGTAT CGGTCCTGCT CGGCCACAGC GATCCGAAAA TGACGCAGCG CTATGCCCAC
CTGAGCCCGA ACGACGCGAG CCGGAAGGCC GTCAAGGTCA TCGATTCGCT ACTGCAACCC
TCCTGA
 
Protein sequence
MTLIRRNNSK NWYYQFQIQG KKFFGSTGTP NKTKAAQVER EMRNRAHAEQ YLGDAQPITL 
KDALEQYVEA RSDTSYHRNM CSIVRKTLGY KLHPRTKAKL PCYGMLPDTL LHELTTRDFD
LLAAKRKAEG DKPATIKHEI GLLRATINEM AKLGFKVSRE IVFPELRTSY RLRYLDSNDE
SALLRELDPE RLRARINASK PQTPEMTRNM QDNYDITVFL LDTGCRYSEV ANIPWSAINL
DTCTISLYRS KVRNEDVLHM TSRLEAILRR RWEERRTGQR YVFEDRTGNE RGYSTKSIKK
AIERAGLNDP VLVKERGGRV TLHTLRHTFA SKLVKAGVSL YEVSVLLGHS DPKMTQRYAH
LSPNDASRKA VKVIDSLLQP S