Gene BURPS668_0046 details

Gene Information

Locus tag: BURPS668_0046
Symbol:
ID: 4884731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 42061
End bp: 44076
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 68%
IMG OID: 640125974
Product: type III DNA modification methyltransferase
Protein accession: YP_001057101
Protein GI: 126440391
COG category: [L] Replication, recombination and repair
COG ID: [COG2189] Adenine specific DNA methylase Mod
TIGRFAM ID:
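
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 42061
end_bp = 44076

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2016
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 671
```

Under these assumptions the computed values agree with the reported gene length of 2016 bp and protein length of 671 aa.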


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCAAA AACTCGATGC GGCGAGCCCG GAGGCGCAAT CCGCGGATCT CGTGGCCGCC 
AACGTCGAGC GCCTGAAGGC GCTCTTTCCG GACGTGGTGA CCGAAGGGCC GGACGGCGCG
TCGGTGAATC TCGACGCGCT CGCGGCGCTG GTGGGCGCGA GCGCGGCGGC CGCGGCCGAC
GCCGACGAGA AGTACGGCCT GAACTGGCAC GGCAAGCGGC GCGCGCGCCG GCTCGCGCTC
ACGCCGTCGA CGGGCACGCT GCGCCCGTGC CCGCGCGAGA GCGCCGGCTG GGCGTCGACG
CGCAACCTGA TGATCGAGGG CGAGAACCTC GAGGTGCTGA AGCTGCTGCA GAAGAGCTAC
GCGGGGCGCG TGAAGCTCGT CTACATCGAT CCGCCGTACA ACACCGGCAA GGATTTCGTC
TATCCGGACA ATTTCACCGA CAGCCTGCGC CATTATCTCG AGCTGACCGG CCAGACGACG
GGCGGCAAGC GGGTCACCAG CCACACCGAC GCGAGCGGGC GCTTCCACAC CGACTGGCTG
AACATGATCT ACCCGCGCCT GAAGCTCGCG CGCGATCTGC TCACCGAGGA CGGCGTGATC
GCCGTGCACA TCGACGAGCA CGAACAGCAC GCGCTCGTGC TCGTGATGCG CGAGATCTTC
GGCGAAGACA ACGAGCTCGG CGTCGCGGTG TGGGACAAGC GCAATCCGAA GGGCGATGCG
CGCGGGATCG CGTACCAGCA CGAATCGATC GTGCTGTTCG CGCGCGACGC TGAACGGCTG
TTCGAGCGTG CGCCGCTCAA GCGCCCGAAA CGCAACGCGC AGCGCATGCT GGACGCGGCG
CGCGAGGCGG TCGCCGGCGC GGCGACGATC GCGGACGCGA ACGCCGCGTA CCGCGGCTGG
GTGAAGTCTC AGACGACGCT GTCGGGCGGC GAGGCGATGT ACGACCGAAT CTCCGCCGAC
GGGCGCGTGT ACCGCCTCGT GTCGATGGCG TGGCCGAACA AGAAGAAGGC GCCCGACGAC
TACTTCGTGC CGCTCGTGCA TCCGGTGACG GGCAAGCCGT GCCCCGTGCC CGAGCGCGGC
TGGCGCAACC CGCCCGCGAC GATGCGCGCG CTCATCGACA AGGGCCTCGT CGAATTCGGC
GCGGACGAGA CCACGCAGCC GCAGCGGATC TATTTCCTCG ACGAGAACAT GTACGAGAAC
GTGCCTTCGG TGCTGCCGTT CGGCGGCTCG GACGACGCGC TGATGAAGTC GCTCGGCATT
CCTTTCGATC AGCCCAAGCC CGTCGAATTC GCCGCGTCGA TCATCGGCTG GTGCACCGAC
GGCGACGATC TGATCGTCGA CTTCTTCGGC GGCTCCGGCA CGACCGCGCA CGCGGTGATG
GCGCTGAACG CGGCCGACGG CGGCCATCGC CGCTACGTGC TCGTGCAACT GCCCGAGCCG
CTCGACGCCG ACAGCAAGGA CCAGAAGGCC GCCGCCGATT TCTGCGCGGC GCAGCGCGTG
CCGCTCAATC TCGCCGAGCT GACGAAGGAG CGGCTGCGGC GCGCGGCGGC GCGCATCGCG
GCCGAGCATC CGGGCACGCG GGCGGATCTC GGTTTTCGCG TGTTCAGGCT CGATTCGACG
AACGTCTCCG AATGGGACCC GCGCGGCGAC GACATCCAGC AGTCGCTGTT CGCGGCCGTC
GAGCACATCA AGCCGAACCG CTCCGAGGAA GATCTGCTGT ACGAACTGAT GCTCAAGCTC
GGCCTCGATC TGTGCGCGCC GATCGACGCA CGCATGATCG CCGGCAAGGC GGTCTACGTG
ATCGACGGTG CGATCGTCGC GTGCTTCGAT GCGCATATCG ACCGCGCGTC GACCGACGCG
CTCGGCGAGG GCATCGTCGG GCTGATCGCC GAAGCGGCCG ACGCGCGCGA GGTGACCTGC
GTGTTCCGCG ACAGCGGCTT CGCGGACGAC GTCGCGAAGG TGAACCTGTC GGCGATTCTC
GAGCAGCACG GCGTGAAGCG CATCCGCAGC CTCTGA
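
A GC content figure like the one reported for this gene can be recomputed from a sequence in the space-separated layout used here. This is a minimal sketch; the function name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 2 G + 1 C out of 10 bases.
print(gc_content("ATGATGCAAA"))  # 0.3
```

To reproduce the record's value, the full gene sequence would be passed in as one string; the block-wise spacing does not need to be removed first.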
 
Protein sequence
MMQKLDAASP EAQSADLVAA NVERLKALFP DVVTEGPDGA SVNLDALAAL VGASAAAAAD 
ADEKYGLNWH GKRRARRLAL TPSTGTLRPC PRESAGWAST RNLMIEGENL EVLKLLQKSY
AGRVKLVYID PPYNTGKDFV YPDNFTDSLR HYLELTGQTT GGKRVTSHTD ASGRFHTDWL
NMIYPRLKLA RDLLTEDGVI AVHIDEHEQH ALVLVMREIF GEDNELGVAV WDKRNPKGDA
RGIAYQHESI VLFARDAERL FERAPLKRPK RNAQRMLDAA REAVAGAATI ADANAAYRGW
VKSQTTLSGG EAMYDRISAD GRVYRLVSMA WPNKKKAPDD YFVPLVHPVT GKPCPVPERG
WRNPPATMRA LIDKGLVEFG ADETTQPQRI YFLDENMYEN VPSVLPFGGS DDALMKSLGI
PFDQPKPVEF AASIIGWCTD GDDLIVDFFG GSGTTAHAVM ALNAADGGHR RYVLVQLPEP
LDADSKDQKA AADFCAAQRV PLNLAELTKE RLRRAAARIA AEHPGTRADL GFRVFRLDST
NVSEWDPRGD DIQQSLFAAV EHIKPNRSEE DLLYELMLKL GLDLCAPIDA RMIAGKAVYV
IDGAIVACFD AHIDRASTDA LGEGIVGLIA EAADAREVTC VFRDSGFADD VAKVNLSAIL
EQHGVKRIRS L
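
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method: the helper `translate` is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids and stop codons as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGATGCAAA AACTCGATGC GGCGAGCCCG"))  # MMQKLDAASP
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("AGCCTCTGA"))  # SL
```

Both fragments match the corresponding ends of the protein sequence listed above, which begins MMQKLDAASP and ends in SL.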