Gene BURPS1106A_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0046 
Symbol 
ID4899644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp42115 
End bp44148 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content68% 
IMG OID640133276 
Producttype III DNA modification methyltransferase 
Protein accessionYP_001064331 
Protein GI126452068 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.312444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTGA TGGGAAGGAT GATGCAAAAA CTCGATGCGG CGAGCCCGGA GGCGCAATCC 
GCGGATCTCG TGGCCGCCAA CGTCGAGCGC CTGAAGGCGC TCTTTCCGGA CGTGGTGACC
GAAGGGCCGG ACGGCGCGTC GGTGAATCTC GACGCGCTCG CGGCGCTGGT GGGCGCGAGC
GCGGCGGCCG CGGCCGACGC CGACGAGAAG TACGGCCTGA ACTGGCACGG CAAGCGGCGC
GCGCGCCGGC TCGCGCTCAC GCCGTCGACG GGCACGCTGC GCCCGTGCCC GCGCGAGAGC
GCCGGCTGGG CGTCGACGCG CAACCTGATG ATCGAGGGCG AGAACCTCGA GGTGCTGAAG
CTGCTGCAGA AGAGCTACGC GGGGCGCGTG AAGCTCGTCT ACATCGATCC GCCGTACAAC
ACCGGCAAGG ATTTCGTCTA TCCGGACAAT TTCACCGACA GCCTGCGCCA TTATCTCGAG
CTGACCGGCC AGACGACGGG CGGCAAGCGG GTCACCAGCC ACACCGACGC GAGCGGGCGC
TTCCACACCG ACTGGCTGAA CATGATCTAC CCGCGCCTGA AGCTCGCGCG CGATCTGCTC
ACCGAGGACG GCGTGATCGC CGTGCACATC GACGAGCACG AACAGCACGC GCTCGTGCTC
GTGATGCGCG AGATCTTCGG CGAAGACAAC GAGCTCGGCG TCGCGGTGTG GGACAAGCGC
AATCCGAAGG GCGATGCGCG CGGGATCGCG TACCAGCACG AATCGATCGT GCTGTTCGCG
CGCGACGCTG AACGGCTGTT CGAGCGTGCG CCGCTCAAGC GCCCGAAACG CAACGCGCAG
CGCATGCTGG GCGCGGCGCG CGAGGCGGTC GCCGGCGCGG CGACGATCGC GGACGCGAAC
GCCGCGTACC GCGGCTGGGT GAAGTCTCAG ACGACGCTGT CGGGCGGCGA GGCGATGTAC
GACCGAATCT CCGCCGACGG GCGCGTGTAC CGCCTCGTGT CGATGGCGTG GCCGAACAAG
AAGAAGGCGC CCGACGACTA CTTCGTGCCG CTCGTGCATC CGGTGACGGG CAAGCCGTGC
CCCGTGCCCG AGCGCGGCTG GCGCAACCCG CCCGCGACGA TGCGCGCGCT CATCGACAAG
GGCCTCGTCG AATTCGGCGC GGACGAGACC ACGCAGCCGC AGCGGATCTA TTTCCTCGAC
GAGAACATGT ACGAGAACGT GCCTTCGGTG CTGCCGTTCG GCGGCTCGGA CGACGCGCTG
ATGAAGTCGC TCGGCATTCC TTTCGATCAG CCCAAGCCCG TCGAATTCGC CGCGTCGATC
ATCGGCTGGT GCACCGACGG CGACGATCTG ATCGTCGACT TCTTCGGCGG CTCCGGCACG
ACCGCGCACG CGGTGATGGC GCTGAACGCG GCCGACGGCG GCCATCGCCG CTACGTGCTC
GTGCAACTGC CCGAGCCGCT CGACGCCGAC AGCAAGGACC AGAAGGCCGC CGCCGATTTC
TGCGCGGCGC AGCGCGTGCC GCTCAATCTC GCCGAGCTGA CGAAGGAGCG GCTGCGGCGC
GCGGCGGCGC GCATCGCGGC CGAGCATCCG GGCACGCGGG CGGATCTCGG TTTTCGCGTG
TTCAGGCTCG ATTCGACGAA CGTCTCCGAA TGGGACCCGC GCGGCGACGA CATCCAGCAG
TCGCTGTTCG CGGCCGTCGA GCACATCAAG CCGAACCGCT CCGAGGAAGA TCTGCTGTAC
GAACTGATGC TCAAGCTCGG CCTCGATCTG TGCGCGCCGA TCGACGCACG CATGATCGCC
GGCAAGGCGG TCTACGTGAT CGACGGCGCG ATCGTCGCGT GCTTCGATGC GCATATCGAC
CGCGCGTCGA CCGACGCGCT CGGCGAGGGC ATCGTCGGGC TGATCGCCGA AGCGGCCGAC
GCGCGCGAGG TGACCTGCGT GTTCCGCGAC AGCGGCTTCG CGGACGACGT CGCGAAGGTG
AACCTGTCGG CGATTCTCGA GCAGCACGGC GTGAAGCGCA TCCGCAGCCT CTGA
 
Protein sequence
MQLMGRMMQK LDAASPEAQS ADLVAANVER LKALFPDVVT EGPDGASVNL DALAALVGAS 
AAAAADADEK YGLNWHGKRR ARRLALTPST GTLRPCPRES AGWASTRNLM IEGENLEVLK
LLQKSYAGRV KLVYIDPPYN TGKDFVYPDN FTDSLRHYLE LTGQTTGGKR VTSHTDASGR
FHTDWLNMIY PRLKLARDLL TEDGVIAVHI DEHEQHALVL VMREIFGEDN ELGVAVWDKR
NPKGDARGIA YQHESIVLFA RDAERLFERA PLKRPKRNAQ RMLGAAREAV AGAATIADAN
AAYRGWVKSQ TTLSGGEAMY DRISADGRVY RLVSMAWPNK KKAPDDYFVP LVHPVTGKPC
PVPERGWRNP PATMRALIDK GLVEFGADET TQPQRIYFLD ENMYENVPSV LPFGGSDDAL
MKSLGIPFDQ PKPVEFAASI IGWCTDGDDL IVDFFGGSGT TAHAVMALNA ADGGHRRYVL
VQLPEPLDAD SKDQKAAADF CAAQRVPLNL AELTKERLRR AAARIAAEHP GTRADLGFRV
FRLDSTNVSE WDPRGDDIQQ SLFAAVEHIK PNRSEEDLLY ELMLKLGLDL CAPIDARMIA
GKAVYVIDGA IVACFDAHID RASTDALGEG IVGLIAEAAD AREVTCVFRD SGFADDVAKV
NLSAILEQHG VKRIRSL