Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMAA1778 |
Symbol | |
ID | 3087029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | - |
Start bp | 1945055 |
End bp | 1948477 |
Gene Length | 3423 bp |
Protein Length | 1140 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637565655 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_106337 |
Protein GI | 53716076 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.527948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCATTC ACGTCGCGCT TCACCACGTC ACCCGCTACC GCTACGACCG GCTCGTCCGG CTCGGCCCGC AGATCGTGCG CCTGCGCCCC GCGCCGCATT GCCGCACGCC TGTCCTGTCG TACTCGATGA AGGTCGAGCC CGCGCAGCAC TTCGTCAACT GGCAGCAGGA TCCGTTCGCG AACTACATGG CGCGGCTCGT GTTTCCCGAG CGCACGCGCA CGCTCGAGAT CGCGATCGAC CTCGTCGCCG AGATGTCGGT CTACAACCCA TTCGATTTTT TCCTGGAAGA GCACGCGCAG ACCTTTCCGT TCGACTATGG CGACGCACTG CGCCGCGAGC TCGCGCCGTA TCTCGCGTGC GATTCCGCCA CGGGCGCCTG CGACGCGTTT CGTTCGTATA TCGAATCGAT CGATCGCGCG CCGGCGGGCA CCGTCGACTT TCTCGTCGCG CTGAACCGGC GGTTGCAGCA CGACATCCGC TACGTCGTGC GGCTCGAGCC GGGCGTGCAG ACGCCCGAGC AGACGCTTGC GCTCGCATCG GGCTCGTGCC GCGACAGCGG CTGGTTGCTC GTGCAGCTGT GCCGTCACCT CGGCCTCGCC GCACGCTTCG TGTCCGGCTA TCTGATCCAG TTGACGCCCG ACGTGAAATC GCTCGACGGC CCGAGCGGCG CCGAGGCCGA CTTCACCGAT CTGCACGCGT GGTGCGAAGT GTATCTGCCG GGCGCGGGCT GGATCGGTTT CGATCCGACA TCGGGGCTGC TCGCGGGCGA AGGGCATATT CCGCTTGCCG CGACGCCGCA GCCGACGAGC GCCGCGCCCG TCGAGGGGCT CGTCGACGAA TGCGAGGTCG AGTTCGAGCA CGAGATGCGC GTGACGCGAA TCTACGAATC GCCGCGCGTG ACGAAGCCGT ATACGGACGA AGCGTGGCAG CGCGTGCTGC GGCTCGGCGC GCAGGTCGAC GCCGCTTTGA ACGCGGGCGA CGTGCGCCTC ACGCAGGGCG GCGAGCCGAC GTTCGTGTCG ATCGACGATT GCGACGGCGC GGAATGGAAC ACCGATGCGC TGGGGCCGAC GAAGCGCGGC CATGCGACGT CGCTCGTGCA GAAGCTGCGC GCGGAGTACG GCGTGGGCGG CTTCCTGCAC TTCGGTCAGG GCAAGTGGTA TCCGGGCGAG CAACTGCCGC GCTGGGCACT GTCGATATTC TGGCGCGCGG ACGGCCAGCC CGTCTGGCGC GATCCGGCGC GCTTCGCCGA CGAGCGCGAA CCGTCCGCGT ACACGAGCGC CGACGCCGAG CGCTTCATTC GCGCGCTCGC GGCGCGTCTC GGGCTCGCGG GCGATTACGT GACGCCGGGC TACGAAGACG TCTGGTATTA CCTGTGGCGC GAGCGGCGGC TACCCGTGAA CGTCGATCCG TTCGACGCGC GGCTCGACGA CGAGCTCGAG CGTGCGCGTC TGCGCAGGGT GTTCTCGCAG CAGTTGGACA GCGTGATCGG CTACGTGCTG CCGCTCAAGC CGCTCGCGCC GAATCCGGCG CTCGCGGGGC CGCGCTGGGA GACGGGCCCG TGGTTCTTCC GCGATGAGCG GATGTATCTC GTGCCGGGCG ATTCGCCGAT GGGCTATCGC CTGCCGCTCG ATTCGCTGCC GTGGGCGAGC CGCGGCGACT ATCCATATCT CGTCGAGCGT GATCCGTTCG CGCCGCGCGA CGCGTTGCCC GATGCGGATG CGCTGCGCGC GCGCCATGTC GGCGGCGGCT TCGGCGCACC TCGCGATCCG GGCGCGGCGG CGCGCGACGC CGACGCGCCC GCGCGGCCCG TGATGCAGGC GCGCGCCGGC GAACGCGCGT TCGCGCGGCC GGCGGGCGAG GCCGAGGCCG CGCGCTTCCC GCAGCGCTTC GAGTCCGCCG GGTGGATCAC GCGCACCGCG CTGTGCGTCG AGGCACGCGG CGGCGTGCTC TACGTGTTCA TGCCGCCGAT CGCCGCGCTC GAAGACTATC TCGACCTGTT GGCCGCGATC GAGCTGACCG CCGAATCGCT CGACGCGAAG CTCGTGCTCG AAGGCTACCC GCCGCCGCGC GACGCGCGGC TGAAGATGCT GCAGGTGACA CCCGATCCCG GTGTGATCGA AGTGAACATC CATCCCGCGC ACGATTTCGA CGAGCTCGTT CAGCACACCG AATTCCTGTA CGACGCCGCG TATCGATCGC GGCTGTCGAG CGAGAAGTTC ATGGTCGACG GCCGGCATGT CGGCACGGGC GGCGGCAATC ACTTCGTGCT CGGCGGCGCG ACGCCCGCCG ACAGCCCGTT CCTGCGCCGT CCCGATCTGC TCGCGAGCCT GATCGCGTAT TGGCACAATC ATCCGTCGCT GTCCTACCTG TTCTCGGGGC TCTTCATCGG CCCGACGAGC CAGGCGCCGC GCGTCGACGA GGCGCGCGAC GATCAGGTCT ACGAACTCGA TATCGCGTTC GCCGAGCTTC GCCGCAACAC GCTGCGCGCG GGCGAGGACA TGCCGCCGTG GCTCGTCGAT CGCGTGCTGC GCAATCTGCT GATCGACGTG ACGGGAAACA CGCATCGCAG CGAATTCTGC ATCGACAAGC TCTATTCGCC GGATTCGGCG ACGGGGCGCC TCGGCCTGCT CGAGCTGCGC GCGTTCGAAA TGCCGCCGCA CGCGCGAATG AGCATCGTCC AGCAATTGCT GTTGCGCGCG CTGATCGCGC GCTTCTGGCG CGTGCCGTAC ACGGCGCCGC TCGCGCGCTG GGGCACCGCG CTGCACGATC GCTTCCTGCT GCCGAGCTTC GTCAGGATGG ATTTCGACGA CGTGCTGACC GAGCTGCGCG AAGCGGGCTT CGGGTTCGAC GCGGCGTGGT TCGCGCCGCA CTTCGAATTC CGTTTTCCGC TGTTCGGGCA GATCGCCGCG CGCGGCGTCG CGCTCGCGCT GCGCGGCGCG CTCGAGCCGT GGCACGTGAT GGGCGAGGAG GGGGCGATCG GCGGCACCGT GCGCTACGTC GATTCGTCGC TCGAGCGGCT CGAAGTCCGG GTGAGCGGGC TGAACGACAG CCGCTACGTC GTGACCGTCA ACGGCCGCGC GCTGCCGCTG CAGCCGACGG GCACCGCCGG CGAGTACGTC GCGGGCGTCC GCTACAAGGC ATGGGCGCCG CCTTCGGCGC TGCATCCGAC GATCGGCGTA CACGCGCCGC TCACGTTCGA CATCGTCGAT ACATGGATGC GGCGCTCGCT CGGCGGCTGC CGGTATCACG TCGCGCATCC GGGCGGGCGC CACTACGACA CGTTCCCCGT GAACGCTTAC GAAGCCGAGA GCCGGCGGCT CGCGCGCTTC GTGTCGATGG GGCACACGCC GGGCGCGATG ACGGTCGAGC CGGCCGCGCC GGGCCGCGAA TTTCCGTTCA CGCTGGATTT GCGGCATGGG TGA
|
Protein sequence | MSIHVALHHV TRYRYDRLVR LGPQIVRLRP APHCRTPVLS YSMKVEPAQH FVNWQQDPFA NYMARLVFPE RTRTLEIAID LVAEMSVYNP FDFFLEEHAQ TFPFDYGDAL RRELAPYLAC DSATGACDAF RSYIESIDRA PAGTVDFLVA LNRRLQHDIR YVVRLEPGVQ TPEQTLALAS GSCRDSGWLL VQLCRHLGLA ARFVSGYLIQ LTPDVKSLDG PSGAEADFTD LHAWCEVYLP GAGWIGFDPT SGLLAGEGHI PLAATPQPTS AAPVEGLVDE CEVEFEHEMR VTRIYESPRV TKPYTDEAWQ RVLRLGAQVD AALNAGDVRL TQGGEPTFVS IDDCDGAEWN TDALGPTKRG HATSLVQKLR AEYGVGGFLH FGQGKWYPGE QLPRWALSIF WRADGQPVWR DPARFADERE PSAYTSADAE RFIRALAARL GLAGDYVTPG YEDVWYYLWR ERRLPVNVDP FDARLDDELE RARLRRVFSQ QLDSVIGYVL PLKPLAPNPA LAGPRWETGP WFFRDERMYL VPGDSPMGYR LPLDSLPWAS RGDYPYLVER DPFAPRDALP DADALRARHV GGGFGAPRDP GAAARDADAP ARPVMQARAG ERAFARPAGE AEAARFPQRF ESAGWITRTA LCVEARGGVL YVFMPPIAAL EDYLDLLAAI ELTAESLDAK LVLEGYPPPR DARLKMLQVT PDPGVIEVNI HPAHDFDELV QHTEFLYDAA YRSRLSSEKF MVDGRHVGTG GGNHFVLGGA TPADSPFLRR PDLLASLIAY WHNHPSLSYL FSGLFIGPTS QAPRVDEARD DQVYELDIAF AELRRNTLRA GEDMPPWLVD RVLRNLLIDV TGNTHRSEFC IDKLYSPDSA TGRLGLLELR AFEMPPHARM SIVQQLLLRA LIARFWRVPY TAPLARWGTA LHDRFLLPSF VRMDFDDVLT ELREAGFGFD AAWFAPHFEF RFPLFGQIAA RGVALALRGA LEPWHVMGEE GAIGGTVRYV DSSLERLEVR VSGLNDSRYV VTVNGRALPL QPTGTAGEYV AGVRYKAWAP PSALHPTIGV HAPLTFDIVD TWMRRSLGGC RYHVAHPGGR HYDTFPVNAY EAESRRLARF VSMGHTPGAM TVEPAAPGRE FPFTLDLRHG
|
| |