Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BamMC406_0028 |
Symbol | |
ID | 6176112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010551 |
Strand | - |
Start bp | 35356 |
End bp | 38370 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641679763 |
Product | type III restriction protein res subunit |
Protein accession | YP_001806746 |
Protein GI | 172059094 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTGC ATTTCGAAGC GGATCTCGAC TACCAGCGCG AGGCGATCGA CGCCGTCTGC GACCTGTTTC GCGGCCAGGA ATCGTATCGC GGCGACTTCA GCGTGCTCGC GAACGCCGCG CCCGGCACGA CGGCCGCCGC GCAAGGTTCG CTCGGCTTCG CGGTGTCGGA GCAAGGCGTC GGCAACCGGC TGTCGCTGAC CGACGACGCG CTCGCGCGCA ATCTCGCCGA CGTGCAGCTG CGCGGCGGGC TGCCGGCGTC CGGCCTGCCC GGCTCGCGCG ACTTCACCGT CGAGATGGAG ACCGGCACCG GCAAGACCTA CGTGTACCTG CGCACGATCT TCGAGCTGAA CCGCCGCTAC GGCTTCACGA AGTTCGTGAT CGTCGTGCCG TCGATCGCGA TCAAGGAAGG CGTGCACAAG ACGCTGTCGA TCACCGAGGA CCATTTCCGC GCGCTGTATG CGGGCGTGCC GTACGACTAC TTCCTGTACG ACTCCGCGAA GCTCGGCCAG GTGCGCCACT TCGCGGCGAG CGCGGCGATC CAGATCATGG TGATGACGGT TGCCGCGATC AACAAGAAGG AGATCAACAA CCTCTACAAG GACAGCGAGA AGACGGGCGG CGAGAAGCCG ATCGACCTGA TCCGCGCGAC GCGGCCGATC GTGATCGTCG ACGAGCCGCA GAGCGTCGAC GGCGGGCTCG AGGGGCGCGG GCGCGAAGCG CTCGCCGCGA TGGCGCCGCT CTGCACGCTG CGCTATTCGG CGACGCACGT CGACCGTCAT CACATGGTGT ACCGGCTCGA CGCGGTCGAT GCGTACGAGC GCAAGCTCGT CAAGCAGATC GAGATCGCGT CGGCGATAGT CGAGGATGCG CACAACAAGC CGTACGTGCG GCTCGTCGGC GTGTCGAACC GGCGCGGCGC GATCAGCGCG CGCATCGAGC TGGACGTCGC GACCGCGGCC GGCGTGAAGC GCCAGATCGT GTCGGCGACC GACGGCGACG ATCTCGAGCG CCTGACGAAG CGCGCACTGT ATGCGGGGCT GCGCGTCGGC GAAGTCCATG CGGTGAAGGG CGCCGAGTAC GTCGAGCTGC GCCATCCGGA AGGCGAGGCG TTCCTGTCGC TCGGCGAGGC GTTCGGCGAC ATCGACACGC TCGCCGTGCA GCGCGAGATG ATCCGCCGCA CGATCCGCGA GCATCTCGAC AAGGAATTGC GTCTGGCCGA GCGCGGCGTG AAGGTGCTGT CGCTGTTCTT CGTCGATTCG GTCGAACGCT ATCGCCGCTA CGACGAGAAC GGGATGCCCG TGAAGGGCGA CTACGCGCTG ATCTTCGAAG AGGAATATGC GCGCGCGGCC CGCGTGCCGG CTTACCGTGC GTTGTTCGAC GGCGTCGACG TCGCGCTCGA GGTCGAGCGC GCGCACAACG GCTACTTCTC GATCGACCGC AAGGGTGGCT GGACCGACAC GAGCGAGAGC AGCGCCGCCG CACGCGAGAA CGCGGAGCGC GCGTACGGCC TCATCATGCG CGAGAAGGAA GCGCTGCTGT CGTTCGACAC GCCGCTGAAG TTCATCTTCT CGCACTCGGC GCTGAAGGAA GGCTGGGACA ACCCGAACGT GTTCCAGATC TGCACGCTGC GCGACATCCA GACCGAGCGC GAGCGTCGCC AGACGCTCGG CCGCGGGCTG CGCCTCGCCG TCGACCAGGA CGGCGAGCGC GTGCGCGACC CGGGCGTCAA CACGCTGACC GTGATCGCGA CCGAGCGCTA CGAGTCGTTT GCCGAGAACC TGCAAAAGGA AATCGAGGCC GATACGGGCA TCCGCTTCGG GATCGTCGAG GAACACCAGT TCGCGGCGCT GCCGGTGCAG GAAGGCGACG GGCCCGCGCA TGCGCTCGGC ATCGAGCTGT CGCGCGTGCT GTGGGCGCAT CTGCGCGAAC AGGGCTACGT CGACGCGCAG GGCAAAGTGC TCGACCGGCT GAAGGACGCG CTGCGCCAGA GCGCACTCGT GCTGCCCGAA GCGTTCGAGA CGCTGCGCGC GCCGATCGTC GCGATGCTGC GCAAGCTGTC GGGCCGCTTC GCCGTGCGCA ATGCGGACGA GCGGCGCGCG ATCGCGTTGC GGCGCGACGC GTCGGGCAAG GCTGTCGTGT TCGGCGACGA CTTCCGCGCG CTGTGGGACC GTATCCGCCA CCGCACCGTG TACAGCGTCG AGTTCGACAA CGCGAAGCTC GTGCGCGACT GCACGGCCGC GCTGCGCGAC GCGCCGGACA TCGCGCGTGC CCGGCTGCAG TGGCGCAAGG CCGAGATCGA CATCGGCAAG GCCGGCATCG AGGCGGTCGA GGTGGCCGGC GCGGGCACGG TGCTGATCGA CGAAGGCGAG CTGCCGCTGC CCGACCTGCT CACCGAGCTG CAGGACCGCA CGCAGCTCAC GCGCCGCTCG CTCGCGACGA TCCTCGCCGA CAGCGGCCGG CTCGACGATT TCCGCGTGAA TCCGCAGCAG TTCATCGCGC TGGCCGCCGA TGCGATCAAC CGCTGCAAGC GGCTCGCGCT GGTCGCGGGC ATCGCGTACC GCAGGCTCGG CGAACGCAAC GTGCATGCGC TCGAATCGTT CGAGAGCGAG GCGCTGACCG GCTATCTGCG CAACCTGCGT CCCGATGCGC GCAAGTCGAT CCACGAGGCC GTCGTGTGCG AGACCGACGC GGAGCGTGCG TTCGCCGATG CGCTGGAAGC GCACGACGGC GTGAAGCTGT ACGCGAAGCT GCCCGCGTGG TTCCGCGTGC CGACGCCGCT CGGCAGCTAT CACCCCGACT GGGCGGTGCT CGCGGAGCAG GATGGCGGCG AGCGGCTGTA TTTCGTCGTC GATACGCCGA ATGCCGATGG CGACGTGACG ACCGAGCACG AGCGCGCGAA GCTCGCGTGC GGTGAAGCGC ATTTCCGTGC GCTGGTGGAC GGGGAAGATG CCGCGCGATT CGTGCGCGTC AGGCAGGCGG AGGCGTTGTT CGAGCCTGCT GTGTTGCGGC TTTGA
|
Protein sequence | MRLHFEADLD YQREAIDAVC DLFRGQESYR GDFSVLANAA PGTTAAAQGS LGFAVSEQGV GNRLSLTDDA LARNLADVQL RGGLPASGLP GSRDFTVEME TGTGKTYVYL RTIFELNRRY GFTKFVIVVP SIAIKEGVHK TLSITEDHFR ALYAGVPYDY FLYDSAKLGQ VRHFAASAAI QIMVMTVAAI NKKEINNLYK DSEKTGGEKP IDLIRATRPI VIVDEPQSVD GGLEGRGREA LAAMAPLCTL RYSATHVDRH HMVYRLDAVD AYERKLVKQI EIASAIVEDA HNKPYVRLVG VSNRRGAISA RIELDVATAA GVKRQIVSAT DGDDLERLTK RALYAGLRVG EVHAVKGAEY VELRHPEGEA FLSLGEAFGD IDTLAVQREM IRRTIREHLD KELRLAERGV KVLSLFFVDS VERYRRYDEN GMPVKGDYAL IFEEEYARAA RVPAYRALFD GVDVALEVER AHNGYFSIDR KGGWTDTSES SAAARENAER AYGLIMREKE ALLSFDTPLK FIFSHSALKE GWDNPNVFQI CTLRDIQTER ERRQTLGRGL RLAVDQDGER VRDPGVNTLT VIATERYESF AENLQKEIEA DTGIRFGIVE EHQFAALPVQ EGDGPAHALG IELSRVLWAH LREQGYVDAQ GKVLDRLKDA LRQSALVLPE AFETLRAPIV AMLRKLSGRF AVRNADERRA IALRRDASGK AVVFGDDFRA LWDRIRHRTV YSVEFDNAKL VRDCTAALRD APDIARARLQ WRKAEIDIGK AGIEAVEVAG AGTVLIDEGE LPLPDLLTEL QDRTQLTRRS LATILADSGR LDDFRVNPQQ FIALAADAIN RCKRLALVAG IAYRRLGERN VHALESFESE ALTGYLRNLR PDARKSIHEA VVCETDAERA FADALEAHDG VKLYAKLPAW FRVPTPLGSY HPDWAVLAEQ DGGERLYFVV DTPNADGDVT TEHERAKLAC GEAHFRALVD GEDAARFVRV RQAEALFEPA VLRL
|
| |