Gene Information |
Locus tag | Bcep18194_A3212 |
Symbol | |
ID | 3748359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 41836 |
End bp | 44862 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637761475 |
Product | Type III restriction enzyme, res subunit |
Protein accession | YP_367458 |
Protein GI | 78064689 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
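The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record for Bcep18194_A3212.
gene_len, protein_len = cds_lengths(41836, 44862)
print(gene_len, protein_len)  # 3027 1008
```

Under these assumptions the computed values agree with the listed Gene Length (3027 bp) and Protein Length (1008 aa).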
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.505736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTGC ATTTCGAAGC GGATCTCGAC TACCAGCGCG AGGCGATCGA CGCCGTCTGC GACCTGTTTC GCGGCCAGGA ATCGTATCGC GGCGACTTCA GCGTGCTCGC GAACGCCGCG CCCGGCACCA CGGCCGCCGC GCAAGGCTCG CTCGGCTTCG CGGTGTCGGA ACAGGGCGTG GGCAACCGGC TGTCGCTGAC CGACGACGCG CTCGCGCGCA ATCTCGCCGA CGTGCAGCTG CGCGGCGGGC TGCCGCCGTC CGGGCTGCCC GGCTCGCGCG ACTTCACCGT CGAGATGGAA ACCGGCACCG GCAAGACCTA CGTGTACCTG CGCACGATCT TCGAGCTGAA CCGCCGCTAC GGCTTCACGA AGTTCGTGAT CGTCGTGCCG TCGATCGCGA TCAAGGAAGG CGTGCACAAG ACGCTCGCGA TCACCGAGGA TCACTTCCGC GCGCTGTACG CGGGCGTGCC GTACGACTAC TTCCTGTACG ACTCAGCGAA GCTCGGCCAG GTGCGCCACT TCGCGGCGAG CGCGGCGATC CAGATCATGG TGATGACGGT CGCGGCGATC AACAAGAAGG AAATCAACAA CCTCTACAAG GACAGCGAGA AGACCGGCGG CGAGAAGCCG ATCGACCTGA TCCGCGCGAC GCGTCCGATC GTGATCGTCG ACGAGCCGCA GAGCGTCGAC GGCGGGCTCG AAGGCCGCGG CCGCGAAGCG CTCGCCGCGA TGGCGCCGCT CTGCACGCTG CGCTATTCGG CCACGCACGT CGACCGTCAC CACATGGTGT ACCGGCTCGA CGCGGTCGAC GCGTACGAGC GCAAGCTGGT CAAGCAGATC GAGATCGCAT CGGCGATCGT CGAGGATGCG CACAACAAGC CTTACGTGCG GCTCGTCGGC GTGTCGAACC GGCGCGGCGC GATCAGCGCG CGCGTCGAGC TCGACGTCGC GACGACGGCC GGCGTGAAGC GCCAGATCGT GACCGCCACC GACGGCGACG ATCTCGAACG CCTGACCAAG CGCGCGCTGT ACGCGGGGCT GCGGATCGGC GAAGTCCATG CGGTGAAGGG CGCCGAATAC GTCGAGCTGC GCCATCCGGA AGGCGAGGCG TTCCTGTCGC TCGGCGAAGC GTTCGGCGAC ATCGACACGC TCGCCGTGCA GCGCGAGATG ATCCGCCGCA CGATCCGCGA GCATCTGGAC AAGGAGCTGC GCCTGGCCGA ACGCGGCGTG AAGGTGCTGT CGTTGTTCTT CGTCGATTCG GTCGAGCGCT ATCGCCAGTA CGACGAGAAC GGCATGCCGG TGAAGGGCGA CTACGCGTTG ATCTTCGAAG AGGAATATGC GCGTGCGGCA CGTGTGCCGG CCTACCGTGC GCTGTTCGAC GGTGTCGACG TCGCGCTCGA AGTCGGGCGC GCGCACAACG GCTACTTCTC AATCGACCGC AAGGGCGGCT GGACCGACAC GAGCGAAAGC AGCGCCGCGG CGCGCGAAAA CGCGGAGCGT GCGTACGGCC TGATCATGCG CGAGAAGGAA GCGCTGCTGT CGTTCGACAC GCCGCTGAAG TTCATCTTCT CGCACTCGGC GCTGAAGGAA GGCTGGGACA ACCCGAACGT GTTCCAGATC TGCACGCTGC GCGACATCCA GACCGAACGC GAGCGCCGCC AGACGCTCGG CCGCGGGCTG CGCCTCGCCG TCGACCAGGA CGGCGAACGC GTGCGCGATG CGGGCGTGAA CACGCTCACC GTGATCGCGA CCGAGCGCTA CGAGTCGTTC GCGGAGAACC TGCAGAAGGA AATCGAGGCC 
GATACGGGCA TCCGGTTCGG CATCGTCGAG GAGCACCAGT TCGCGGCGCT GCCCGTGCAG GAAGGCGACG GGCCCGCGCA TGCGCTCGGC GTCGAGCTGT CGCGCGTGCT GTGGAATCAC CTGCACGAGC AAGGCTACGT CGACGCGCAG GGCAAGGTGC TCGACCGGTT GAAGGATGCG CTGCGCCAGA GCGCGCTGGT GCTGCCGGAG GCGTTCGAGA TGCTGCGCGC GCCGATCGTC GCGACGCTGC GCAAGCTGTC GGGCCGCTTT GCGGTTCGCA ATGCGGACGA GCGCCGTGCG ATCGCGCTGC GACGCGACGC ATCGGGCAAG GCCGTCGTGT TCGGCGAGGA GTTCCGCGCG CTGTGGGACC GCATCCGCCA TCGCACCGTG TACCGCGTCG AGTTCGACAA CGCGAAGCTC GTGCGAGACT GCGCGGCTGC GCTGCACGAC GCGCCGGACA TCGCACGCGC ACGGCTGCAG TGGCGCAAGG CCGAGATCGA CATCGGCAAG GCCGGCATCG AGGCGATCGA GGTGGCCGGC GCCGGCACGG TGCTGCTCGA CGAAGGCGAG CTGCCGCTGC CCGACCTGCT CACCGAGCTG CAGGACCGCA CGCAGCTCAC GCGCCGTTCG CTCGCGACGA TCCTGGCCGA CAGCGGCCGG CTCGAGGATT TCCGCGTGAA TCCGCAGCAG TTCATCACGG TTGCGGCCGA TGCGATCAAC CGCTGCAAGC GGCTCGCGCT CGTCGCGGGC ATCGCGTACC GCAAGCTCGG CGAGCGCCAT GTGCATGCGC TCGAATCGTT CGAGAGCGAG GCGCTGACCG GCTATCTGCG CAACCTGCGG CCCGATGCGC AGAAGTCGAT CCACGAGGCG GTCGTGTGCG AGACGGACGC GGAACGCGCG TTCGCCGATG CGCTCGAGGC GCACGACGGC GTGAAGCTGT ACGCGAAGCT GCCCGCATGG TTCCGCGTGC AGACGCCGCT CGGCAGCTAT CACCCGGACT GGGCCGTGCT CGCGGAACAG GACGGCGGCG AGCGGCTGTA CTTCGTCGTC GATACGCCGA ATGCGGACGG CAACGTGCCG AGCGAGCACG AGCGTGCGAA GCTCGCGTGC GGCGAAGCGC ATTTCCGCGC GCTGGTGGAC GGCGACGGCG CGGCGCGCTT CGTGCGCGTC AGGCAGGCCG ACGCGTTGTT TGAGCCTGCG GTGCCGCTGG TGGGCACCGG GCGATAA
|
Protein sequence | MRLHFEADLD YQREAIDAVC DLFRGQESYR GDFSVLANAA PGTTAAAQGS LGFAVSEQGV GNRLSLTDDA LARNLADVQL RGGLPPSGLP GSRDFTVEME TGTGKTYVYL RTIFELNRRY GFTKFVIVVP SIAIKEGVHK TLAITEDHFR ALYAGVPYDY FLYDSAKLGQ VRHFAASAAI QIMVMTVAAI NKKEINNLYK DSEKTGGEKP IDLIRATRPI VIVDEPQSVD GGLEGRGREA LAAMAPLCTL RYSATHVDRH HMVYRLDAVD AYERKLVKQI EIASAIVEDA HNKPYVRLVG VSNRRGAISA RVELDVATTA GVKRQIVTAT DGDDLERLTK RALYAGLRIG EVHAVKGAEY VELRHPEGEA FLSLGEAFGD IDTLAVQREM IRRTIREHLD KELRLAERGV KVLSLFFVDS VERYRQYDEN GMPVKGDYAL IFEEEYARAA RVPAYRALFD GVDVALEVGR AHNGYFSIDR KGGWTDTSES SAAARENAER AYGLIMREKE ALLSFDTPLK FIFSHSALKE GWDNPNVFQI CTLRDIQTER ERRQTLGRGL RLAVDQDGER VRDAGVNTLT VIATERYESF AENLQKEIEA DTGIRFGIVE EHQFAALPVQ EGDGPAHALG VELSRVLWNH LHEQGYVDAQ GKVLDRLKDA LRQSALVLPE AFEMLRAPIV ATLRKLSGRF AVRNADERRA IALRRDASGK AVVFGEEFRA LWDRIRHRTV YRVEFDNAKL VRDCAAALHD APDIARARLQ WRKAEIDIGK AGIEAIEVAG AGTVLLDEGE LPLPDLLTEL QDRTQLTRRS LATILADSGR LEDFRVNPQQ FITVAADAIN RCKRLALVAG IAYRKLGERH VHALESFESE ALTGYLRNLR PDAQKSIHEA VVCETDAERA FADALEAHDG VKLYAKLPAW FRVQTPLGSY HPDWAVLAEQ DGGERLYFVV DTPNADGNVP SEHERAKLAC GEAHFRALVD GDGAARFVRV RQADALFEPA VPLVGTGR