Gene Bcep18194_A3212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3212 
Symbol 
ID3748359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp41836 
End bp44862 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content68% 
IMG OID637761475 
ProductType III restriction enzyme, res subunit 
Protein accessionYP_367458 
Protein GI78064689 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.505736 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTGC ATTTCGAAGC GGATCTCGAC TACCAGCGCG AGGCGATCGA CGCCGTCTGC 
GACCTGTTTC GCGGCCAGGA ATCGTATCGC GGCGACTTCA GCGTGCTCGC GAACGCCGCG
CCCGGCACCA CGGCCGCCGC GCAAGGCTCG CTCGGCTTCG CGGTGTCGGA ACAGGGCGTG
GGCAACCGGC TGTCGCTGAC CGACGACGCG CTCGCGCGCA ATCTCGCCGA CGTGCAGCTG
CGCGGCGGGC TGCCGCCGTC CGGGCTGCCC GGCTCGCGCG ACTTCACCGT CGAGATGGAA
ACCGGCACCG GCAAGACCTA CGTGTACCTG CGCACGATCT TCGAGCTGAA CCGCCGCTAC
GGCTTCACGA AGTTCGTGAT CGTCGTGCCG TCGATCGCGA TCAAGGAAGG CGTGCACAAG
ACGCTCGCGA TCACCGAGGA TCACTTCCGC GCGCTGTACG CGGGCGTGCC GTACGACTAC
TTCCTGTACG ACTCAGCGAA GCTCGGCCAG GTGCGCCACT TCGCGGCGAG CGCGGCGATC
CAGATCATGG TGATGACGGT CGCGGCGATC AACAAGAAGG AAATCAACAA CCTCTACAAG
GACAGCGAGA AGACCGGCGG CGAGAAGCCG ATCGACCTGA TCCGCGCGAC GCGTCCGATC
GTGATCGTCG ACGAGCCGCA GAGCGTCGAC GGCGGGCTCG AAGGCCGCGG CCGCGAAGCG
CTCGCCGCGA TGGCGCCGCT CTGCACGCTG CGCTATTCGG CCACGCACGT CGACCGTCAC
CACATGGTGT ACCGGCTCGA CGCGGTCGAC GCGTACGAGC GCAAGCTGGT CAAGCAGATC
GAGATCGCAT CGGCGATCGT CGAGGATGCG CACAACAAGC CTTACGTGCG GCTCGTCGGC
GTGTCGAACC GGCGCGGCGC GATCAGCGCG CGCGTCGAGC TCGACGTCGC GACGACGGCC
GGCGTGAAGC GCCAGATCGT GACCGCCACC GACGGCGACG ATCTCGAACG CCTGACCAAG
CGCGCGCTGT ACGCGGGGCT GCGGATCGGC GAAGTCCATG CGGTGAAGGG CGCCGAATAC
GTCGAGCTGC GCCATCCGGA AGGCGAGGCG TTCCTGTCGC TCGGCGAAGC GTTCGGCGAC
ATCGACACGC TCGCCGTGCA GCGCGAGATG ATCCGCCGCA CGATCCGCGA GCATCTGGAC
AAGGAGCTGC GCCTGGCCGA ACGCGGCGTG AAGGTGCTGT CGTTGTTCTT CGTCGATTCG
GTCGAGCGCT ATCGCCAGTA CGACGAGAAC GGCATGCCGG TGAAGGGCGA CTACGCGTTG
ATCTTCGAAG AGGAATATGC GCGTGCGGCA CGTGTGCCGG CCTACCGTGC GCTGTTCGAC
GGTGTCGACG TCGCGCTCGA AGTCGGGCGC GCGCACAACG GCTACTTCTC AATCGACCGC
AAGGGCGGCT GGACCGACAC GAGCGAAAGC AGCGCCGCGG CGCGCGAAAA CGCGGAGCGT
GCGTACGGCC TGATCATGCG CGAGAAGGAA GCGCTGCTGT CGTTCGACAC GCCGCTGAAG
TTCATCTTCT CGCACTCGGC GCTGAAGGAA GGCTGGGACA ACCCGAACGT GTTCCAGATC
TGCACGCTGC GCGACATCCA GACCGAACGC GAGCGCCGCC AGACGCTCGG CCGCGGGCTG
CGCCTCGCCG TCGACCAGGA CGGCGAACGC GTGCGCGATG CGGGCGTGAA CACGCTCACC
GTGATCGCGA CCGAGCGCTA CGAGTCGTTC GCGGAGAACC TGCAGAAGGA AATCGAGGCC
GATACGGGCA TCCGGTTCGG CATCGTCGAG GAGCACCAGT TCGCGGCGCT GCCCGTGCAG
GAAGGCGACG GGCCCGCGCA TGCGCTCGGC GTCGAGCTGT CGCGCGTGCT GTGGAATCAC
CTGCACGAGC AAGGCTACGT CGACGCGCAG GGCAAGGTGC TCGACCGGTT GAAGGATGCG
CTGCGCCAGA GCGCGCTGGT GCTGCCGGAG GCGTTCGAGA TGCTGCGCGC GCCGATCGTC
GCGACGCTGC GCAAGCTGTC GGGCCGCTTT GCGGTTCGCA ATGCGGACGA GCGCCGTGCG
ATCGCGCTGC GACGCGACGC ATCGGGCAAG GCCGTCGTGT TCGGCGAGGA GTTCCGCGCG
CTGTGGGACC GCATCCGCCA TCGCACCGTG TACCGCGTCG AGTTCGACAA CGCGAAGCTC
GTGCGAGACT GCGCGGCTGC GCTGCACGAC GCGCCGGACA TCGCACGCGC ACGGCTGCAG
TGGCGCAAGG CCGAGATCGA CATCGGCAAG GCCGGCATCG AGGCGATCGA GGTGGCCGGC
GCCGGCACGG TGCTGCTCGA CGAAGGCGAG CTGCCGCTGC CCGACCTGCT CACCGAGCTG
CAGGACCGCA CGCAGCTCAC GCGCCGTTCG CTCGCGACGA TCCTGGCCGA CAGCGGCCGG
CTCGAGGATT TCCGCGTGAA TCCGCAGCAG TTCATCACGG TTGCGGCCGA TGCGATCAAC
CGCTGCAAGC GGCTCGCGCT CGTCGCGGGC ATCGCGTACC GCAAGCTCGG CGAGCGCCAT
GTGCATGCGC TCGAATCGTT CGAGAGCGAG GCGCTGACCG GCTATCTGCG CAACCTGCGG
CCCGATGCGC AGAAGTCGAT CCACGAGGCG GTCGTGTGCG AGACGGACGC GGAACGCGCG
TTCGCCGATG CGCTCGAGGC GCACGACGGC GTGAAGCTGT ACGCGAAGCT GCCCGCATGG
TTCCGCGTGC AGACGCCGCT CGGCAGCTAT CACCCGGACT GGGCCGTGCT CGCGGAACAG
GACGGCGGCG AGCGGCTGTA CTTCGTCGTC GATACGCCGA ATGCGGACGG CAACGTGCCG
AGCGAGCACG AGCGTGCGAA GCTCGCGTGC GGCGAAGCGC ATTTCCGCGC GCTGGTGGAC
GGCGACGGCG CGGCGCGCTT CGTGCGCGTC AGGCAGGCCG ACGCGTTGTT TGAGCCTGCG
GTGCCGCTGG TGGGCACCGG GCGATAA
 
Protein sequence
MRLHFEADLD YQREAIDAVC DLFRGQESYR GDFSVLANAA PGTTAAAQGS LGFAVSEQGV 
GNRLSLTDDA LARNLADVQL RGGLPPSGLP GSRDFTVEME TGTGKTYVYL RTIFELNRRY
GFTKFVIVVP SIAIKEGVHK TLAITEDHFR ALYAGVPYDY FLYDSAKLGQ VRHFAASAAI
QIMVMTVAAI NKKEINNLYK DSEKTGGEKP IDLIRATRPI VIVDEPQSVD GGLEGRGREA
LAAMAPLCTL RYSATHVDRH HMVYRLDAVD AYERKLVKQI EIASAIVEDA HNKPYVRLVG
VSNRRGAISA RVELDVATTA GVKRQIVTAT DGDDLERLTK RALYAGLRIG EVHAVKGAEY
VELRHPEGEA FLSLGEAFGD IDTLAVQREM IRRTIREHLD KELRLAERGV KVLSLFFVDS
VERYRQYDEN GMPVKGDYAL IFEEEYARAA RVPAYRALFD GVDVALEVGR AHNGYFSIDR
KGGWTDTSES SAAARENAER AYGLIMREKE ALLSFDTPLK FIFSHSALKE GWDNPNVFQI
CTLRDIQTER ERRQTLGRGL RLAVDQDGER VRDAGVNTLT VIATERYESF AENLQKEIEA
DTGIRFGIVE EHQFAALPVQ EGDGPAHALG VELSRVLWNH LHEQGYVDAQ GKVLDRLKDA
LRQSALVLPE AFEMLRAPIV ATLRKLSGRF AVRNADERRA IALRRDASGK AVVFGEEFRA
LWDRIRHRTV YRVEFDNAKL VRDCAAALHD APDIARARLQ WRKAEIDIGK AGIEAIEVAG
AGTVLLDEGE LPLPDLLTEL QDRTQLTRRS LATILADSGR LEDFRVNPQQ FITVAADAIN
RCKRLALVAG IAYRKLGERH VHALESFESE ALTGYLRNLR PDAQKSIHEA VVCETDAERA
FADALEAHDG VKLYAKLPAW FRVQTPLGSY HPDWAVLAEQ DGGERLYFVV DTPNADGNVP
SEHERAKLAC GEAHFRALVD GDGAARFVRV RQADALFEPA VPLVGTGR