Gene Bcep18194_A5455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A5455 
Symbol 
ID3750675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp2532190 
End bp2534304 
Gene Length2115 bp 
Protein Length704 aa 
Translation table11 
GC content70% 
IMG OID637763763 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_369693 
Protein GI78066924 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATT CCTTCCGCTG GCCCGCTGGG GCCGACCCGT TCCGTTTCCT CGAATCGCTC 
GACAGCAAGC GCGCCCGCAC CTGGGTCGAC GAGCAGAATG CGCGCACGCG CGCCGCGCTG
CGTGACGACG ACGCGTATCG CGCGCTGACC GCGCGCCTCG CCAAAGCGTA TCTGCCGCGC
GAACGCCCGG TGATTCCGAC CCGCTGGCGC GACTGGGCCT ACGACCTGTG GCAGGACGAT
CTCCATCCGA AAGGGCTGTG GCGCCGCACG CGCTGGGACG ACTGGCGCGC GGGAAACCCG
GCGTGGGAAA CGCTGCTCGA CGTCGACGCG CTTGGTGCGG AGGAGGGCGA GTCGTGGGTG
TTCGAGCAGG ACTCGATCCT GTATCCGGAC GGTGATCGGG CGCTGCTGTC GCTGTCGCCG
GGCGGCGCCG ACGCGGTCGT CGTGCGTGAA TTCGATCTCG TCGAACGGCG TTTCGTCGAC
GGCGGCTTCA CGATCGACGA GCCGGGGCAT CACACGGTCG GCTGGATCGA TCGCGATACC
GTCTACGTGA GCTGGGATCG CGGCGAAGCG CATGCGACCG CGGCCGGCTA TCCGTATGAA
GCGCGGCGCT GGGTGCGCGG CACGGCGCTC GCCGATGCGC CCGTCGTGTT CAGCGGCGAA
CCCGACGACA TCAGTGCGGG TGCGGGGTTC GATCCGATCG ACAACCGTCA CGTCGCGTGG
CGCAGCGTCG ATTTCTTCGA CGCACATGCG TACCGGCTGA CCGACACGGG CGAGTGGGCG
CGCTACGACG TGCCGACCCA TGTCGAGGTC GGGTTCTGGG AGGGCTGGCT CATGCTGGAG
CCGCGCCTCG ACTGGGATTG TGACGGCGTG CGCCATGCGG GCGGTTCGCT GCTCGCGATC
CGCGAGCAGG CGTTTCTCGC CGGGTCGCGC GCACTCACGA CGCTGTTCGT GCCGCAACCG
ACGACATCCG CGTGCACGTG GACGCACACG CGCACGACGC TGATCGCGAG CTGGCTCGAC
GACGTGCACA ACCGCACGAT GCTGTGGCAG CCGAGCCAGG CCGACGACGG TACGTGGACG
TGGGATGCCC GGCCGTTCGA GTGGCCGGGG CCGGGCGACG CGCAGATCGA CGTCGAGCCG
GTCGAAGCGA CGCTGAACGA CGAGATTTAC GTGGACGTCG ACACCTATCT CGATCCGCCC
GAATGCTGGC TGGCCGATCT TGCCGATCGC GCGGACGACG CGCCGTCGCG TCGCGTGCTG
CTCGACCGGC CGCCGGTGCA GTTCGATGCG GCCGGGCTGG TCGTGCGTCG TGCGAGCGCG
CGCTCGCAAG ACGGCACGAT CGTGCCGTAC ACGCTGATCG GGCCACGCGA TGCACTCGAC
GTGGCCGACG GCGCCGCGCG CGTTGCGCGG CCGTGCCTGC TATCGGGCTA TGGCGGTTTT
GCGATTCCGA ACCTGCCGGG CTACAGCGAT GCGTTCGGTA TCGGGTGGCT CGAGCGCGGC
GGCGTGATGG CGTTCGCGCA CATCCGCGGC GGCGGCGAGT TCGGGCCGCG CTGGCACGTC
GATGCGCAGC GCGAACACCG CCAGCGTTCC TTCGACGATT TCATCGCGGT GGCCGAGGAT
CTGGCCGCGA CCGGCGTGAC GACCGCCGCG CAGCTCGGCA TCGAGGGGGG CAGCAACGGC
GGGTTGCTGG TCGCCGCGTG CATGGTGCAG CGGCCGGAGC TGTTCGGCGC GGTGCTCTGC
CGCGTGCCGC TGCTCGACAT GCGGCGTTAT CCGAAGCTGC ACGCGGGCGC CGCATGGCTC
GACGAATACG GCGATCCGGA CGATCCGCAT GAAGGCGCGG CGCTGGCCGC GTACTCGCCG
TATCACCGCG TGCGCGAAGG TGTCGCGTAT CCGCCGCTGC TGCTGACGAC GTCGACGCGC
GACGACCGTG TGCATCCCGC GCATGCGCGC AAGATGGCCG CGCGCATGCA TGCGCTCGGT
CACGAACGGG TGTGGTACTG GGAGAACACC GACGGTGGCC ACGGCAGCGC CGACGATCTG
GAACGCGCGG AAGCCGATGC GGCCGAATTC GGGTTCCTGT GGGCCCATCT CGGGCCGGCG
CCCGCGCGGC GCTGA
 
Protein sequence
MSDSFRWPAG ADPFRFLESL DSKRARTWVD EQNARTRAAL RDDDAYRALT ARLAKAYLPR 
ERPVIPTRWR DWAYDLWQDD LHPKGLWRRT RWDDWRAGNP AWETLLDVDA LGAEEGESWV
FEQDSILYPD GDRALLSLSP GGADAVVVRE FDLVERRFVD GGFTIDEPGH HTVGWIDRDT
VYVSWDRGEA HATAAGYPYE ARRWVRGTAL ADAPVVFSGE PDDISAGAGF DPIDNRHVAW
RSVDFFDAHA YRLTDTGEWA RYDVPTHVEV GFWEGWLMLE PRLDWDCDGV RHAGGSLLAI
REQAFLAGSR ALTTLFVPQP TTSACTWTHT RTTLIASWLD DVHNRTMLWQ PSQADDGTWT
WDARPFEWPG PGDAQIDVEP VEATLNDEIY VDVDTYLDPP ECWLADLADR ADDAPSRRVL
LDRPPVQFDA AGLVVRRASA RSQDGTIVPY TLIGPRDALD VADGAARVAR PCLLSGYGGF
AIPNLPGYSD AFGIGWLERG GVMAFAHIRG GGEFGPRWHV DAQREHRQRS FDDFIAVAED
LAATGVTTAA QLGIEGGSNG GLLVAACMVQ RPELFGAVLC RVPLLDMRRY PKLHAGAAWL
DEYGDPDDPH EGAALAAYSP YHRVREGVAY PPLLLTTSTR DDRVHPAHAR KMAARMHALG
HERVWYWENT DGGHGSADDL ERAEADAAEF GFLWAHLGPA PARR