Gene Bcep18194_B1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B1333 
Symbol 
ID3753098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp1514897 
End bp1516363 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content71% 
IMG OID637766182 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_372091 
Protein GI78062183 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.448139 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGC GGCCGGCTGC TACGCCTTAT GCGATGCCGG TCGATTTTCC GGCGATCGTC 
GACCGCTACG GCCCGGCGGT CGTGACCATC ATGACCGCCG CACCCGACCA GCAGGCCGGC
GGGCCGTCCT TCTCGATCCT CGATCCCGAC GATCCGCTCG CCGCGTTCTT CCGGCATGAC
GCACCGCCGC CCGCGCAGGG CCCGCAGGCG CCAACCCAGG CGCAAGCCCA GGCCCAGGCG
CCGGCACCCG ACACGGCGCC GCGCGCGGTA TCGGGCAGCG GCTCGGGTTT CATCGTCAGT
GCCGACGGCC TGATCCTCAC CTCCGCCCAC GTGGTCGACG ACGCGACCGA CGTGACGGTG
CGGCTCACCG ACCGCCGCGA ATTTAAGGCG ACCGTGCTGG CCGTCGATCC GCAAAGCGAT
GTCGCGGTGT TGCGTGTCAA CGCAACGAAG CTGCCGTTCG TGCGCGTCGG CGATTCGTCG
AAGGTACGCG CGGGCGAACC GGTGATGACG ATCGGTGCGC CGGACGGGTC GGGCAACACG
GTCACGGCCG GTATCGTCAG CGCGACGTCG CACCGGCTGC CGGATGGCAG CGCGTTCCCG
TTCTTCGAAA CCGACATCGC GCCGAACCCC GACAACTCGG GCGGCCCCGT GTTCAATCGC
GCGGGCGACG TGATCGGTAT CGCGGTGCAG GTCTATACGG GCGCCGACCG CTACGCGAGC
ATGACGTTCG CGATCCCGAT CGCGTTTGCC GCAAAGGTGC GTGCGCAACT GCAGAAGCAG
ATGCAGGCGC AGGCACAGCA GTCGCAGGCG CCGGGCGAGC CGTCATCCGG TAATGCGATC
GGCGTCGACG TGCAGGATGT CGGGATCGGG CTCGCGGCGG CGTTCGGGCT GCCGCGTCCG
GCGGGCGCGC TCGTCAATGG TGTCGCGCCC GGCTCGCCGG CTGCCGCGGC CGGCGTGAAG
CCCGGCGACG TGATCGTGAA GCTGGGCGAC AAGACGATCG GCCGCTCGGC CGAGCTGAAC
GACCTGGCCG CCGCGCTGCC GCCCGGCCAG AAGGCGCCGC TGCGGCTGAT CCGCAACCGT
GCGCCGGTGA TGCTGACGAT CACGGGCGCC GACGAGGCCG CCGGCAACGC GGGTGCGACC
GTCGGCATGG CCACGCCGAA GGCGGCTGCA CGACCGGGGG CGGGAGCAGG GGCAGCCGAC
GGCGGCGACC GCCTCGGGCT GACGATGCAC GCGCTCAGCG ACGACGAGCG CCGCTCGACG
GGGCTCGCGG TGGGAATGAT GGTGGACGCG GTGCACGGCC CGGCAGCCAC TGCGGGCATC
CAGCCGGGCG ATGTCGTGCT GGAGCTCAAC GATACGCTGC TCGAGACGCC CGATGACATA
CCGACGCTCG AAGCGAACGG CAGCGTGATC GCGGTGCTGA TCCAGCGCAA CAACGCAAGG
AAGTTCGTGT CGGTGCGTGC GCGTTAG
 
Protein sequence
MKQRPAATPY AMPVDFPAIV DRYGPAVVTI MTAAPDQQAG GPSFSILDPD DPLAAFFRHD 
APPPAQGPQA PTQAQAQAQA PAPDTAPRAV SGSGSGFIVS ADGLILTSAH VVDDATDVTV
RLTDRREFKA TVLAVDPQSD VAVLRVNATK LPFVRVGDSS KVRAGEPVMT IGAPDGSGNT
VTAGIVSATS HRLPDGSAFP FFETDIAPNP DNSGGPVFNR AGDVIGIAVQ VYTGADRYAS
MTFAIPIAFA AKVRAQLQKQ MQAQAQQSQA PGEPSSGNAI GVDVQDVGIG LAAAFGLPRP
AGALVNGVAP GSPAAAAGVK PGDVIVKLGD KTIGRSAELN DLAAALPPGQ KAPLRLIRNR
APVMLTITGA DEAAGNAGAT VGMATPKAAA RPGAGAGAAD GGDRLGLTMH ALSDDERRST
GLAVGMMVDA VHGPAATAGI QPGDVVLELN DTLLETPDDI PTLEANGSVI AVLIQRNNAR
KFVSVRAR