Gene Bcep18194_A3537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3537 
Symbol 
ID3748714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp399419 
End bp400624 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content67% 
IMG OID637761811 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_367783 
Protein GI78065014 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.727825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAGAC GCTTCTGGCT GTTCTTCGCG CAGGCGGTTA CCGTGCTGCT CGCGCTGATG 
TTCATCGTCG TGACGCTCAA GCCGCAATGG CTGCAACGGC AAGGACAGCT CGGCAAGCAG
CTCGCCACGC CGATCGTTGC GCTGCGGGAA GTCGCGCCGG GCATCGGCGG GGCGCCGGCG
ACCACGTCGT ACGCCGAAGC CGCGCAGAAG GCGATGCCGG CCGTCGTCAA CGTCTTCTCC
AGCAAGGACG GCTCGCTGCC GCCCGACCCA CGCGCGAAAG ATCCGCTGTT CCGCTACTTC
TTCGGCGACC GCAACGCCCG CAAGCAGCAG GACGAACCGG CCGCCAACCT TGGCTCGGGC
GTTATCGTGA GCCCTGAAGG TTACATTCTA ACGAACCAGC ACGTCGTGGA CGGCGCCGAC
CAGATCGAAG TCGCGCTCGC CGACGGCCGC ACGGCCACCG CGAAGGTGAT CGGCAGCGAT
CCCGAAACCG ATCTCGCCGT GCTGAAGATC AACATGACGA ACCTGCCGAC GATCACGCTC
GGCCGCTCCG ACCAGTCGCG GGTCGGCGAC GTCGTGCTCG CGATCGGCAA CCCGTTCGGC
GTCGGCCAGA CGGTCACGAT GGGGATCATC AGCGCGCTCG GCCGCAATCA CCTCGGCATC
AACACGTTCG AGAACTTCAT CCAGACCGAC GCACCGATCA ACCCCGGCAA CTCGGGCGGC
GCGCTGGTCG ACGTGAACGG CAACCTGCTC GGCATCAACA CGGCGATCTA CTCGCGCTCG
GGCGGCTCGC TCGGCATCGG CTTCGCGATT CCCGTATCGA CTGCGCGCAC GGTGCTGGAA
AGCATCATCA CGTCCGGCTC GGTCACGCGC GGCTGGATCG GCGTCGAGCC GCAGGACGTC
ACGCCGGAGA TCGCCGAGTC GTTCGGGCTG TCGCAGAAAT CAGGCGCGAT CGTCGCCGGC
GTGCTGCAGG GCGGCCCGGC CGACAAGGCC GGCATCAAGC CGGGCGACAT CCTGGTCTCG
GTCAACGGCG ACGAGATCAC CGACACGACG AAGCTGCTGA ACACCGTCGC GCAGATCAAG
CCCGGCACGC CGACCAAGGT GCACGTCGTG CGCAAGGGCA AGGAGTTCGA CGTGAACGTG
GTGATCGGCA AGCGCCCGCC GCCGCCGAAG CAGGCGCTCG ACGAGCAGGA CAGCGATACC
GAGTGA
 
Protein sequence
MLRRFWLFFA QAVTVLLALM FIVVTLKPQW LQRQGQLGKQ LATPIVALRE VAPGIGGAPA 
TTSYAEAAQK AMPAVVNVFS SKDGSLPPDP RAKDPLFRYF FGDRNARKQQ DEPAANLGSG
VIVSPEGYIL TNQHVVDGAD QIEVALADGR TATAKVIGSD PETDLAVLKI NMTNLPTITL
GRSDQSRVGD VVLAIGNPFG VGQTVTMGII SALGRNHLGI NTFENFIQTD APINPGNSGG
ALVDVNGNLL GINTAIYSRS GGSLGIGFAI PVSTARTVLE SIITSGSVTR GWIGVEPQDV
TPEIAESFGL SQKSGAIVAG VLQGGPADKA GIKPGDILVS VNGDEITDTT KLLNTVAQIK
PGTPTKVHVV RKGKEFDVNV VIGKRPPPPK QALDEQDSDT E