Gene Bcep18194_B1097 details

Gene Information

Locus tag: Bcep18194_B1097
Symbol:
ID: 3752862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 1237425
End bp: 1238864
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 71%
IMG OID: 637765946
Product: peptidase S1C, Do
Protein accession: YP_371855
Protein GI: 78061947
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
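
The start, end, gene length and protein length fields above can be checked against each other with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length (the gene sequence on this page ends in TGA). The function names are hypothetical and only serve the illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1237425, 1238864
gene_length_bp = 1440
protein_length_aa = 479

print(span_length(start_bp, end_bp) == gene_length_bp)                  # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)    # True
```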


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.676957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0936478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCGG GCAGGCACAC GAACCTGGCA GCGGTGCTGC GAGCGATGGC CGGCGGGCTG 
GCGTTGGCCG GTGCGTGCGC ATGCGGTTGT TCGACGGCAA CGGCTGCCGA GCGGCCCGCG
CAGCCGGGCG CCGCATCGCC AACGCCGACG CCGACCGTCG CGCTGCCGAA TTTTGCGGCG
CTGGTCCGGC GCGTCGGCCC GGCCGTCGTC AACATCAGCG TCACGCGCGA AGTCACGCAG
ATGGGCATCC AGTTGCCGCC GGGCATCGCG CCCGATCATC CGCTCGCGCC GTTTCTTGCG
CGGCGCGTGA TCGGCAACCG CGAGGAGGTG AGCCTCGGTT CGGGCTTCAT CGTCAGCGAG
GACGGCGTGA TCCTGACCAA CCGGCACGTG GTCGGCGACG CCGTTGCGAT CGACGTGAAG
CTGACCGACA AGCGGCAGTT CAAGGGGCGC GTGATCGGCA GCGATCCCGT CTCCGACGTC
GCGGTGATCC GCATCGACGC GCACAACCTG CCGGTCGTCG CGACCGGCGA CCCGGCGCGC
ACCGAAGTGG GCGACTGGGT AATGGCGATC GGCTCGCCAT ACGGGTTCGC GAACACGGTC
ACGCAGGGCA TCGTCAGCGC GAAATCGCGC TCGTTGCCCG GCGAGCGCGC GATTCCGTTC
ATCCAGACCG ACGTGCCGAT CAACCCCGGC AATTCGGGCG GCCCGCTGTT CGATCTCGGC
GGCCGGGTGA TCGCGATCAA CTCGATGATC TTCTCGAAGA CGGGCGGCTA TCAGGGGCTT
GCGTTTGCGA TCCCGATCGA TATCGCGCTC GACGTGAAGG ACCAGTTGCT GCGAACCGGC
AAGGTCACGC GCGGCCGGCT CGGCGTGGCC GTCCAGGAAG TGAGCCAGGC GCTGGCGCGT
TCGTTCGGCC TTGCCAGCCC CGACGGCGCG CTGATCACCA TGGTCGAGCC GGATGGCCCG
GCCGCGCACG CGGGCCTGCA GCCGGGCGAC GTCGTGCTCG CGGTCGATGG CAAGCCGGTC
GCCGAATCGT CGGACCTGCT CGGCACCGTT GCGGGCATGC GTGCCGGCCG GCAAGCCGAC
CTGCTCGTAT GGCGCGCCGG ACGGGCGATG CACATGAGCG CGACGGTCGG CGCATTCGAC
AGCGGCACAG CGGCGGCGAG CGGCGAGCAA GGGCCCGCGC GGTTCGGGCT CGCGTTGCGC
GCGGCGACGG AGCAGGAGCG CCAGCGGCTC GGTGTCGGGC AGGCGCTCGT CGTCGAGCAG
GCGAGCGGCC AGGCTGCGCG CGCGGGGTTG CAGCCCGGCG ATGTCGTGCT GTCGGTCAAC
GGCACGCCGG TCGCGAACAT CGGTGCGCTG ATGACCGAGA TCGACGCCGC GCACGGCAAC
GTCGCGCTGC TCGTCCAGCG CGGCGGTACG CGGCTGTATG TGCCGATCGA GATCGGTTGA
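
The GC content listed in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that ignores the whitespace used to group the sequence into blocks of ten, and it is shown here on the first line of the sequence only. Pasting the full sequence into the same function gives a value that can be compared with the reported 71%.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, copied with its block spacing.
first_line = "ATGAAAGCGG GCAGGCACAC GAACCTGGCA GCGGTGCTGC GAGCGATGGC CGGCGGGCTG"
print(f"{gc_content(first_line):.0%}")
```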
 
Protein sequence
MKAGRHTNLA AVLRAMAGGL ALAGACACGC STATAAERPA QPGAASPTPT PTVALPNFAA 
LVRRVGPAVV NISVTREVTQ MGIQLPPGIA PDHPLAPFLA RRVIGNREEV SLGSGFIVSE
DGVILTNRHV VGDAVAIDVK LTDKRQFKGR VIGSDPVSDV AVIRIDAHNL PVVATGDPAR
TEVGDWVMAI GSPYGFANTV TQGIVSAKSR SLPGERAIPF IQTDVPINPG NSGGPLFDLG
GRVIAINSMI FSKTGGYQGL AFAIPIDIAL DVKDQLLRTG KVTRGRLGVA VQEVSQALAR
SFGLASPDGA LITMVEPDGP AAHAGLQPGD VVLAVDGKPV AESSDLLGTV AGMRAGRQAD
LLVWRAGRAM HMSATVGAFD SGTAAASGEQ GPARFGLALR AATEQERQRL GVGQALVVEQ
ASGQAARAGL QPGDVVLSVN GTPVANIGAL MTEIDAAHGN VALLVQRGGT RLYVPIEIG
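
The protein sequence follows from the gene sequence by codon translation. The sketch below builds the codon table from the NCBI ordering of bases (TCAG); translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG. `translate` is a hypothetical helper written for this illustration: it ignores whitespace, maps unrecognized codons to X, and by default stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids in NCBI order for the standard code (shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAGCGG GCAGGCACAC GAACCTGGCA"))  # prints MKAGRHTNLA
```

The output matches the first ten residues of the protein sequence listed above.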