Gene Bcep18194_A5947 details


Gene Information

Locus tag: Bcep18194_A5947
Symbol:
ID: 3751178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 3074444
End bp: 3075928
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 68%
IMG OID: 637764266
Product: peptidase S1C, Do
Protein accession: YP_370185
Protein GI: 78067416
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
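
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon (neither assumption is stated in the record; the function names are illustrative):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3074444, 3075928)
print(length, protein_length(length))  # prints: 1485 494
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields.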


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.314622
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACCC GATTCCTTGC GCGCGGCGCC GTCGCGGTCG CCGTGGCGGC CGCCCTGTCC 
GCCGGCTATG TCGCCGGCAC CCGCCGCGCC GATCCGCAGA TCATCACGCC CGCACAGGCG
GCGGCGCTGA TGCCGGCCGA AGCGGCCGCG AAGACCGGCA TCCCCGATTT CTCCGGGCTC
GTCGAGACCT ACGGCCCGGC GGTCGTGAAC ATCAGCGCGA AGCACGTCGT GAAGCAGGTG
TCGCGGCGCG TGCCGCAGCC GCAACTGCCG ATCGATCCGA GCGATCCGTT CTACCAGTTC
TTCAAGCACT TCTACGGCCA GGTGCCGGGC ATGGGCGGTG ACGCGCAGCC GGACGACCAG
CCGAGCGCGA GCCTCGGTTC CGGGTTCGTC ATCAGCTCCG ACGGGTACAT CCTCACCAAT
GCGCACGTGA TCGACGGCGC GAACGTCGTC ACGGTCAAGC TGACCGACAA GCGCGAATAC
AAGGCGAAGG TCGTCGGCTC CGACAAGCAG TCCGACGTCG CGGTGCTGAA GATCGACGCA
AGCGGCCTGC CGACCGTGAA GATCGGCGAC CCGGCGCAGA GCAAGGTCGG CCAGTGGGTC
GTCGCGATCG GTTCGCCGTA CGGCTTCGAC AACACGGTCA CGTCGGGCAT CATCAGCGCG
AAGTCGCGTG CGCTGCCGGA CGAGAACTAC ACGCCGTTCA TCCAGACCGA CGTGCCGGTG
AACCCCGGCA ACTCGGGCGG CCCGCTGTTC AACCTGCAGG GTGAGGTGAT CGGCATCAAC
TCGATGATCT ACTCGCAGAC GGGCGGCTTC CAGGGCCTGT CGTTCGCGAT CCCGATCAAC
GAGGCGATCA AGGTGAAGGA CGAACTCGTG AAGACGGGCC ACGTGAGCCG CGGCCGCCTC
GGCGTCGCCG TGCAGGGGCT GAACCAGACG CTCGCGAGCT CGTTCGGACT GCAGAAGCCG
GACGGCGCGC TCGTCAGCTC GGTCGATCCG AACGGCCCGG CCGCGAAGGC CGGCCTGCAG
CCGGGTGACG TGATCCTGTC GGTCAACGGC TCGCCGGTTG CCGATTCGAC GTCGCTGCCC
GCGCAGATCG CGAACCTGAA GCCGGGTTCG AAGGCCGACC TGCAGATCTG GCGCGACAAG
TCGAAGAAGT CGATCAGCGT GACGCTCGGC GCGATGGCCG ACGCGAAGCT CGCGTCGAAC
GACGGCGGCC CTGTTGAACA AGGGCGCCTG GGCGTCGCGG TGCGTCCGCT GTCGCCGCAG
GAGCGCAGTG CCGACAACCT GTCGCACGGC CTGATCGTGC AGCAGGCCGG CGGGCCGGCC
GCGAACGCGG GCATCCAGCC CGGCGACGTG ATCCTCGCGG TCAACGGCCG CCCGGTCACG
AGCCCCGAGC AGTTGCGCGA CGCGGTCAAG GGCGCGGGCA ACAGTCTTGC TCTGCTGATC
CAGCGCGATA ATGCACAGAT CTTCGTGCCG GTCGACCTGA GCTGA
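
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before it can be analysed. A minimal sketch of that cleanup, with a GC-fraction helper that could be used to check the GC content field (68%) against the sequence; the function names are illustrative and not part of the record:

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGAACACCC GATTCCTTGC GCGCGGCGCC GTCGCGGTCG CCGTGGCGGC CGCCCTGTCC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full sequence block to `clean_sequence` should give a string of 1485 bases if the record is internally consistent.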
 
Protein sequence
MNTRFLARGA VAVAVAAALS AGYVAGTRRA DPQIITPAQA AALMPAEAAA KTGIPDFSGL 
VETYGPAVVN ISAKHVVKQV SRRVPQPQLP IDPSDPFYQF FKHFYGQVPG MGGDAQPDDQ
PSASLGSGFV ISSDGYILTN AHVIDGANVV TVKLTDKREY KAKVVGSDKQ SDVAVLKIDA
SGLPTVKIGD PAQSKVGQWV VAIGSPYGFD NTVTSGIISA KSRALPDENY TPFIQTDVPV
NPGNSGGPLF NLQGEVIGIN SMIYSQTGGF QGLSFAIPIN EAIKVKDELV KTGHVSRGRL
GVAVQGLNQT LASSFGLQKP DGALVSSVDP NGPAAKAGLQ PGDVILSVNG SPVADSTSLP
AQIANLKPGS KADLQIWRDK SKKSISVTLG AMADAKLASN DGGPVEQGRL GVAVRPLSPQ
ERSADNLSHG LIVQQAGGPA ANAGIQPGDV ILAVNGRPVT SPEQLRDAVK GAGNSLALLI
QRDNAQIFVP VDLS
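
The protein sequence can be re-derived from the gene sequence. A minimal sketch, assuming the gene sequence is given in the coding orientation and using the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in their permitted start codons); the names `CODON_TABLE` and `translate` are illustrative:

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and newlines
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons and last five codons of the gene sequence above.
print(translate("ATGAACACCC GATTCCTTGC G"))  # prints: MNTRFLA
print(translate("GTCGACCTGA GCTGA"))         # prints: VDLS
```

Both fragments match the start and end of the protein sequence listed in the record.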