Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A5947 |
Symbol | |
ID | 3751178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 3074444 |
End bp | 3075928 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637764266 |
Product | peptidase S1C, Do |
Protein accession | YP_370185 |
Protein GI | 78067416 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.314622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCC GATTCCTTGC GCGCGGCGCC GTCGCGGTCG CCGTGGCGGC CGCCCTGTCC GCCGGCTATG TCGCCGGCAC CCGCCGCGCC GATCCGCAGA TCATCACGCC CGCACAGGCG GCGGCGCTGA TGCCGGCCGA AGCGGCCGCG AAGACCGGCA TCCCCGATTT CTCCGGGCTC GTCGAGACCT ACGGCCCGGC GGTCGTGAAC ATCAGCGCGA AGCACGTCGT GAAGCAGGTG TCGCGGCGCG TGCCGCAGCC GCAACTGCCG ATCGATCCGA GCGATCCGTT CTACCAGTTC TTCAAGCACT TCTACGGCCA GGTGCCGGGC ATGGGCGGTG ACGCGCAGCC GGACGACCAG CCGAGCGCGA GCCTCGGTTC CGGGTTCGTC ATCAGCTCCG ACGGGTACAT CCTCACCAAT GCGCACGTGA TCGACGGCGC GAACGTCGTC ACGGTCAAGC TGACCGACAA GCGCGAATAC AAGGCGAAGG TCGTCGGCTC CGACAAGCAG TCCGACGTCG CGGTGCTGAA GATCGACGCA AGCGGCCTGC CGACCGTGAA GATCGGCGAC CCGGCGCAGA GCAAGGTCGG CCAGTGGGTC GTCGCGATCG GTTCGCCGTA CGGCTTCGAC AACACGGTCA CGTCGGGCAT CATCAGCGCG AAGTCGCGTG CGCTGCCGGA CGAGAACTAC ACGCCGTTCA TCCAGACCGA CGTGCCGGTG AACCCCGGCA ACTCGGGCGG CCCGCTGTTC AACCTGCAGG GTGAGGTGAT CGGCATCAAC TCGATGATCT ACTCGCAGAC GGGCGGCTTC CAGGGCCTGT CGTTCGCGAT CCCGATCAAC GAGGCGATCA AGGTGAAGGA CGAACTCGTG AAGACGGGCC ACGTGAGCCG CGGCCGCCTC GGCGTCGCCG TGCAGGGGCT GAACCAGACG CTCGCGAGCT CGTTCGGACT GCAGAAGCCG GACGGCGCGC TCGTCAGCTC GGTCGATCCG AACGGCCCGG CCGCGAAGGC CGGCCTGCAG CCGGGTGACG TGATCCTGTC GGTCAACGGC TCGCCGGTTG CCGATTCGAC GTCGCTGCCC GCGCAGATCG CGAACCTGAA GCCGGGTTCG AAGGCCGACC TGCAGATCTG GCGCGACAAG TCGAAGAAGT CGATCAGCGT GACGCTCGGC GCGATGGCCG ACGCGAAGCT CGCGTCGAAC GACGGCGGCC CTGTTGAACA AGGGCGCCTG GGCGTCGCGG TGCGTCCGCT GTCGCCGCAG GAGCGCAGTG CCGACAACCT GTCGCACGGC CTGATCGTGC AGCAGGCCGG CGGGCCGGCC GCGAACGCGG GCATCCAGCC CGGCGACGTG ATCCTCGCGG TCAACGGCCG CCCGGTCACG AGCCCCGAGC AGTTGCGCGA CGCGGTCAAG GGCGCGGGCA ACAGTCTTGC TCTGCTGATC CAGCGCGATA ATGCACAGAT CTTCGTGCCG GTCGACCTGA GCTGA
|
Protein sequence | MNTRFLARGA VAVAVAAALS AGYVAGTRRA DPQIITPAQA AALMPAEAAA KTGIPDFSGL VETYGPAVVN ISAKHVVKQV SRRVPQPQLP IDPSDPFYQF FKHFYGQVPG MGGDAQPDDQ PSASLGSGFV ISSDGYILTN AHVIDGANVV TVKLTDKREY KAKVVGSDKQ SDVAVLKIDA SGLPTVKIGD PAQSKVGQWV VAIGSPYGFD NTVTSGIISA KSRALPDENY TPFIQTDVPV NPGNSGGPLF NLQGEVIGIN SMIYSQTGGF QGLSFAIPIN EAIKVKDELV KTGHVSRGRL GVAVQGLNQT LASSFGLQKP DGALVSSVDP NGPAAKAGLQ PGDVILSVNG SPVADSTSLP AQIANLKPGS KADLQIWRDK SKKSISVTLG AMADAKLASN DGGPVEQGRL GVAVRPLSPQ ERSADNLSHG LIVQQAGGPA ANAGIQPGDV ILAVNGRPVT SPEQLRDAVK GAGNSLALLI QRDNAQIFVP VDLS
|
| |