Gene Bcep18194_A4242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4242 
Symbol 
ID3749437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1175777 
End bp1177279 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content67% 
IMG OID637762528 
Productpeptidase S1C, Do 
Protein accessionYP_368483 
Protein GI78065714 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAC TCACGCTGCG CAAATGGCTC GCGGCGGCGG CGCTGTCTGC CTGCCTGCCG 
CTCGTGCCGC ATACCGCGTT CGCGGTGACG GCTCCCGCTG CCAGCCTGCC CGACTTCGCC
GACCTGGTCG AAAAGGTCGG GCCGGCCGTC GTGAACATCC GGACCACCGC CAACGTGCCG
ACGAGCAGCG GCCCGCGCGG GATGCTCCCG CCGGGCTTCG ACAACGGCGA CATGTCGGAA
TTCTTCCGGC GCTTCTTCGG CATTCCGCTG CCGCAGGCGC CCGGTAACGG TAGCGGCAAC
GGCGGCAACA ACCCGAAGAA CGCGCCCGCG CCGGACAACC CGCCCGATAC CGAACAGAAT
CGCGGCGTCG GCTCGGGCTT CATCGTGTCG GCCGACGGTT ACGTGATGAC CAATGCGCAC
GTCGTCGACG ACGCGGACAC CATCTACGTG ACGCTGACCG ACAAGCGCGA ATTCAAGGCG
AAGCTGATCG GCGTCGACGA TCGCACGGAC GTCGCGGTCG TCAAGATCCA GGCGTCGAAC
CTGCCGGTCG TCGCGATCGG CGATTCGAAC AAGGTGCGCG TCGGCGAGTG GGTTGTCGCG
ATCGGTTCGC CGTTCGGCCT CGACAACACG GTGACGGCCG GCATCGTCAG CTCGAAGAGC
CGCAATACGG GCGACTACCT GCCGTTCATC CAGACCGACG TCGCCGTGAA CCCCGGCAAC
TCGGGCGGCC CGCTGATCAA CATGCAGGGC GAGGTGATCG GCATCAACTC GCAGATCTAC
AGCCGTACCG GCGGTTTCAT GGGCATCTCG TTCGCGATTC CGATCGACGA GGCGATGCGC
GTGGCCGACC AGCTGAAGGC GACGGGCAAG GTCACGCGCG GCCGGATCGC GGTCGCGATC
GGCGAGGTGA CGAAGGACGT GGCCGATTCG ATCGGGTTGC CGAAGGCCGA GGGTGCGCTC
GTCAGCAGCG TCGAGCCGGG CGGTCCGGCC GACAAGGCGG GCATCCAGCC GGGCGACATC
ATCCTGAAGT TCAATGGCCG TTCGGTCGAT ACGGCGTCGG ACCTGCCGCG CATGGTCGGC
GACACGAAGC CCGGCGCGAA GGCGACGGTC AGCGTATGGC GCAAGGGTCA GGCACGCGAT
CTGCCGATCA CGATCGCGGA GACGCCGGCC GAAACGACGG CGAAGGCCGA GCAGCGCAAG
AATGCGCCGC AGAAGCCGCG CCAGACCAAT TCGCTCGGCC TGACGGTCAG TGATCTGACG
GCGGAGCAGA TGAAGACGCT GAAGCTGAAG AACGGCGTGC AGATCGACGG CGTAGACGGC
CCGGCCGCCC GTGCGGGCCT GCAGCGCGGT GACATCGTGC TGCGCGTCGG CGACACCGAC
ATCACGAACG CGAAGCAGTT TGCCGACGTG ACCGCGCAGC TCGATCCGCA GAAGGCGGTC
GCGGTGCTCG TGCGGCGTGG CGACAACACG CAGTTCGTGC CCGTGCGGCC GCGCCAGAAG
TAA
 
Protein sequence
MMKLTLRKWL AAAALSACLP LVPHTAFAVT APAASLPDFA DLVEKVGPAV VNIRTTANVP 
TSSGPRGMLP PGFDNGDMSE FFRRFFGIPL PQAPGNGSGN GGNNPKNAPA PDNPPDTEQN
RGVGSGFIVS ADGYVMTNAH VVDDADTIYV TLTDKREFKA KLIGVDDRTD VAVVKIQASN
LPVVAIGDSN KVRVGEWVVA IGSPFGLDNT VTAGIVSSKS RNTGDYLPFI QTDVAVNPGN
SGGPLINMQG EVIGINSQIY SRTGGFMGIS FAIPIDEAMR VADQLKATGK VTRGRIAVAI
GEVTKDVADS IGLPKAEGAL VSSVEPGGPA DKAGIQPGDI ILKFNGRSVD TASDLPRMVG
DTKPGAKATV SVWRKGQARD LPITIAETPA ETTAKAEQRK NAPQKPRQTN SLGLTVSDLT
AEQMKTLKLK NGVQIDGVDG PAARAGLQRG DIVLRVGDTD ITNAKQFADV TAQLDPQKAV
AVLVRRGDNT QFVPVRPRQK