Gene Bcep18194_A4195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4195 
Symbol 
ID3749388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1128532 
End bp1129902 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content67% 
IMG OID637762479 
Productmicrocin-processing peptidase 1 
Protein accessionYP_368436 
Protein GI78065667 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCA ATCTCGACGT ACAAGCGCGC TATTTCCCGC ACACGCAGGA CCAGCTCAAA 
GAAATCGCCT CGGACATCCT TCGCCATGCG AAGGCACTCG GCGCGACGGA CGCCGCGACC
GAAATCTCCG AGGGCGACGG CCTGTCGGTG TCGGTGCGCC GCGGCGAGGT CGAAACGATC
GAGCACAACC GCGACAAGAT GGTCGGCGTG ACCGTGTTCA TCGGCAAGAA GCGCGGCAAC
GCGAGCACGT CGGATTTTTC GCCGGGCGCG ATCAAGGATA CCGTCGCCGC CGCGTACAAC
ATCGCACGCT TCACGGCCGA GGACGAAGCG GCCGGCCTGG CCGAGGCCGA ACTGCTCGAA
ACCGACCCGC GTGACCTCGA CCTGTACCAC CCGTGGGCGC TGACGGCCGA CGAGGCGGTC
GAGCTCGCCC GCCGCGCGGA AGACGCCGCA TTCGCGGTCA GCCCGCAGAT CCGCAATTCG
GAAGGCGCAA GCGTGTCGGC CCAGCATTCG CAATTCGTGC TGGCCACGTC GCGCGGCTTC
CTGTCCGGCT ACCCGTACTC GCGCCACTAC ATCGCATGCG CACCGATCGC GGGCAGCGGG
CGTCACATGC AGCGCGACGA CTGGTATTCG TCGAAGCGCA GCGCGATCGA TCTCGCGGCA
CCCGAAGCGG TAGGCCGTTA CGCGGCCGAG CGTGCGCTGG CGCGGATGGG CGCACGCCGC
CTCGACACCC GCAAGGTGCC CGTGCTGTTC GAGGCGCCGC TCGCGGCCGG CCTGCTCGGC
GCATTCGTGC AGGCGGTGAG CGGCGGCGCG CTGTACCGCA AGACGTCGTT CCTCGTCGAC
AGCCTCGGCA AGCCGGTGTT CGCACCGCAC ATCCAGGTCG TCGAGGATCC GCACGTGCCG
CGCGCGATGG GCAGCGCGCC GTTCGACGAG GAAGGCGTGC GCACGCGTGC GCGCAGCGTC
GTCAAGGATG GCGTCGTCGA AGGCTACTTC CTGTCGACCT ATTCGGCGCG CAAGCTCGGC
ACGCAGACCA CCGGCAACGC GGGCGGCTCG CACAACCTCG CGCTGCGCAG CTCGAACACG
CAGGCAGGCG ACGATTTCGA CGCGATGCTG AAGAAGCTCG GCACGGGCCT GTTGCTGACC
GAGCTGATGG GGCAGGGCGT GAACTACGTG ACGGGTGACT ATTCGCGCGG TGCGGCGGGC
TTCTGGGTCG AGAACGGCGT AATCCAGTAT CCGGTCGAGG AAATCACCGT CGCGAGCACG
CTGCAGGAAA TGTTCCGGCA TATCGTTGCG ATCGGCGCCG ATTCGATCGT GCGCGGCACG
AAGGAAACGG GCTCGGTGCT GATCGAGCAG ATGACGATCG CCGGGCAGTA A
 
Protein sequence
MAANLDVQAR YFPHTQDQLK EIASDILRHA KALGATDAAT EISEGDGLSV SVRRGEVETI 
EHNRDKMVGV TVFIGKKRGN ASTSDFSPGA IKDTVAAAYN IARFTAEDEA AGLAEAELLE
TDPRDLDLYH PWALTADEAV ELARRAEDAA FAVSPQIRNS EGASVSAQHS QFVLATSRGF
LSGYPYSRHY IACAPIAGSG RHMQRDDWYS SKRSAIDLAA PEAVGRYAAE RALARMGARR
LDTRKVPVLF EAPLAAGLLG AFVQAVSGGA LYRKTSFLVD SLGKPVFAPH IQVVEDPHVP
RAMGSAPFDE EGVRTRARSV VKDGVVEGYF LSTYSARKLG TQTTGNAGGS HNLALRSSNT
QAGDDFDAML KKLGTGLLLT ELMGQGVNYV TGDYSRGAAG FWVENGVIQY PVEEITVAST
LQEMFRHIVA IGADSIVRGT KETGSVLIEQ MTIAGQ