Gene Bcep18194_C7089 details

Gene Information

Locus tag: Bcep18194_C7089
Symbol:
ID: 3734658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007509
Strand:
Start bp: 651208
End bp: 652863
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 61%
IMG OID: 637760790
Product: hypothetical protein
Protein accession: YP_366777
Protein GI: 78060202
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
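
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed 1656 bp) and that the last codon is a stop codon that is not counted in the protein. The function names are hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_length_bp // 3 - 1


gene_length = feature_length(651208, 652863)
protein_length = expected_protein_length(gene_length)
print(gene_length)     # 1656
print(protein_length)  # 551
```

Both values agree with the Gene Length and Protein Length fields of this record.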


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAACA AAGATGATGG ACAGATGGGT GCTGTCTCAT TGCATAACTT GCTATCGGGG 
ATTGGGATGT CGTTTGATGA GGACTTCAAT GGCGTCGCGG CACAGGTGCG CGAGGCGGAG
GCTGCCGGAC AGTTCGACCG GGCGGCCGAT CTTTTCCACG TCCTGTTTGA ACGATATCCT
CGTAGCGACG TCGCTCTAAA ATCCTGTCTG GACGCGTTAT TGCGTCAGGG GCAAGCAGGC
AAAGCGCTCG AGGCTTGCGA GCGCGCCATT GCCGCGACAC CCGAATCGAC CGTTCCGTGG
CGCGAAAGAG CGCTTCTCTG CTTCAATCAC CTGAACGATC CCGCCGCCGC CATCGCCAGC
TTGAAAGACG GGCTGGCGGC GTTTCCGTCC AATGCCGAGT TGCATCTGAT GCTTGCCGAC
GCGAGTTTCC AGATGCTCGA CTTTGAGACG TCACGCCGGC ATGGGATGAA GGCGGCCGAA
TTCGGCGATC TGACCATCGC GTTGCGAGCA CGCCATCGAT TCATTGAGGA TCACGAAGGT
GCCGTACACA TCGCACGCGC CATTCTGGAA AGGTACCCGA CGGAGATCGG TGCGCTTGTG
CAGGGCGGTA TTTCGCTCTA CTTGCTGGGG CGCTTTGAAG AAGGTATCGG CTATCTGTAC
CGTGCCGCCG AACACGATCC CTATCGGGGC GAAGTCATCT TTCCGCTGGC GAATCTGCTC
TTGCTGCTGG GTGACACAAA GGCCGGATGG CGCCGATACG AAATGCTCGC GGATCTTGCA
TCGCTCCGCA GCGGCCCTCG TGAATTGACC ACCTATCACG ACCGCCTGTG GCGCGGACAG
CCTCTTGATG GCAAGCGCAT TCTGGTGATC AGCCATCTCG GCCTCGGCGA TTGCCTGATG
TACGCCCGTT ACGCCCGAGA CCTGAAGGCG GCGGGCGCCC ACGTCACGCT CTGCGTCAAG
CCGGAGTTGA TGCAGCTTCT GCGAGAACTC GAAGGCGTCG ACGAGTTGTT GAGCGCCTGG
CCCCTCGAGA CGTGGGGCAA CTACGATTAC TGGATCTTCG AAAACCTGCT GCCCGCGAGA
TTGGGGGCGA GTGACGGGAT CGTGCCTACC TACCGGGACG GCTATATCAA GCTGAAAGAC
CCGGACGCCG CCAAGGCACT GAACGAGCGC AGTCGCCCAT CCGAGCGGTT GCGAATCGGC
CTGTGCTGGG ACACGTCGCC CAATTATTTT GCGGGGCGCT CCCGCAGTCT CTTGCCTGAA
GACCTTCAGC CGTTGGCCGA GATTGAAAAC GTCGACTGGT TCGTGCTTCA GAAACATCCG
CTCGAGCCGG ATTTCGCGGC ACGTAGCGGG CTGTCGATCC TGAATCGATC CGATGAATGG
AGCGATCTCT ACGATACGGC GGTATTTGCG GCATCCCTCG ATCTGACCAT CTCGATCTGC
TCGGCGCCCG TACACCTGGC TGGTTCGCTG GGCCTGCCAG CGTGGGTCAT GCTGGGTGCG
CCGGAGTGGC GGTGGGGCGC GCAAGGCGAC ACGGGGCCCT GGTATCCGCA CATCCGCGTC
TTCCGGCAGG CGACACCCGG TAACTGGCGT AGCGTTACCG AGGCGGTGCG TGCCGCACTC
GAATCGGAGC GCGGGGGCTT GCGTCGAGTG GCTTGA
 
Protein sequence
MRNKDDGQMG AVSLHNLLSG IGMSFDEDFN GVAAQVREAE AAGQFDRAAD LFHVLFERYP 
RSDVALKSCL DALLRQGQAG KALEACERAI AATPESTVPW RERALLCFNH LNDPAAAIAS
LKDGLAAFPS NAELHLMLAD ASFQMLDFET SRRHGMKAAE FGDLTIALRA RHRFIEDHEG
AVHIARAILE RYPTEIGALV QGGISLYLLG RFEEGIGYLY RAAEHDPYRG EVIFPLANLL
LLLGDTKAGW RRYEMLADLA SLRSGPRELT TYHDRLWRGQ PLDGKRILVI SHLGLGDCLM
YARYARDLKA AGAHVTLCVK PELMQLLREL EGVDELLSAW PLETWGNYDY WIFENLLPAR
LGASDGIVPT YRDGYIKLKD PDAAKALNER SRPSERLRIG LCWDTSPNYF AGRSRSLLPE
DLQPLAEIEN VDWFVLQKHP LEPDFAARSG LSILNRSDEW SDLYDTAVFA ASLDLTISIC
SAPVHLAGSL GLPAWVMLGA PEWRWGAQGD TGPWYPHIRV FRQATPGNWR SVTEAVRAAL
ESERGGLRRV A
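
Both sequences above are printed in space-separated groups of ten characters, so the spacing has to be removed before they are used in other tools. The following is a minimal sketch of that cleanup, with a simple GC-fraction helper added for illustration. The helper names are hypothetical, and only the final line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence as printed in this record.
last_gene_line = "GAATCGGAGC GCGGGGGCTT GCGTCGAGTG GCTTGA"
tail = clean_sequence(last_gene_line)

print(len(tail))   # 36
print(tail[-3:])   # TGA, the stop codon that ends the CDS
print(round(gc_fraction(tail), 2))
```

Applying `clean_sequence` to the full gene block should give a string whose length matches the Gene Length field, and the full protein block should match the Protein Length field.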