Gene Bcep18194_A4387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4387 
Symbol 
ID3749586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1345097 
End bp1346317 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content68% 
IMG OID637762676 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_368627 
Protein GI78065858 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.68228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATTT CTCCACCGAT CCAGGCGCTG GCAGCCGACA TGCGCGACTG GCGCCGGCTC 
ATTCACAGCA AACCGGAAAT CGCATTCCAG GAACGCGGCA CGGCCGACTT CATCGCGACG
CGGTTGCGCG AATTCGGCAT CGACGTTCAT ACCGGCGTCG GCCAGACCGG CGTGGTCGGC
ATCGTCGACG GAACGCTCGG CGCCGGCCGC ACCGTCGCCT TGCGCGCCGA CATGGATGCG
CTGCCGATGC GGGAACTGGG CCGCCCGGTC TACCGGTCGG TGTTCGAAGG CATCTTTCAC
GGCTGCGGCC ACGATGGCCA CGTCGCGATC CTGCTCGGCA CCGCACGCCA TCTCGCCGAG
CATCGCCACT TCCGCGGCCG CGTCGTGCTG ATCTTCCAGC CGGCCGAGGA AATCGTCTGC
GGCAGCCGCG CGATGCTCGA CGACGGGTTG CTCGAGCGCT TCCCGTTCGA CGAAATCTAC
AGCCTCCACA ACGACCCGAT GCTGCCGCCG TCGAGGATCG GCGTGCGCGC CGGCGCGCAA
CAGGCATCGT CCGACCGGTT TCAGATCCGG ATCCACGGCA TCGGCACCCA CGCCGGCATG
CCGCACCTCG GCATCGATCC GGTCGCGATC GGCGCGCACC TGGTTGCGAT GCTGCAAACC
GTCGCGAGCC GGTCGGTCGA TCCGCTGGAA AGCGTCGTGA TCTCGATCGC GCGCTTTCAC
GCCGGCGACG CATTCAACGT GATTCCGCAC GAAGCCGTGC TCGGCGGCAC CGTGCGCGCG
CTGTCGAACG ACACGCGCAC CTTCGCGCTC GAGCGGATGC GTGCGATCTG CGACGGCGTC
GCGCTCGCCA ATCGCACGCG CATCGAATTC GAATTGCTGG ACGGCACGCC GGCGATCGTC
AACCACGCCG ACGCGGTGCA GTGCGTGATG GACGCCGCGC GCGAGGTCGT CGGGGCGGAA
AACGTGATCG GCAACGTCAC GCCGCTGATG GCCGGCGACG ACATCGCGAA CTTCCTCGAT
GCACGCCCCG GCTGCCACTT TCTGCTCGGC CAGGGCGGCC ACATGTGCCA TCACCCCGAA
TACGACTTCA ACGACGACGT CGCGCCGATC GGCGTCGCGA TGTTCGCATC GATCCTGCGC
GCGCGGCTGG GCGCCGGCGC CGAACCGATA CCGACCGGCC TCGACGAACG CAGTGCGCTG
GCGGCGGCCG CCGCGCGCTG A
 
Protein sequence
MPISPPIQAL AADMRDWRRL IHSKPEIAFQ ERGTADFIAT RLREFGIDVH TGVGQTGVVG 
IVDGTLGAGR TVALRADMDA LPMRELGRPV YRSVFEGIFH GCGHDGHVAI LLGTARHLAE
HRHFRGRVVL IFQPAEEIVC GSRAMLDDGL LERFPFDEIY SLHNDPMLPP SRIGVRAGAQ
QASSDRFQIR IHGIGTHAGM PHLGIDPVAI GAHLVAMLQT VASRSVDPLE SVVISIARFH
AGDAFNVIPH EAVLGGTVRA LSNDTRTFAL ERMRAICDGV ALANRTRIEF ELLDGTPAIV
NHADAVQCVM DAAREVVGAE NVIGNVTPLM AGDDIANFLD ARPGCHFLLG QGGHMCHHPE
YDFNDDVAPI GVAMFASILR ARLGAGAEPI PTGLDERSAL AAAAAR