Gene Bcep18194_A3933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3933 
Symbol 
ID3749118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp851954 
End bp853789 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content66% 
IMG OID637762212 
Productcarboxypeptidase C (cathepsin A)-like 
Protein accessionYP_368176 
Protein GI78065407 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00462563 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACAC GGAAGTCCTT GAAAGACGGT TTCGCGCTAT TCGGAACGAC ACTGAGCGTG 
CCGCTCGCTG CAGCCGCGGC CGCCGCGCTG CTCGTCACGG GGTGCGGTGG CGACGACGGG
TCGGGCCCGG CCGCTTCGGC CGCCGCCGCG GCCGCGACGT CGGCCGGCGC CAGCACGTCG
GCCAACACCA ACGCAACGGC CGCCGCGGCC GACCAGCCTT ACGTCGACAA CGACGTGTAC
GGCACCGGGC CGAACGACGC GGTCACGGAT TCCACGGAAG GGGCGGCTGT CGTGCACCGC
ACGGTCACGA TCGGCGGCAA GACCATCAAG TACACGGCTA CCACGGGCCA CCTGACGACG
ATCGATCCGA CCACGTCGGC GCCGAACGCG AAGATGTTCT ACGTCGCGTA CACGCAGGAC
AATCCGGACC CGTCGAAGCC GCGCCCGGTC ACGTTCTTCT ACAACGGCGG CCCGGGCTCG
TCGTCGGTCT ACCTGCTGCT CGGCTCGTAC GGGCCGAAGC GCCTGCAGTC GTCGTTCCCG
AACTTCACGC CGCCCGCGCC GTACAAGCTG CTCGACAACC CGGACAGCCT GCTCGACCGC
ACCGACCTCG TGTTCATCAA CCCGGTCGGC ACCGGTTACT CGACGGCGAT CGCCCCGGCG
AAGAACAAGG ACTTCTGGGG CACCGACCAG GACGCCCGCT CGATCGACCG CTTCATCCAG
CGCTACCTGA CCAAGTACTC GCGCTGGAAT TCGCCGAAGT TCCTGTACGG CGAGTCGTAC
GGCACCGCGC GCAGCGCAGT CGTGTCGTGG GTGCTGCATG AAGACGGCAT CGACCTGAAC
GGGATCACGC TGCAGTCGTC GATCCTCGAC TACGCGAACG CGCTGTCCGC GCCGGGCACG
TTCCCGACGC TCGCGGCCGA CGCGTTCTAC TGGAAGAAGA CGACGCTCAA CCCGACGCCG
ACCGATCTCG ACGCGTACAT GATCCAGGCC CGCAACTATG CGGACAACAC GCTCGCGCCG
CTCGCGCAGA AGCCGAACCC GCAGGACGGC GGCTTCGTGA ACGTGCGCCT GAACCTGAAC
CTTCAGACCG CGCAGCAGAT GGGCTCGTAC ATCGGCACGG ATCCGACGTC GCTGATCCAG
ACGTTCGGCA ACCCGGCCGC GCTCGGCAAC GTGCCGTCGT CGGATGACAA CCCGCCGTAC
ACGTTCTTCC TGACGCTCGT GCCGGGCACG CAGATCGGCC AGTACGACGG TCGCGCGAAC
TTCACGGGCA AGGGCATCGC GCCGTACATC CTGCCGAACT CGGGCAGCAA CGATCCGTCG
ATCACGAACG TCGGCGGCGC GTACACGGTG CTGTGGAACA GCTATATCAA CACCGACCTC
AAGTACACGT CGACGTCGTC GTTCGTGGAC CTGAACGACC AGGTCTTCAA CAACTGGGAC
TTCAGCCACA CGGACCCGAC CGGCGCGAAC AAGGGCGGCG GCAATACGCT GTACACGGCC
GGCGACCTGG CGTCGACGAT GAGCGTGAAC CCGGACCTGA AGGTGCTGTC GGCCAACGGC
TATTTCGACG CCGTCACGCC GTTCCACCAG ACGGAGCTGA CGCTCGCGCA GATGCCGCTC
GACCCGACGC TCAAGGCGCA GAACCTGACG ATCAAGAACT ACCCGTCGGG CCACATGATC
TACCTGAACG ACGCGTCGCG GACCGCGCTG AAGGGCGATC TCGCCAACTT CTACGACGGC
ATCCTGGCCA ACCGCACTGC GCTGCAGCGC GTGCTGAAGC TGCAGATGCG CACGCAGCAG
CTCAAGCAGC AGAAGTTGCA GCAGCAGGGG CAGTAA
 
Protein sequence
MTTRKSLKDG FALFGTTLSV PLAAAAAAAL LVTGCGGDDG SGPAASAAAA AATSAGASTS 
ANTNATAAAA DQPYVDNDVY GTGPNDAVTD STEGAAVVHR TVTIGGKTIK YTATTGHLTT
IDPTTSAPNA KMFYVAYTQD NPDPSKPRPV TFFYNGGPGS SSVYLLLGSY GPKRLQSSFP
NFTPPAPYKL LDNPDSLLDR TDLVFINPVG TGYSTAIAPA KNKDFWGTDQ DARSIDRFIQ
RYLTKYSRWN SPKFLYGESY GTARSAVVSW VLHEDGIDLN GITLQSSILD YANALSAPGT
FPTLAADAFY WKKTTLNPTP TDLDAYMIQA RNYADNTLAP LAQKPNPQDG GFVNVRLNLN
LQTAQQMGSY IGTDPTSLIQ TFGNPAALGN VPSSDDNPPY TFFLTLVPGT QIGQYDGRAN
FTGKGIAPYI LPNSGSNDPS ITNVGGAYTV LWNSYINTDL KYTSTSSFVD LNDQVFNNWD
FSHTDPTGAN KGGGNTLYTA GDLASTMSVN PDLKVLSANG YFDAVTPFHQ TELTLAQMPL
DPTLKAQNLT IKNYPSGHMI YLNDASRTAL KGDLANFYDG ILANRTALQR VLKLQMRTQQ
LKQQKLQQQG Q