Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A3933 |
Symbol | |
ID | 3749118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 851954 |
End bp | 853789 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637762212 |
Product | carboxypeptidase C (cathepsin A)-like |
Protein accession | YP_368176 |
Protein GI | 78065407 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00462563 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGACAC GGAAGTCCTT GAAAGACGGT TTCGCGCTAT TCGGAACGAC ACTGAGCGTG CCGCTCGCTG CAGCCGCGGC CGCCGCGCTG CTCGTCACGG GGTGCGGTGG CGACGACGGG TCGGGCCCGG CCGCTTCGGC CGCCGCCGCG GCCGCGACGT CGGCCGGCGC CAGCACGTCG GCCAACACCA ACGCAACGGC CGCCGCGGCC GACCAGCCTT ACGTCGACAA CGACGTGTAC GGCACCGGGC CGAACGACGC GGTCACGGAT TCCACGGAAG GGGCGGCTGT CGTGCACCGC ACGGTCACGA TCGGCGGCAA GACCATCAAG TACACGGCTA CCACGGGCCA CCTGACGACG ATCGATCCGA CCACGTCGGC GCCGAACGCG AAGATGTTCT ACGTCGCGTA CACGCAGGAC AATCCGGACC CGTCGAAGCC GCGCCCGGTC ACGTTCTTCT ACAACGGCGG CCCGGGCTCG TCGTCGGTCT ACCTGCTGCT CGGCTCGTAC GGGCCGAAGC GCCTGCAGTC GTCGTTCCCG AACTTCACGC CGCCCGCGCC GTACAAGCTG CTCGACAACC CGGACAGCCT GCTCGACCGC ACCGACCTCG TGTTCATCAA CCCGGTCGGC ACCGGTTACT CGACGGCGAT CGCCCCGGCG AAGAACAAGG ACTTCTGGGG CACCGACCAG GACGCCCGCT CGATCGACCG CTTCATCCAG CGCTACCTGA CCAAGTACTC GCGCTGGAAT TCGCCGAAGT TCCTGTACGG CGAGTCGTAC GGCACCGCGC GCAGCGCAGT CGTGTCGTGG GTGCTGCATG AAGACGGCAT CGACCTGAAC GGGATCACGC TGCAGTCGTC GATCCTCGAC TACGCGAACG CGCTGTCCGC GCCGGGCACG TTCCCGACGC TCGCGGCCGA CGCGTTCTAC TGGAAGAAGA CGACGCTCAA CCCGACGCCG ACCGATCTCG ACGCGTACAT GATCCAGGCC CGCAACTATG CGGACAACAC GCTCGCGCCG CTCGCGCAGA AGCCGAACCC GCAGGACGGC GGCTTCGTGA ACGTGCGCCT GAACCTGAAC CTTCAGACCG CGCAGCAGAT GGGCTCGTAC ATCGGCACGG ATCCGACGTC GCTGATCCAG ACGTTCGGCA ACCCGGCCGC GCTCGGCAAC GTGCCGTCGT CGGATGACAA CCCGCCGTAC ACGTTCTTCC TGACGCTCGT GCCGGGCACG CAGATCGGCC AGTACGACGG TCGCGCGAAC TTCACGGGCA AGGGCATCGC GCCGTACATC CTGCCGAACT CGGGCAGCAA CGATCCGTCG ATCACGAACG TCGGCGGCGC GTACACGGTG CTGTGGAACA GCTATATCAA CACCGACCTC AAGTACACGT CGACGTCGTC GTTCGTGGAC CTGAACGACC AGGTCTTCAA CAACTGGGAC TTCAGCCACA CGGACCCGAC CGGCGCGAAC AAGGGCGGCG GCAATACGCT GTACACGGCC GGCGACCTGG CGTCGACGAT GAGCGTGAAC CCGGACCTGA AGGTGCTGTC GGCCAACGGC TATTTCGACG CCGTCACGCC GTTCCACCAG ACGGAGCTGA CGCTCGCGCA GATGCCGCTC GACCCGACGC TCAAGGCGCA GAACCTGACG ATCAAGAACT ACCCGTCGGG CCACATGATC TACCTGAACG ACGCGTCGCG GACCGCGCTG AAGGGCGATC TCGCCAACTT CTACGACGGC ATCCTGGCCA ACCGCACTGC GCTGCAGCGC GTGCTGAAGC TGCAGATGCG CACGCAGCAG CTCAAGCAGC AGAAGTTGCA GCAGCAGGGG CAGTAA
|
Protein sequence | MTTRKSLKDG FALFGTTLSV PLAAAAAAAL LVTGCGGDDG SGPAASAAAA AATSAGASTS ANTNATAAAA DQPYVDNDVY GTGPNDAVTD STEGAAVVHR TVTIGGKTIK YTATTGHLTT IDPTTSAPNA KMFYVAYTQD NPDPSKPRPV TFFYNGGPGS SSVYLLLGSY GPKRLQSSFP NFTPPAPYKL LDNPDSLLDR TDLVFINPVG TGYSTAIAPA KNKDFWGTDQ DARSIDRFIQ RYLTKYSRWN SPKFLYGESY GTARSAVVSW VLHEDGIDLN GITLQSSILD YANALSAPGT FPTLAADAFY WKKTTLNPTP TDLDAYMIQA RNYADNTLAP LAQKPNPQDG GFVNVRLNLN LQTAQQMGSY IGTDPTSLIQ TFGNPAALGN VPSSDDNPPY TFFLTLVPGT QIGQYDGRAN FTGKGIAPYI LPNSGSNDPS ITNVGGAYTV LWNSYINTDL KYTSTSSFVD LNDQVFNNWD FSHTDPTGAN KGGGNTLYTA GDLASTMSVN PDLKVLSANG YFDAVTPFHQ TELTLAQMPL DPTLKAQNLT IKNYPSGHMI YLNDASRTAL KGDLANFYDG ILANRTALQR VLKLQMRTQQ LKQQKLQQQG Q
|
| |