Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A3697 |
Symbol | |
ID | 3748881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 585409 |
End bp | 587019 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637761977 |
Product | carboxypeptidase C (cathepsin A)-like |
Protein accession | YP_367942 |
Protein GI | 78065173 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.393293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.740754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGCGA GCAAAGCGAA GCTGTTGTTG GGAGTGGTGT TTTCTTCGCT GATTCTGACG GCCTGCAACG ACGACGTGAC ATCGTCCTCT GCTGCCAGCG CCGACAATTC AGCCGATCCC GCCAGTCAGG TGGACAGGGC GTACAACGAT CCGAACAGTT ATTCGTCGAG CGCGAACGCG TCGCTCGACG CCTCCGCCGC GGTTGAAAAG GCCGCCGTCA CGCATCACCA GATCACACTC AACGGCAAGA CGATCCGCTA CACGGCCACG GCCGGCCACC TCGTCGCGCG CAATCCGCAG ACCGGCGCGC CCGAAGCATC GTTCTTCTAC GTCGCCTATA CGGCCGACAA CCAGCCCGCG GCGAAGCGGC CCGTCACGTT CCTGTACAAC GGCGGCCCCG GCTCGGCCTC GGTGTGGCTG CATCTCGGCT CGTTCGGCCC GCGCCGGATC CAGACGGGCG ACCCGAACGC GAACACGTCG ACGTTCCCGT TCGTCGACAA CCAGGAAAGC CTGCTCGACA CGACCGACCT CGTGTTCGTC GATGCGATCG GCACCGGCTT CTCCGAAGCG GTCGCGCCGA ACACGAACCA GACGTTCTGG GGCGTCGACC AGGACGGCGG TGCGTTCCGC GATTTCGTGA CGCGCTACAT CGCCGTGAAC CAGCGCAACG ATTCGCCGAA ATACCTGTTC GGCGAGTCGT ACGGCACGCC GCGCACCGAC GTGCTCGCGA ACCTGCTCGA GACGGCCGGC GTGAAGCTCG CCGGCATCGT GCTGCAGTCG TCGATCCTGA ACTACAACGT GAACTGCGAC ATGGCGAGCG ACTACATCGG CAACTCGAAC AACGGGTCGA GCCCGGTGAG TTGCGCGGGC TTCGTGCCGT CGTATGGCAC CGTCGGCGCG TACTACCAGC TCGACAACCC GAACCCGTCG AGCCTGCCGC AGTATGCGGA CCAGATGCGC CTGCTGACGG CCGGCAGCTA CGCGCCTGCC GTGAACGCGT ACCTCGCGAG CCATACGCCG CCCCCGCCGT CCCTCGTCAC GACGATGGTG AATTCGACCG GCGTGAAGCA GTCGCTGTGG AATGCGAACT TCAACGTGAT CCCGACCTTC TTCGACAACA GCTTCCAGTT GTCGCTGATC CCGGGCACGC TGATCGGCCG CTACGACGCG CGCGTGAACG TGCCGGTGTC GAGCCCGCTG GCATCCGGCG GCGACCCGTC GAGCACGTAC ATCACGCAGC CGTTCACCGA CACGATCGGC AAGTACCTGC CGAACGAGCT GAAGTACACC GCGCAGTCGT CGTATTCGTT GAGCAGCAAT GCGATCAACA CGTGGGACTG GACGCACGAC GGCCTCGCGA TGCCGGACAC GATCCCCGAT CTCGCTGCGG CGCTGTCGCT GAATCCGCAA CTGAAGGTGT TGTCGCTGAA CGGGTATCAC GACATCGCGA CGCCGTTCTA CCAGACCGAG CTCGATCTCG CGCGGCTCGG GACGCAACCG AACCTGACGA TCAAGGACTA TCAGGGCGGA CACATGGTCT ATCTCGACGA TACGTCGCGT CCGCAGGAGA AAGCGGACCT CGTGACCTTC TACAACGCGG CCGCGCACTG A
|
Protein sequence | MPASKAKLLL GVVFSSLILT ACNDDVTSSS AASADNSADP ASQVDRAYND PNSYSSSANA SLDASAAVEK AAVTHHQITL NGKTIRYTAT AGHLVARNPQ TGAPEASFFY VAYTADNQPA AKRPVTFLYN GGPGSASVWL HLGSFGPRRI QTGDPNANTS TFPFVDNQES LLDTTDLVFV DAIGTGFSEA VAPNTNQTFW GVDQDGGAFR DFVTRYIAVN QRNDSPKYLF GESYGTPRTD VLANLLETAG VKLAGIVLQS SILNYNVNCD MASDYIGNSN NGSSPVSCAG FVPSYGTVGA YYQLDNPNPS SLPQYADQMR LLTAGSYAPA VNAYLASHTP PPPSLVTTMV NSTGVKQSLW NANFNVIPTF FDNSFQLSLI PGTLIGRYDA RVNVPVSSPL ASGGDPSSTY ITQPFTDTIG KYLPNELKYT AQSSYSLSSN AINTWDWTHD GLAMPDTIPD LAAALSLNPQ LKVLSLNGYH DIATPFYQTE LDLARLGTQP NLTIKDYQGG HMVYLDDTSR PQEKADLVTF YNAAAH
|
| |