Gene Information |
Locus tag | BTH_I1421 |
Symbol | |
ID | 3847761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1603959 |
End bp | 1605800 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637841093 |
Product | serine-type carboxypeptidase family protein |
Protein accession | YP_441967 |
Protein GI | 83719333 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
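The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1603959
end_bp = 1605800
reported_gene_length_bp = 1842
reported_protein_length_aa = 613

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS contains one terminal stop codon that is not translated.
codons, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp)     # 1842
print(leftover_bases)     # 0
print(protein_length_aa)  # 613
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.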
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAC AGAAGTCCTT GAAAGACGGT TTCGCGCTTG GATTGTGCAG GGCGGCACGG CCGGTCGCCG CTGCCGCGCT CGCCGCGCTG CTCGTCGCCG CGTGCGGCGG CGACGACGGC GGCGGCGGCC CGTCGCTCGC GGCCGCGAAC GTCGCAAACA CGAGCACGTC GACGAACGCG ACAGCCGCCG GCGCGGCGAA TAATGCGGCG CTCCCGCCGG ATCAACCGTA TATCGACAAC GATGTCTACG GCACCGGGCC GAACGACGCG GTCACCGACG CGACGGAGGG CGCGGCGGTC GTGCACCGGC AGGTGAAGAT CGGCGACCAG ATCCTCACCT ACACGGCGAC GGCCGGCCAC CTCGTGACGA TCGATCCGGT CACGTCGAAG CCGAACGCGA AGATGTTCTA CGTCGCATAC ACGCTCGACA ATCCGAATCC GAGCAAGCCG CGCCCCGTTA CGTTCTTCTA CAACGGCGGC CCGGGCTCGT CGTCGGTGTA CCTGCTGCTC GGTTCGTTCG GACCGAAGCG CTTGCAGTCG TCGTTCCCGA ACTTCACGCC GCCCGCGCCG TACCGGTTGC GCGACAACCC CGAGAGCCTG CTCGACCGCT CGGACCTCGT GTTCATCAAC CCGGTCGGCA CCGGCTACTC GGCCGCGATC GCACCGGCGA AGAACAAGGA TTTCTGGGGC GTCGACCAGG ACGCGCACTC GATCGACCGC TTCATCCAGC GCTACCTGAC GAAGTACGCG CGCTGGAACT CGCCGAAGTT CCTGTTCGGC GAGTCGTACG GCACGGCGCG CAGCGCGGTG ACGTCGTGGG TGCTGCATGA GGACGGCATC GAGCTGAACG GGATCACGCT CCAGTCGTCG ATTCTCGACT ACGCGAACGC GGTGAGCGCG ATCGGCATCT TCCCGACGCT CGCGGCCGAT GCGTTCTACT GGAACAAGAC GACCATCAGC CCGAAACCGG CCGATCTGGA CGCGTACATG GCGCAGGCGC GCAGCTACGC GGACAACGTG CTCGCGCCGC TCGCGCAGGC GCCGAATCCG CAGGACGGCG GCTTCGTCAA CGTGCGGCTG AACCTGAACC TCGCGGCCGC GCAGCAGATG GGCGCGTACA TCGGCACCGA TCCGGTTTCG CTGATCCAGA CGTTCGGCAA TCCGGCCGCG CTCGGCAACG TGCCGTCGTC CGACGACAAC CCGCCGTACA CGTTCTTCCT GACGCTCGTG CCGGGCGTCC AGATCGGCCA GTACGACGGG CGCGCGAACT ACACGGGCAA GGGCATCGCG CCGTACATCC TGCCGAACTC GGGCAGCAAC GATCCGTCGA TCAGCAACGT CGGCGGCGCG TACACGGTGC TGTGGAACGA CTACATCAAC AACGACCTGA AGTACGTGTC GACGTCGTCG TTCGTCGATC TGAACGACCA GGTGTTCAAC AACTGGGACT TCAGCCACAC GGACCCGACG GGCGCGAACC GCGGCGGCGG CAATACGCTG TATACGGCGG GCGATCTCGC CGCGACGATG AGCCTGAATC CGGACCTGAA GGTGTTGTCG GCGAACGGCT ATTTCGACGC GGTGACGCCG TTCCACCAGA CCGAGCTCAC GCTGCAGCAG ATGCCGCTCG ATCCGTCGAT CAAGTCGGCG AACCTGACGA TGAAATACTA TCCGTCGGGC CACATGATCT ATCTGAACGA TAACTCGCGG ATCGCGATGA AGGCGGATCT CGCGACGTTC TACGACGGCA TCCTCACGGA TCGCAAGGCG CTGCAACGCG TGCTGCTGCG TCAGCAGAAG 
GCGCTGCAGT TAAAGCAACA GAAGCAACAG CAAGGGCAGT GA
|
Protein sequence | MKIQKSLKDG FALGLCRAAR PVAAAALAAL LVAACGGDDG GGGPSLAAAN VANTSTSTNA TAAGAANNAA LPPDQPYIDN DVYGTGPNDA VTDATEGAAV VHRQVKIGDQ ILTYTATAGH LVTIDPVTSK PNAKMFYVAY TLDNPNPSKP RPVTFFYNGG PGSSSVYLLL GSFGPKRLQS SFPNFTPPAP YRLRDNPESL LDRSDLVFIN PVGTGYSAAI APAKNKDFWG VDQDAHSIDR FIQRYLTKYA RWNSPKFLFG ESYGTARSAV TSWVLHEDGI ELNGITLQSS ILDYANAVSA IGIFPTLAAD AFYWNKTTIS PKPADLDAYM AQARSYADNV LAPLAQAPNP QDGGFVNVRL NLNLAAAQQM GAYIGTDPVS LIQTFGNPAA LGNVPSSDDN PPYTFFLTLV PGVQIGQYDG RANYTGKGIA PYILPNSGSN DPSISNVGGA YTVLWNDYIN NDLKYVSTSS FVDLNDQVFN NWDFSHTDPT GANRGGGNTL YTAGDLAATM SLNPDLKVLS ANGYFDAVTP FHQTELTLQQ MPLDPSIKSA NLTMKYYPSG HMIYLNDNSR IAMKADLATF YDGILTDRKA LQRVLLRQQK ALQLKQQKQQ QGQ
|
| |
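As a rough consistency check between the gene and protein sequences, the sketch below translates the first three and the last two 10-base blocks of the gene sequence using the amino-acid assignments of NCBI translation table 11. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The sketch also assumes an ATG start, as in this record, so the alternative start codons permitted by table 11 are not handled.

```python
from itertools import product

# Amino acids for NCBI translation table 11, with codons enumerated
# in the order T, C, A, G at each position (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon; '*' marks a stop codon. Trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Fragments copied from the start and end of the Gene sequence field.
first_blocks = "ATGAAGATAC AGAAGTCCTT GAAAGACGGT"
last_blocks = "CAAGGGCAGT GA"

print(translate(first_blocks))  # MKIQKSLKDG
print(translate(last_blocks))   # QGQ*
```

The two outputs match the first ten residues and the final three residues of the Protein sequence field, followed by the stop codon. Passing the full Gene sequence field to `gc_fraction` should give a value comparable to the reported GC content.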