Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3144 |
Symbol | |
ID | 4884322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3080763 |
End bp | 3082622 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640129072 |
Product | serine carboxypeptidase family protein |
Protein accession | YP_001060156 |
Protein GI | 126441836 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.875872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAC AGAAGTCCTT GAAAGACGGT TTCATGCTCG GATGGTGCAG GGCGGCACGG CCGGTTGCCG CTGCCGCGCT GGCCGCGCTG CTCGTCGCCG CGTGCGGCGG CGACGACGGC GGCGGCAGCA GCCCGTCGCT CGCGGCCGCG AACGTCGCGA ACACGAGCAC GTCGACGAAC GCGACGACGA ACGCGACGAC GGCCGCCGAT GCGACGACCA ACGCCGCGCT GCCGCCGGAC CAGCCGTATA TCGACAACGA CGTCTATGGC ACCGGGCCGA ACGATTCGGT CAGCGACGCG ACGGAGGGCA CCGCGGTCGT GCACCGGCAG GTGAAGATCG GCGATCAGAT CCTCACCTAC ACGGCGACGG CCGGCCACCT CGTGACGATC GATCCGATCA CGTCGAAGCC GAACGCGAAG ATGTTCTACG TCGCGTACAC GCTCGACAAT CCGAACCCGG GCAAGCCGCG CCCCGTCACG TTCTTCTACA ACGGCGGCCC GGGCTCGTCG TCGGTGTACC TGCTGCTGGG CTCGTTCGGG CCGAAGCGCC TGCAGTCGTC GTTCCCGAAC TTCACGCCGC CCGCGCCGTA CCGGCTGCGC GACAACCCCG AGAGCCTGCT CGACCGCTCC GATCTCGTGT TCATCAATCC GGTCGGCACC GGCTACTCGG CCGCGATCGC GCCGGCGAAG AACAAGGATT TCTGGGGCGT CGACCAGGAC GCGCACTCGA TCGACCGCTT CATCCAGCGC TACCTGACGA AGTACGCGCG CTGGAACTCG CCGAAGTTCC TGTTCGGCGA ATCGTACGGC ACGGCGCGCA GCGCGGTGAC CGCGTGGGTG CTGCATGAGG ACGGCATCGA GCTGAACGGG ATCACGCTGC AGTCGTCGAT TCTCGACTAT GCGAACGCGG TGAGCGCGAT CGGCATCTTC CCGACGCTCG CGGCCGATGC GTTCTACTGG AACAAGACGA CCATCAGCCC GAAACCGGCC GATCTGGACG CATACATGGC GCAGGCGCGC AGCTATGCGG ACAACGTGCT CGCGCCGCTC GCGCAGGCGC CGAATCCGCA GGACGGCGGC TTCGTCAACG TGCGGCTGAA CCTGAACGTC GCGACCGCGC AGCAGATGGG CGCGTACATC GGCACCGATC CGATCTCGCT GGTCCAGACG TTCGGCAATC CGGCCGCGCT CGGCAACGTG CCGTCGTCCA ACGACAACCC GCCGTACACG TTCTTCCTGA CGCTCGTGCC GGGCATCCAG ATCGGCCAGT ACGACGGACG CGCGAACTAC ACGGGCAAGG GCATCGCGCC GTATATCCTG CCGAACTCGG GCAGCAACGA TCCGTCGATC AGCAACGTCG GCGGCGCGTA CACGGTGCTG TGGAACGACT ACATCAACAA CGACCTGAAG TATGTGTCGA CGTCGTCGTT CGTCGATCTG AACGACCAGG TGTTCAACAA CTGGGACTTC AGCCACACGG ACCCGACGGG CGCGAACCGC GGCGGCGGCA ACACGCTGTA CACGGCGGGC GATCTCGCCG CGACGATGAG CCTGAACCCG GACCTGAAGG TGCTGTCGGC GAACGGCTAT TTCGACGCGG TGACGCCGTT CCACCAGACC GAGCTCACGC TCGCGCAGAT GCCGCTCGAT CCGTCGCTGA AGTCGGCGAA CCTGACGATG AAATACTATC CGTCGGGCCA CATGATCTAT CTGAACGATC ACTCGCGGAT CGCGATGAAG GCGGATCTGG CGACGTTCTA CGACGGCATC CTCGCGGACC GCACGGCGAT GCGGCGCGTG CTGCTGCGCC AGCAGAAGGC GCTGCAGTTG AAGCAGCAGA AGCAACAGCA AGGGCAGTGA
|
Protein sequence | MKIQKSLKDG FMLGWCRAAR PVAAAALAAL LVAACGGDDG GGSSPSLAAA NVANTSTSTN ATTNATTAAD ATTNAALPPD QPYIDNDVYG TGPNDSVSDA TEGTAVVHRQ VKIGDQILTY TATAGHLVTI DPITSKPNAK MFYVAYTLDN PNPGKPRPVT FFYNGGPGSS SVYLLLGSFG PKRLQSSFPN FTPPAPYRLR DNPESLLDRS DLVFINPVGT GYSAAIAPAK NKDFWGVDQD AHSIDRFIQR YLTKYARWNS PKFLFGESYG TARSAVTAWV LHEDGIELNG ITLQSSILDY ANAVSAIGIF PTLAADAFYW NKTTISPKPA DLDAYMAQAR SYADNVLAPL AQAPNPQDGG FVNVRLNLNV ATAQQMGAYI GTDPISLVQT FGNPAALGNV PSSNDNPPYT FFLTLVPGIQ IGQYDGRANY TGKGIAPYIL PNSGSNDPSI SNVGGAYTVL WNDYINNDLK YVSTSSFVDL NDQVFNNWDF SHTDPTGANR GGGNTLYTAG DLAATMSLNP DLKVLSANGY FDAVTPFHQT ELTLAQMPLD PSLKSANLTM KYYPSGHMIY LNDHSRIAMK ADLATFYDGI LADRTAMRRV LLRQQKALQL KQQKQQQGQ
|
| |