Gene Information |
Locus tag | BURPS668_A1465 |
Symbol | |
ID | 4886354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1364410 |
End bp | 1366134 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640131404 |
Product | TPR repeat-containing protein |
Protein accession | YP_001062462 |
Protein GI | 126442784 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.388393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACGCTCG GCCGGAAACA CGCTGTAGGC ATGGGAAGCA GGAACATGGG CGACTATCTG GCGCAGGACG CGCGTATCGA GCGGATCACG GTTCGCGATC CCGAGGCGCG GACATGGTTC GATCGCGGTC TGCTGTGGGC ATACGGATTC AATTTCGAAG CCGCGGTCGA TTGCTTTCAG GAAGCGGCGC GGATCGATCC GGATTGCGTG CTCGCCTACT GGGGCATCGC GTACGCGTCG GGCTGCAATT ACAACAAGCA GTGGAAGGTG TTTCATCCGC GGATGATCGC GCGCTGCATG AAGCTCGCGC GCGACGCGAT CCGGCGCGCG CAGGCGTGCC GGGCGGGCGC GAGCGCGCTG GAGATGGCGC TCGTGCGCGC GATCGAGTGC CGCTTCCAGG CCGAGGGCGC CCATGACGAA GCGCTGCTGC GGCGCTGGAA CGACGACTAC GCCCGCGCGA TGCGCGACGT CTATCTGTCC TGCCCCGACA ATCTCGATGT CGCCGCGCTG TTCGCGGACG CGCTGATCAA TCGCACGCCG TGGAAGCTCT GGGACATGTC GAGCGGCAAG ACCGCCGACG GCGCCGATAC CGACGAGGCG ATCGCGGTTC TCGAGCGCGC GCTCGCGCAG GTCGACGCGA ACGGCCTCGC CCCGCATCCG GGGCTGCTGC ACGTCTACAT CCACACGATC GAAATGTCGC CGACGCCGGA GAAGGCGCTT CGCGCGGCCG ACGCACTGCG CGATCTCGCG CCCGACGTCG GCCATCTGCT GCACATGGCC TCGCACATCG ACATTCTTTG CGGCCACTAC CACGACGCGA TCGTCGCGAA CGACCGCGCA ATCGCCGCCA ACCAGCGTGT GCTCGACCGC CATCCGCAGT GGCTCGAATT CCGGCTGTAC TGCGTGCACG ACATCCATTT CAAGATCTAT GCGGCGATGA TGCTCGGCCG TTTCTCGGCG GCGTGGCAAG GGGTGCACGA GCTCGAGGCG CTGATCACGG AAGCGCTGCT GCGCGTCGAG CAGCCGCCGA TGGCCTACCT CCTCGAAGGC TTCCTGTCGG TGCGCGTGCA CGTGCTGATC CGGTTCGGCA AATGGCGCGA GATCCTCGAG CAGCCGTTCC CCGCGAACGC CGCGCTGTAC TGCAACACGA CCGCGATGCT CCATTACGCG CGCGGCATCG CCTACGCGAA CCTGAAGCTC GCCGGCCCCG CCGCGCAGGC GCGCCGCGCG TTCTCGATCG CGCAGGCGGC CCTTCACGAG CATCGGTATG TGACGAACAA CACCTGCGCG GATCTGCTGA AGATCGCCGC TTGCGTGCTC GACGGCGAGA TCGCGTATCA CGAGGACCGG TTCGACGAAG CGTTCGCGCA TCTGCGGCGC GCGGTCGAAC TCGACGACGG GCTCGAATAC ATGGAGCCGT GGGGCTGGAT GATGCCGACG CGCCACCCGC TCGGCGCGCT GCTGCTCGCG CAAGGGCATG TCGAGGAAGC CGAGCGCGTC TATCGCGCCG ATCTCGGGCT CGATTCGACG ATCTACCGCT CGCTCCAGCA TCCGGGCAAC GTGTGGAGCC TGCAGGGCTA CGTGAGCTGC CTGCGCAGGC TCGGCAAGCA CGAAACGGCG GACGCGCTGC AACCGGCGCT CGACGTCGCG CGCGCGCGGG CCGACGTCGA GATCGGCGTG TCGTGCTTCT GCGCGGCGCC CAGCAAGCGC GGTTGCTGCC ACTGA
|
Protein sequence | MTLGRKHAVG MGSRNMGDYL AQDARIERIT VRDPEARTWF DRGLLWAYGF NFEAAVDCFQ EAARIDPDCV LAYWGIAYAS GCNYNKQWKV FHPRMIARCM KLARDAIRRA QACRAGASAL EMALVRAIEC RFQAEGAHDE ALLRRWNDDY ARAMRDVYLS CPDNLDVAAL FADALINRTP WKLWDMSSGK TADGADTDEA IAVLERALAQ VDANGLAPHP GLLHVYIHTI EMSPTPEKAL RAADALRDLA PDVGHLLHMA SHIDILCGHY HDAIVANDRA IAANQRVLDR HPQWLEFRLY CVHDIHFKIY AAMMLGRFSA AWQGVHELEA LITEALLRVE QPPMAYLLEG FLSVRVHVLI RFGKWREILE QPFPANAALY CNTTAMLHYA RGIAYANLKL AGPAAQARRA FSIAQAALHE HRYVTNNTCA DLLKIAACVL DGEIAYHEDR FDEAFAHLRR AVELDDGLEY MEPWGWMMPT RHPLGALLLA QGHVEEAERV YRADLGLDST IYRSLQHPGN VWSLQGYVSC LRRLGKHETA DALQPALDVA RARADVEIGV SCFCAAPSKR GCCH
|
| |
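The fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the helper names (`gene_length`, `gc_percent`, `translate_cds`) are hypothetical, and it assumes that coordinates are 1-based and inclusive, that the gene sequence is given 5' to 3' in coding orientation, and that the first codon is a valid initiator under translation table 11 (which uses the standard codon assignments but allows alternative start codons such as TTG to be read as methionine).

```python
# Illustrative helpers only; these names are hypothetical, not a database API.

BASES = "TCAG"
# Standard codon assignments (shared by translation table 11), in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq):
    """Translate a coding sequence given in coding orientation.

    Assumption: the first codon is an initiator, so it is reported as M
    even when it is an alternative start codon such as TTG.
    """
    s = clean(seq)
    usable = len(s) - len(s) % 3
    residues = [CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    length = gene_length(1364410, 1366134)
    print(length)           # 1725
    print(length // 3 - 1)  # 574 (codons minus the stop codon)
    print(translate_cds("TTGACGCTCG GCCGGAAACA CGCTGTAGGC"))  # MTLGRKHAVG
```

Passing the full "Gene sequence" value to `translate_cds` and `gc_percent` allows comparison with the "Protein sequence" and "GC content" fields listed above.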