Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_3665 |
Symbol | |
ID | 3691183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 4008333 |
End bp | 4010123 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637730120 |
Product | hypothetical protein |
Protein accession | YP_335030 |
Protein GI | 76810522 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.135844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCA CCCCGACTAC GCTTACGCTT AAGCAGTTTT TTTCGGTGAG CAACGAGCAA TTTCTCATCC CCGCTTATCA GCGCCGCTAC GCGTGGGGGC AGCGCCAGCA GCGTGAATTG TTCGACGACC TGCGGCTACT TGCCTCCGGC GATACCCACC TTTTGGGCAC AGTACTTTTC CTCTCCGACA CTCATAATCC CGTTATCAAC CAGCTTGAAC TGGTAGATGG CCAGCAACGT GTTACGACCA TCACTATCTT GATGAGCGTA CTGGCTCGTC GATTCGAGCA AGAGCCGGGT TATGAAAAGA CGGCGCAGAA AATCGCGGAG TTGCTGCAGT GTGAGGGCGT GAGCGGGCAA GTCGTACCGA AGCTTCAGCT GGGCGATTTG GATCATGGCG ATTACGAGCG CATCATGAAT GGAGGCGATA TTTCGGAGGT GGCGAATGGT TGTCTGAAAG GGGCTTGCGA GTATTTCACT GAGTGGGTCG AGCAGTTGTC CGTGAACGAG CTCAATGTGT TCTTCCACAA GCTCATGAAT AGCGCATCAA TCATCCGATT GGACGTGGGT GCAGCTAAAG ATGCCTATAA GCTATTCGAA ACGATCAACA ACCGGGGGTT GCGACTCAAA CCAACGGACA TCATCAAGAA CTTTCTTCTG GGGCATGCAT CGTCATTGCC CGCCGGCACA CTGGACAAAG TGAAGGGAGA TTGGCGCAAA TTGATCGTGG CCCTTGACGG CTTGGATAGT GATGACTTTT TCCGCCAGTG GCTGGCGGGC AAACTACACC GAAAGGTCAC CAAGAGCAAA CTGGTGGCCG ACTTCAAGGC CAATTACCTG CGTCACGTAC AAGAGGCCGA GAGCATGACC GAATTCATGT CGTCGACCAT AAAAGATGAC GAAGATGAGG AGGAGATCGA GGATGTGGCT ATCCTCGATG ACGAGGAGGA CGGCACGGAG ACACTGGCAA AAGTCAGGAA AGTCAAGCTA ACGGCGTTTG CTACGGCACT GCGCCAGTCG GCCGAGCTGT ACTCTAAGTT GCTGTGGGGA ACGACGACGT CGGCTAAGAT CAATCGGCAC ATAGGCAACC TGTGGCGGAT AAAGGCGTTC TCTGCCTTTA CCTGGCTGCT GGATATGTTC GGACGCAAGG ATTTGGACGA GAAAGCTCAA ATTCGATTGT TGAAGGCATT GGAGGCATTC ATGATGCGCC GGCATATCTG CGAGAAGCGC ACCAACGAGC TGGAGACGAT TTTTGCAAAC ATGACCAGCA TTGCCGATAG CGACTACGAG AAAGCCGTTA TTAAGATTCT GAGGGAGCAC ACGCCCGACG ACGAGGAGTT CGAGTCTGCG TTCGCCTCGT TCCCTTTTGT ACCCGCGGTG ATCGATCGCG CACGCTATGC ATTGGAGATG TTCGAGTACC GGGCTATCGG ACATAAGAAC GAATACTACC TAGCCGATCC GGATGAGCTC GAGCTTGAGC ACATCATCCC GAAGGCGGCC GACAAGGCCA GCACAAAAAA GGAGTTCGGT GACTGGCCAA GCTACTTGGG CGACGGTTGG AAAGCCAAGC ACGCCAAAAT GCTTCATCGT ATCGGCAACA TGACCTTGCT GGCCGACGAA CTGAACGTGG TGGCGTCAAA CAACCCCTTC CTATCCAAGC GCAAAGAGTA CGCTAGCTCC AACATCCGCC TAACAAACGA TCTGTCGACG CTCAATCAAT TCAAGTTCAA GCAGGTGGAT GACCGATCCA AGGAGTTCGC CAAATGGGCA GTGCAGATAT GGAGGGTCTA G
|
Protein sequence | MKITPTTLTL KQFFSVSNEQ FLIPAYQRRY AWGQRQQREL FDDLRLLASG DTHLLGTVLF LSDTHNPVIN QLELVDGQQR VTTITILMSV LARRFEQEPG YEKTAQKIAE LLQCEGVSGQ VVPKLQLGDL DHGDYERIMN GGDISEVANG CLKGACEYFT EWVEQLSVNE LNVFFHKLMN SASIIRLDVG AAKDAYKLFE TINNRGLRLK PTDIIKNFLL GHASSLPAGT LDKVKGDWRK LIVALDGLDS DDFFRQWLAG KLHRKVTKSK LVADFKANYL RHVQEAESMT EFMSSTIKDD EDEEEIEDVA ILDDEEDGTE TLAKVRKVKL TAFATALRQS AELYSKLLWG TTTSAKINRH IGNLWRIKAF SAFTWLLDMF GRKDLDEKAQ IRLLKALEAF MMRRHICEKR TNELETIFAN MTSIADSDYE KAVIKILREH TPDDEEFESA FASFPFVPAV IDRARYALEM FEYRAIGHKN EYYLADPDEL ELEHIIPKAA DKASTKKEFG DWPSYLGDGW KAKHAKMLHR IGNMTLLADE LNVVASNNPF LSKRKEYASS NIRLTNDLST LNQFKFKQVD DRSKEFAKWA VQIWRV
|
| |