Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_2168 |
Symbol | |
ID | 3691462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 2397108 |
End bp | 2402072 |
Gene Length | 4965 bp |
Protein Length | 1654 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637728624 |
Product | hemagluttinin motif-containing protein |
Protein accession | YP_333563 |
Protein GI | 76812117 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5099] RNA-binding protein of the Puf family, translational repressor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.306442 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA TATTTCGGGT GATCTGGTGC CGTGTCAAGG CTGCGTGCGT CGTTGTTTCG GAAGAGGCCT GCCTGCGCGG CGGAAAGAGC CATAGCTGCC GGCAAGGCAG CCGTGCGGCA GGTGAGGAAT CTGTTCGTTT CGCGTTGTCG TCCATTGCGC TGGCGGCGTG CATATTGATC GGGAGCCTTG GGTCCACGCT GCCGGCTGTG GCTGGTACGG TAATAGGCGG CGGGGCTCAA TACCCCAATT CGGTAGGCGG AACGAGTTCT ACGACCGGCG ATTTGGGCAA CAGCTATATT GGTGCGAGCG GTATGGGTAC CGCCATTACT GGGGATAATG ATTGCCTGAG CTTGACTTCG ACCCGCAACG TGCTAAACAG TGCGAATGTC GGCTGGCTTT TGGGAACGAC TTCACAGACC ACCGATCCTG GTCCGTTGTA CCCCGGCCCC GGCGCGGAAA ACAACCAGAC TATCAGTTAC AGTGGCACCT CCAATTTTTC GGGTGGTGGA AATAGCGCGG CTGGTGCCCA GGCGACCCTC GCCTGGGGCT TCAATTCCTT CGCAGCGGGG TGTGGGAACA AATCGCTTGG CATGGGTTCG AGTACGCTCG GGCTCAATAA CATGGCCAGC TTGGCCGGTT CAACTGCGAT CGGGATCGCT AATACCGCGT CAGGAGCGGG CGGAACCGCG ATCGGTCTCT ACAACACTGC CGCTGGCACG GGCAGCGTTG CAATGGGCAT TGGCTCTCAA GCGACAGGCA ATGGAACGAT CGCACTCGGA TACGGCGGCG GCGGCACCTC GAGCAACGCC ACATTGGCGA GCGGCAGCAA TGCGATCGCG ATAGGCGGAG ACGCGACCAA AGGCGCGCAG GCGACGGGAA GCGATTCGAT CGCCATGGGC AGCGAGGCGG CCGCCAGCAG TTCCAGCACG ACCGCAATCG GGCAATATGC CACCGCTAGC AATACGAATG CCACGGCGCT CGGCGCGGGC GGAACATCCG CCGCCACTGG CGTAATCGCA AGCGGTGCGG GCGCCGTCGC GCTCGGTGGC AACAGTACGC AAGGCGCGCA GGCGTTGGCA AGCAACGCCA TTGCGATCGG CGGCCAATCG CAAGCCGCTA GCGCCGGTGC GATCGCGATC GGCCAGAGCG CACTGGCCAC GGGTGGGCAA GCGGTGTCCG TCGGCGTGGG CAATACCGCG AACGGCAATG GCGCGGTGGC GATCGGCGAT CCGAACGTCG CGACCGGCAC GGGAGCGGTG GCGCTGGGGG CGAACAATAC CGCCACCGGA CAAGGCGCGG TTGCGCTCGG CAACGCTGAT ATTGCCACGG GCCAAGGTTC GGTTGCGCTC GGTAATGTGT CGACGGCCGC CGGTGCAGGA TCGGTGGCCT TCGGCTCGAA TGCCGTAGCC AACAACACGA ACGATGTGGC GCTGGGCTCC GGATCTGTGA CAGCCGCGCC GAATCCGACG GGGAGTGCGA CGATCGGCGG AACCACATAT TCGTTTGAGG GAACCAATCC GACGAGCGTT GTGAGCGTGG GCGCAGTGGG CGCGGAACGC CAGATCACGA ACGTCGCGGC GGGACAACTG ACGGCGACGA GCACGGACGC AGTCAACGGT TCGCAACTTT ACTCGACGAA TCAGGCGATT AATACATTAT CGACGTCGAC TTCGACGGGC CTGTCGTCGG CGAATAGCTC GATCGCGTCG TTATCCACTG GCCTCGCATC GTCCGGCAAC CTGGCTTCGC TGTCGACGTC CACGTCGACA GGCCTGTCAT CGGCGAATAG CTCGATCGCG TCGCTGTCGA CGTCGACTTC AACGGGCCTC TCGACGACCA ACAGCAACAT CGGCTCGCTG TCGACAGGCC TTAGCACGAC TAATAGCACG GTCGCATCGT TATCGACGTC CACGTCGACG GGATTGTCGT CGGCGACCAG CTCGATCACG TCGTTGTCGA CGTCGACCTC GAGCGGCATC AGTACCGCTC AGAGCGGCGT CAATTCGCTG TCGACTGGCC TGTCGACCAC CAACAGCGCA GTCACCTCGT TATCGACGTC CACGTCGACG GGATTATCGT CGGCGACCAG CTCGATCACG TCGTTGTCTA CGTCGACGTC AACTGGAATC GGCTCGCTGT CGACCGGTCT GAGCACGACC GACAGTTCCG TAACGTCGCT GTCGACGTCG ACCTCGAGCG GCATCAGTAC CGCTCAGAGC GGCGTCAATT CGCTGTCGAC TGGCCTGTCG ACCACCAACA GCACGGTCAC CTCGTTATCG ACGTCCGCGT CGACGGGATT ATCGTCGGCG AACAGCTCGA TCACGTCGTT GTCGACGTCG ACCTCGACTG GATTGAGCTC GTTGTCGACG GGAATCGCGA ATTCCGTTCA GTATGACAAC ACCAGCCATA CTCAAGTCAC GTTGGGAGGA GCCGGCGCGA CGACGCCGGT GACGCTGACC AATGTCGCCG CGGGCGTGAA TCCCACCGAT GCGGTGAATA TGAGTCAGCT GACTTCGCTG TCGACGTCAA CGTCAACGGG ACTGTCGACG ACCAACAGCA ACATTGGCTC GCTGTCGACC GCTCTGAGCA CGACCGACAG TTCCGTGACG TCGCTGTCGA CGTCGACCTC GAGCGGCATC AGTACTGCTC AGAGCGGTGT CAATTCGCTG TCGACTGGCC TGTCGACCAC CAACAGCACG GTCGCATCGC TATCGACGTC CACGTCGACG GGATTGTCGT CGGCGACCAG CTCGATCACG TCGTTGTCTA CGTCGACGTC AACTGGAATC GGCTCGCTGT CGACCGGTCT GAGCACGGCC AACAGTTCCG TGACGTCGCT GTCGACGTCG ACCTCGAGCG GCATCAGTAC CGCTCAGAGC GGCGTCAATT CGCTGTCGAC TGGCCTGTCG ACCACCAACA GCACGGTCGC ATCGCTATCG ACGTCCACGT CGACGGGATT GTCGTCGGCG ACCAGCTCGA TCACGTCGTT GTCTACGTCG ACGTCAACTG GAATCGGCTC GCTGTCGACC GGTCTGAGCA CGGCCAACAG TTCCGTGACG TCGCTGTCGA CGTCGACCTC GAGCGGCATC AGTACCGCTC AGAGCGGCGT CAATTCGCTG TCGACTGGCC TGTCGACCAC CAACAGCGCA GTCACTTCGT TGTCGACGTC CACGTCGACG GGGCTGAGCT CGCTGTCCAC GGGTCTGTCC GCGACCAACA GCAACATCAG TTCGCTGTCG ACTTCGACCT CGAACGGCCT GAGCACAGCT AACAGCAATA TCTCGTCGCT GTCCACTGGG CTGAGTTCGC TGTCGACCGC AGTCAACGGC GGCGGGACCA AGTACTTTCA CGCCAACTCG GCGCAGCCGG ACAGTCAGGC GCTGGGGACG AACGCGATCG CCGTCGGACC GGCGGCCACG GCATCCGGCG CAAGCGGTAT CGCGATCGGC GATACCGCGA ACGCGGCTGC AACTGGCGCG GTGGCGATTG GCCAGACCGC CGTTGCAACG GGTGGTCAGG CGGTCTCGAT CGGTGTTGCG AATACGGCGA GCGGCGACGG CGCGGTCGCG ATCGGTGATC CGAACGTCGC GACGGGTACT GGTGCGGTCG CGCTGGGCGC GAACAATTCT GCGAACGGCC AGGGCGCCGT TGCGCTCGGC AATGCGAACG TGGCAACCGG GACAGGCTCG CTGGCGCTCG GCAGCACATC GACGGCGGCG GGGGGCGGCT CGATCGCGTT GGGCACGAAT GCAATCGCGA ACAATGCCAA CGATGTCGCG CTGGGTTCCG GTAGCGTAAC CGCAGCCGCG AATCCGGTCG CCAGTGCGCT GATCGCCGGT CAGACGTATT CGCTCGCCGG CGGTTCACCG ACAAGCGTGG TGAGTGTCGG TGCGCCGGGT GCAGAACGAC AAATCACCAA CGTCGCGGCA GGGCAGGTTT CCGCCACGTC GACCGACGCG GTGAACGGAT CGCAGATGAA CGCCGTGACC CAAGCGCTGG TATCGTTGTC GACTTCGACA GCCAATGCGT TGTCGACGAC GCAAAACGGT CTGAGTTCGT TGTCGACAGG GCTGAGCACG ACGCAGAGCA GTGTGAGTTC GCTGTCGACG GGCCTTAGCA CGACGAGCGG TAACGTGTCG TCGCTGTCGA CCGGCTTGAG TACGACGCAG AGCGATGTCG CGTCGCTGTC AACGGGACTG AGTACGACGA ACAGCAATCT CGCGTCGTTG TCGACTGCGG TGTCCAACGG AGGGATTCAC ACCAACGGCG CGGGCGGCAC GTCGATGGGG CCGGGAGCCG ACGCGTCGGG CAGCAACAGC ACGGCGGTCG GTGGCGCAGC GTCTGCTTCG GGAGCCAATG CAACGGCGCT TGGCCAGGCG TCGAACGCAT CGGGCAATAA TTCGACGGCA TTAGGGCAAG CATCGAGTGC ATCCGGAAGC GGCTCCACCG CGGTGGGGCA GGGCGCGAGC GCGTCGGGCG ATGGGTCCTC GGCATTCGGC CAAGGGGCGA TTGCCTCCGG CACGAACTCG ACGGCGCTGG GCGCTCATTC GACGGCATCG GCGCCGAATT CAGTGGCGAT CGGTGCGAAC TCGGTAGCTT CCGCGCCGAA CACGGTGTCG TTCGGGTCTC AGGGCCATGA GCGTCGACTG ACGAACGTCG CACCGGGGAT GGACGGTACC GACGCAGCGA ACATGAGCCA GCTTTGGGGC GTGCAATCGA GCGTAGACCA AGCCGCGCGC CGTGCGTATT CGGGTGTGGC GGCTGCCACC GCGCTGACGA TGATTCCGGA AGTCGATCCT GGCAAGACGA TTGCAGTTGG GATCGGTGCC GGCAGCTATC AAGGGTATTC GGCATCCGCG ATCGGTGTGT CCGTGCGGTT CTCCGACAAC CTGAAGGCGA AGCTCGGCAT GGGGATCAGC AGTCAGGGCA GCACGTACGG CGGAGGCATC TCGTACCAGT GGTAA
|
Protein sequence | MNKIFRVIWC RVKAACVVVS EEACLRGGKS HSCRQGSRAA GEESVRFALS SIALAACILI GSLGSTLPAV AGTVIGGGAQ YPNSVGGTSS TTGDLGNSYI GASGMGTAIT GDNDCLSLTS TRNVLNSANV GWLLGTTSQT TDPGPLYPGP GAENNQTISY SGTSNFSGGG NSAAGAQATL AWGFNSFAAG CGNKSLGMGS STLGLNNMAS LAGSTAIGIA NTASGAGGTA IGLYNTAAGT GSVAMGIGSQ ATGNGTIALG YGGGGTSSNA TLASGSNAIA IGGDATKGAQ ATGSDSIAMG SEAAASSSST TAIGQYATAS NTNATALGAG GTSAATGVIA SGAGAVALGG NSTQGAQALA SNAIAIGGQS QAASAGAIAI GQSALATGGQ AVSVGVGNTA NGNGAVAIGD PNVATGTGAV ALGANNTATG QGAVALGNAD IATGQGSVAL GNVSTAAGAG SVAFGSNAVA NNTNDVALGS GSVTAAPNPT GSATIGGTTY SFEGTNPTSV VSVGAVGAER QITNVAAGQL TATSTDAVNG SQLYSTNQAI NTLSTSTSTG LSSANSSIAS LSTGLASSGN LASLSTSTST GLSSANSSIA SLSTSTSTGL STTNSNIGSL STGLSTTNST VASLSTSTST GLSSATSSIT SLSTSTSSGI STAQSGVNSL STGLSTTNSA VTSLSTSTST GLSSATSSIT SLSTSTSTGI GSLSTGLSTT DSSVTSLSTS TSSGISTAQS GVNSLSTGLS TTNSTVTSLS TSASTGLSSA NSSITSLSTS TSTGLSSLST GIANSVQYDN TSHTQVTLGG AGATTPVTLT NVAAGVNPTD AVNMSQLTSL STSTSTGLST TNSNIGSLST ALSTTDSSVT SLSTSTSSGI STAQSGVNSL STGLSTTNST VASLSTSTST GLSSATSSIT SLSTSTSTGI GSLSTGLSTA NSSVTSLSTS TSSGISTAQS GVNSLSTGLS TTNSTVASLS TSTSTGLSSA TSSITSLSTS TSTGIGSLST GLSTANSSVT SLSTSTSSGI STAQSGVNSL STGLSTTNSA VTSLSTSTST GLSSLSTGLS ATNSNISSLS TSTSNGLSTA NSNISSLSTG LSSLSTAVNG GGTKYFHANS AQPDSQALGT NAIAVGPAAT ASGASGIAIG DTANAAATGA VAIGQTAVAT GGQAVSIGVA NTASGDGAVA IGDPNVATGT GAVALGANNS ANGQGAVALG NANVATGTGS LALGSTSTAA GGGSIALGTN AIANNANDVA LGSGSVTAAA NPVASALIAG QTYSLAGGSP TSVVSVGAPG AERQITNVAA GQVSATSTDA VNGSQMNAVT QALVSLSTST ANALSTTQNG LSSLSTGLST TQSSVSSLST GLSTTSGNVS SLSTGLSTTQ SDVASLSTGL STTNSNLASL STAVSNGGIH TNGAGGTSMG PGADASGSNS TAVGGAASAS GANATALGQA SNASGNNSTA LGQASSASGS GSTAVGQGAS ASGDGSSAFG QGAIASGTNS TALGAHSTAS APNSVAIGAN SVASAPNTVS FGSQGHERRL TNVAPGMDGT DAANMSQLWG VQSSVDQAAR RAYSGVAAAT ALTMIPEVDP GKTIAVGIGA GSYQGYSASA IGVSVRFSDN LKAKLGMGIS SQGSTYGGGI SYQW
|
| |