Gene BURPS1710b_2168 details


Gene Information

Locus tag: BURPS1710b_2168
Symbol:
ID: 3691462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 2397108
End bp: 2402072
Gene Length: 4965 bp
Protein Length: 1654 aa
Translation table: 11
GC content: 63%
IMG OID: 637728624
Product: hemagluttinin motif-containing protein
Protein accession: YP_333563
Protein GI: 76812117
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG5099] RNA-binding protein of the Puf family, translational repressor
TIGRFAM ID:
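
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but contributes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2397108, 2402072)
print(length)                  # 4965
print(protein_length(length))  # 1654
```

Both values agree with the Gene Length and Protein Length fields of the record.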


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.306442
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a


Sequence

Gene sequence
ATGAATAAAA TATTTCGGGT GATCTGGTGC CGTGTCAAGG CTGCGTGCGT CGTTGTTTCG 
GAAGAGGCCT GCCTGCGCGG CGGAAAGAGC CATAGCTGCC GGCAAGGCAG CCGTGCGGCA
GGTGAGGAAT CTGTTCGTTT CGCGTTGTCG TCCATTGCGC TGGCGGCGTG CATATTGATC
GGGAGCCTTG GGTCCACGCT GCCGGCTGTG GCTGGTACGG TAATAGGCGG CGGGGCTCAA
TACCCCAATT CGGTAGGCGG AACGAGTTCT ACGACCGGCG ATTTGGGCAA CAGCTATATT
GGTGCGAGCG GTATGGGTAC CGCCATTACT GGGGATAATG ATTGCCTGAG CTTGACTTCG
ACCCGCAACG TGCTAAACAG TGCGAATGTC GGCTGGCTTT TGGGAACGAC TTCACAGACC
ACCGATCCTG GTCCGTTGTA CCCCGGCCCC GGCGCGGAAA ACAACCAGAC TATCAGTTAC
AGTGGCACCT CCAATTTTTC GGGTGGTGGA AATAGCGCGG CTGGTGCCCA GGCGACCCTC
GCCTGGGGCT TCAATTCCTT CGCAGCGGGG TGTGGGAACA AATCGCTTGG CATGGGTTCG
AGTACGCTCG GGCTCAATAA CATGGCCAGC TTGGCCGGTT CAACTGCGAT CGGGATCGCT
AATACCGCGT CAGGAGCGGG CGGAACCGCG ATCGGTCTCT ACAACACTGC CGCTGGCACG
GGCAGCGTTG CAATGGGCAT TGGCTCTCAA GCGACAGGCA ATGGAACGAT CGCACTCGGA
TACGGCGGCG GCGGCACCTC GAGCAACGCC ACATTGGCGA GCGGCAGCAA TGCGATCGCG
ATAGGCGGAG ACGCGACCAA AGGCGCGCAG GCGACGGGAA GCGATTCGAT CGCCATGGGC
AGCGAGGCGG CCGCCAGCAG TTCCAGCACG ACCGCAATCG GGCAATATGC CACCGCTAGC
AATACGAATG CCACGGCGCT CGGCGCGGGC GGAACATCCG CCGCCACTGG CGTAATCGCA
AGCGGTGCGG GCGCCGTCGC GCTCGGTGGC AACAGTACGC AAGGCGCGCA GGCGTTGGCA
AGCAACGCCA TTGCGATCGG CGGCCAATCG CAAGCCGCTA GCGCCGGTGC GATCGCGATC
GGCCAGAGCG CACTGGCCAC GGGTGGGCAA GCGGTGTCCG TCGGCGTGGG CAATACCGCG
AACGGCAATG GCGCGGTGGC GATCGGCGAT CCGAACGTCG CGACCGGCAC GGGAGCGGTG
GCGCTGGGGG CGAACAATAC CGCCACCGGA CAAGGCGCGG TTGCGCTCGG CAACGCTGAT
ATTGCCACGG GCCAAGGTTC GGTTGCGCTC GGTAATGTGT CGACGGCCGC CGGTGCAGGA
TCGGTGGCCT TCGGCTCGAA TGCCGTAGCC AACAACACGA ACGATGTGGC GCTGGGCTCC
GGATCTGTGA CAGCCGCGCC GAATCCGACG GGGAGTGCGA CGATCGGCGG AACCACATAT
TCGTTTGAGG GAACCAATCC GACGAGCGTT GTGAGCGTGG GCGCAGTGGG CGCGGAACGC
CAGATCACGA ACGTCGCGGC GGGACAACTG ACGGCGACGA GCACGGACGC AGTCAACGGT
TCGCAACTTT ACTCGACGAA TCAGGCGATT AATACATTAT CGACGTCGAC TTCGACGGGC
CTGTCGTCGG CGAATAGCTC GATCGCGTCG TTATCCACTG GCCTCGCATC GTCCGGCAAC
CTGGCTTCGC TGTCGACGTC CACGTCGACA GGCCTGTCAT CGGCGAATAG CTCGATCGCG
TCGCTGTCGA CGTCGACTTC AACGGGCCTC TCGACGACCA ACAGCAACAT CGGCTCGCTG
TCGACAGGCC TTAGCACGAC TAATAGCACG GTCGCATCGT TATCGACGTC CACGTCGACG
GGATTGTCGT CGGCGACCAG CTCGATCACG TCGTTGTCGA CGTCGACCTC GAGCGGCATC
AGTACCGCTC AGAGCGGCGT CAATTCGCTG TCGACTGGCC TGTCGACCAC CAACAGCGCA
GTCACCTCGT TATCGACGTC CACGTCGACG GGATTATCGT CGGCGACCAG CTCGATCACG
TCGTTGTCTA CGTCGACGTC AACTGGAATC GGCTCGCTGT CGACCGGTCT GAGCACGACC
GACAGTTCCG TAACGTCGCT GTCGACGTCG ACCTCGAGCG GCATCAGTAC CGCTCAGAGC
GGCGTCAATT CGCTGTCGAC TGGCCTGTCG ACCACCAACA GCACGGTCAC CTCGTTATCG
ACGTCCGCGT CGACGGGATT ATCGTCGGCG AACAGCTCGA TCACGTCGTT GTCGACGTCG
ACCTCGACTG GATTGAGCTC GTTGTCGACG GGAATCGCGA ATTCCGTTCA GTATGACAAC
ACCAGCCATA CTCAAGTCAC GTTGGGAGGA GCCGGCGCGA CGACGCCGGT GACGCTGACC
AATGTCGCCG CGGGCGTGAA TCCCACCGAT GCGGTGAATA TGAGTCAGCT GACTTCGCTG
TCGACGTCAA CGTCAACGGG ACTGTCGACG ACCAACAGCA ACATTGGCTC GCTGTCGACC
GCTCTGAGCA CGACCGACAG TTCCGTGACG TCGCTGTCGA CGTCGACCTC GAGCGGCATC
AGTACTGCTC AGAGCGGTGT CAATTCGCTG TCGACTGGCC TGTCGACCAC CAACAGCACG
GTCGCATCGC TATCGACGTC CACGTCGACG GGATTGTCGT CGGCGACCAG CTCGATCACG
TCGTTGTCTA CGTCGACGTC AACTGGAATC GGCTCGCTGT CGACCGGTCT GAGCACGGCC
AACAGTTCCG TGACGTCGCT GTCGACGTCG ACCTCGAGCG GCATCAGTAC CGCTCAGAGC
GGCGTCAATT CGCTGTCGAC TGGCCTGTCG ACCACCAACA GCACGGTCGC ATCGCTATCG
ACGTCCACGT CGACGGGATT GTCGTCGGCG ACCAGCTCGA TCACGTCGTT GTCTACGTCG
ACGTCAACTG GAATCGGCTC GCTGTCGACC GGTCTGAGCA CGGCCAACAG TTCCGTGACG
TCGCTGTCGA CGTCGACCTC GAGCGGCATC AGTACCGCTC AGAGCGGCGT CAATTCGCTG
TCGACTGGCC TGTCGACCAC CAACAGCGCA GTCACTTCGT TGTCGACGTC CACGTCGACG
GGGCTGAGCT CGCTGTCCAC GGGTCTGTCC GCGACCAACA GCAACATCAG TTCGCTGTCG
ACTTCGACCT CGAACGGCCT GAGCACAGCT AACAGCAATA TCTCGTCGCT GTCCACTGGG
CTGAGTTCGC TGTCGACCGC AGTCAACGGC GGCGGGACCA AGTACTTTCA CGCCAACTCG
GCGCAGCCGG ACAGTCAGGC GCTGGGGACG AACGCGATCG CCGTCGGACC GGCGGCCACG
GCATCCGGCG CAAGCGGTAT CGCGATCGGC GATACCGCGA ACGCGGCTGC AACTGGCGCG
GTGGCGATTG GCCAGACCGC CGTTGCAACG GGTGGTCAGG CGGTCTCGAT CGGTGTTGCG
AATACGGCGA GCGGCGACGG CGCGGTCGCG ATCGGTGATC CGAACGTCGC GACGGGTACT
GGTGCGGTCG CGCTGGGCGC GAACAATTCT GCGAACGGCC AGGGCGCCGT TGCGCTCGGC
AATGCGAACG TGGCAACCGG GACAGGCTCG CTGGCGCTCG GCAGCACATC GACGGCGGCG
GGGGGCGGCT CGATCGCGTT GGGCACGAAT GCAATCGCGA ACAATGCCAA CGATGTCGCG
CTGGGTTCCG GTAGCGTAAC CGCAGCCGCG AATCCGGTCG CCAGTGCGCT GATCGCCGGT
CAGACGTATT CGCTCGCCGG CGGTTCACCG ACAAGCGTGG TGAGTGTCGG TGCGCCGGGT
GCAGAACGAC AAATCACCAA CGTCGCGGCA GGGCAGGTTT CCGCCACGTC GACCGACGCG
GTGAACGGAT CGCAGATGAA CGCCGTGACC CAAGCGCTGG TATCGTTGTC GACTTCGACA
GCCAATGCGT TGTCGACGAC GCAAAACGGT CTGAGTTCGT TGTCGACAGG GCTGAGCACG
ACGCAGAGCA GTGTGAGTTC GCTGTCGACG GGCCTTAGCA CGACGAGCGG TAACGTGTCG
TCGCTGTCGA CCGGCTTGAG TACGACGCAG AGCGATGTCG CGTCGCTGTC AACGGGACTG
AGTACGACGA ACAGCAATCT CGCGTCGTTG TCGACTGCGG TGTCCAACGG AGGGATTCAC
ACCAACGGCG CGGGCGGCAC GTCGATGGGG CCGGGAGCCG ACGCGTCGGG CAGCAACAGC
ACGGCGGTCG GTGGCGCAGC GTCTGCTTCG GGAGCCAATG CAACGGCGCT TGGCCAGGCG
TCGAACGCAT CGGGCAATAA TTCGACGGCA TTAGGGCAAG CATCGAGTGC ATCCGGAAGC
GGCTCCACCG CGGTGGGGCA GGGCGCGAGC GCGTCGGGCG ATGGGTCCTC GGCATTCGGC
CAAGGGGCGA TTGCCTCCGG CACGAACTCG ACGGCGCTGG GCGCTCATTC GACGGCATCG
GCGCCGAATT CAGTGGCGAT CGGTGCGAAC TCGGTAGCTT CCGCGCCGAA CACGGTGTCG
TTCGGGTCTC AGGGCCATGA GCGTCGACTG ACGAACGTCG CACCGGGGAT GGACGGTACC
GACGCAGCGA ACATGAGCCA GCTTTGGGGC GTGCAATCGA GCGTAGACCA AGCCGCGCGC
CGTGCGTATT CGGGTGTGGC GGCTGCCACC GCGCTGACGA TGATTCCGGA AGTCGATCCT
GGCAAGACGA TTGCAGTTGG GATCGGTGCC GGCAGCTATC AAGGGTATTC GGCATCCGCG
ATCGGTGTGT CCGTGCGGTT CTCCGACAAC CTGAAGGCGA AGCTCGGCAT GGGGATCAGC
AGTCAGGGCA GCACGTACGG CGGAGGCATC TCGTACCAGT GGTAA
 
Protein sequence
MNKIFRVIWC RVKAACVVVS EEACLRGGKS HSCRQGSRAA GEESVRFALS SIALAACILI 
GSLGSTLPAV AGTVIGGGAQ YPNSVGGTSS TTGDLGNSYI GASGMGTAIT GDNDCLSLTS
TRNVLNSANV GWLLGTTSQT TDPGPLYPGP GAENNQTISY SGTSNFSGGG NSAAGAQATL
AWGFNSFAAG CGNKSLGMGS STLGLNNMAS LAGSTAIGIA NTASGAGGTA IGLYNTAAGT
GSVAMGIGSQ ATGNGTIALG YGGGGTSSNA TLASGSNAIA IGGDATKGAQ ATGSDSIAMG
SEAAASSSST TAIGQYATAS NTNATALGAG GTSAATGVIA SGAGAVALGG NSTQGAQALA
SNAIAIGGQS QAASAGAIAI GQSALATGGQ AVSVGVGNTA NGNGAVAIGD PNVATGTGAV
ALGANNTATG QGAVALGNAD IATGQGSVAL GNVSTAAGAG SVAFGSNAVA NNTNDVALGS
GSVTAAPNPT GSATIGGTTY SFEGTNPTSV VSVGAVGAER QITNVAAGQL TATSTDAVNG
SQLYSTNQAI NTLSTSTSTG LSSANSSIAS LSTGLASSGN LASLSTSTST GLSSANSSIA
SLSTSTSTGL STTNSNIGSL STGLSTTNST VASLSTSTST GLSSATSSIT SLSTSTSSGI
STAQSGVNSL STGLSTTNSA VTSLSTSTST GLSSATSSIT SLSTSTSTGI GSLSTGLSTT
DSSVTSLSTS TSSGISTAQS GVNSLSTGLS TTNSTVTSLS TSASTGLSSA NSSITSLSTS
TSTGLSSLST GIANSVQYDN TSHTQVTLGG AGATTPVTLT NVAAGVNPTD AVNMSQLTSL
STSTSTGLST TNSNIGSLST ALSTTDSSVT SLSTSTSSGI STAQSGVNSL STGLSTTNST
VASLSTSTST GLSSATSSIT SLSTSTSTGI GSLSTGLSTA NSSVTSLSTS TSSGISTAQS
GVNSLSTGLS TTNSTVASLS TSTSTGLSSA TSSITSLSTS TSTGIGSLST GLSTANSSVT
SLSTSTSSGI STAQSGVNSL STGLSTTNSA VTSLSTSTST GLSSLSTGLS ATNSNISSLS
TSTSNGLSTA NSNISSLSTG LSSLSTAVNG GGTKYFHANS AQPDSQALGT NAIAVGPAAT
ASGASGIAIG DTANAAATGA VAIGQTAVAT GGQAVSIGVA NTASGDGAVA IGDPNVATGT
GAVALGANNS ANGQGAVALG NANVATGTGS LALGSTSTAA GGGSIALGTN AIANNANDVA
LGSGSVTAAA NPVASALIAG QTYSLAGGSP TSVVSVGAPG AERQITNVAA GQVSATSTDA
VNGSQMNAVT QALVSLSTST ANALSTTQNG LSSLSTGLST TQSSVSSLST GLSTTSGNVS
SLSTGLSTTQ SDVASLSTGL STTNSNLASL STAVSNGGIH TNGAGGTSMG PGADASGSNS
TAVGGAASAS GANATALGQA SNASGNNSTA LGQASSASGS GSTAVGQGAS ASGDGSSAFG
QGAIASGTNS TALGAHSTAS APNSVAIGAN SVASAPNTVS FGSQGHERRL TNVAPGMDGT
DAANMSQLWG VQSSVDQAAR RAYSGVAAAT ALTMIPEVDP GKTIAVGIGA GSYQGYSASA
IGVSVRFSDN LKAKLGMGIS SQGSTYGGGI SYQW
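
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below illustrates both steps under stated assumptions: the helper names are hypothetical, the codon table is built from the amino acid string of the standard code (table 11 uses the same codon-to-residue assignments and differs in its permitted start codons), and no special handling of alternative start codons is included because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Residues for codons in TCAG order (first, second, third base); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as printed in the record.
first_row = "ATGAATAAAA TATTTCGGGT GATCTGGTGC CGTGTCAAGG CTGCGTGCGT CGTTGTTTCG"
print(translate(first_row))  # MNKIFRVIWCRVKAACVVVS
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.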