Gene BURPS1710b_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2229 
Symbol 
ID3690239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2489664 
End bp2492870 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content67% 
IMG OID637728684 
ProductHep_Hag family protein 
Protein accessionYP_333623 
Protein GI76811430 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3846] Type IV secretory pathway, TrbL components 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGGA TTTTCAAATC GATCTGGTGC GAACAGACGC GTACGTGGGT TGCGGCATCG 
GAGCATGCCG TGGCGCGCGG TGGCCGCGCG TCGAGCGTCG TCGCGTCCGC CGGCGGATTG
GAGAAGGTGC TCAAGCTGTC GATTCTGGGC GCGGCATCGC TGATTGCGAT GGGCGTGGTC
GGACCGTTTG CCGAGGAGGC AATGGCGGCG AATAACGCCG GTGTGTGTTT GACGTACAAC
GGTAGTAGCA ACAATACATC AGGTACTGGC GGCTGGTTCG CTGATGGTTG TAAATCGGCC
GGCTGGGTGC AGGGCATGGT TACGAATAGC AAGACGGATT GGGTCGGGCT GACCGCGGAC
GACACGCAGA TCGTGCTCGA CGGTAGCGCG GGCAGCATTT ACTTCCGGAC GGGCGGCATA
AACGGCAACG TGTTGACGAT GTCGAACGCG ACCGGCGGCG TATTGCTCAG CGGCCTCGCG
GCCGGCGTCA ATCCGACCGA TGCGGTCAAC ATGTCCCAGT TGACCTCGTT GTCGACGTCG
ACGGCAACCG GCATCACCTC GCTTTCGACG TCGACGGCAA CCAGCATCGC TTCGCTTTCG
ACGAGCATGC TGTCGCTCGG CGTGGGCGTC GTGACGCAAG ACGCCTCGAC CGGCGCGATC
AGCGTCGGCG CCAATTCGCC GGGCCTGACG GTGGATTTCG CGGGGGGCCA GGGCGCGCGC
ACGCTGACGG GCGTCGCCGC GGGCGTCAAC GCTACGGACG CGGTCAATGT CGGCCAGCTG
GCGTCGCTGT CGACGAGCAC GGCAGCGGGG CTTTCCACCG CCGCGAGCGG CGTCGCGTCG
CTGTCGACGT CGCTGCTCGG CGCGGCGGGC GATCTGGCGT CACTGTCGAC GAGCGCGTCG
ACGGGGCTCG CCACTGCGGA TAGCGGCATC GCGTCGTTGT CCACGTCGCT GCTCGGTACC
GCGGACAACG TGACGTCGCT GTCGACGAGC CTCAGCACGG TCAACGCGAA TCTGGCCGGC
CTGCAGACCT CGGTGGACAA CGTCGTGTCA TACGACGATC CGTCGAAGTC GGCGATCACC
CTCGGCGGCG CGGGCGTCAC GACGCCCGTC CTGCTGACGA ACGTGGCTGC GGGGAAGATC
GCCGCGACCA GCACGGACGC GGTGAACGGT TCGCAGCTTT ACACGCTCCA GCAGGAATTC
TCGCAGCAGT ACGATCTGCT GACGTCGCAA GTCTCGTCGC TCAGCACTTC GGTGTCGGGT
CTCCAAGGCA GCGTCTCGGC AAATACGGGA ACCGCGTCGG GTGACAACAG CACGGCGAGC
GGTGACAACG CGACCGCGTC GGGCACGAAC AGCACGGCCA ACGGCACGAA CTCGACCGCG
TCGGGTGATA ACAGCACGGC AAGCGGGACG AATGCGTCGG CGAGCGGCGA GAACAGCACG
GCGACGGGTA CGGACTCGAC CGCATCGGGT AGCAATAGCA CGGCCAACGG GACGAACTCG
ACTGCGTCGG GTGATAACAG CACGGCAAGC GGGACGAATG CGTCGGCGAC GGGCGAGAAC
AGCACGGCGA GCGGCACGAA CGCATCGGCG ACGGGCGAGA ACAGCACGGC GACGGGCACG
GCTTCGACCG CATCGGGCAG TAACAGCACG GCGAACGGCA CGAACTCGAC CGCGTCGGGC
GAGAACAGCA CGGCGACGGG TACGGACTCG ACCGCATCGG GCAGCAACAG CACGGCCAAC
GGGACGAACT CGACCGCGTC GGGTGATAAC AGCACGGCGA GCGGCACGAA CGCATCGGCG
ACCGGTGAGA ACAGCACGGC GACAGGCACG GATTCGACCG CATCGGGCAG CAACAGCACA
GCCAACGGCA CGAACTCGAC CGCGTCGGGC GATAACAGCA CGGCGAGCGG CACGAACGCA
TCAGCGACCG GTGAGAACAG CACGGCGACG GGTACGGACT CGACCGCATC GGGCAGCAAC
AGCACGGCCA ACGGGGCGAA CTCGACCGCA TCGGGCGATA ACAGCACGGC GAGCGGCACG
AACGCATCGG CGACGGGCGA GAACAGCACG GCGACAGGCA CGGATTCGAC CGCATCGGGC
AGCAACAGCA CGGCCAACGG AACGAACTCG ACTGCGTCGG GCAACAACAG CACGGCGAGC
GGGACGAACG CGTCGGCGAC GGGTGAAAAC AGCACGGCGA CGGGTACGGA CTCGGCCGCA
TCCGGTACGA ATAGCACGGC CAACGGCACG AACTCGACCG CGTCGGGCGA TAACAGCACG
GCGAGCGGGA CGAATGCGTC GGCGACCGGT GAGAACAGCA CGGCGACGGG TACGGCTTCG
ACCGCGTCGG GCAGCAACAG CACGGCCAAC GGTGCGAACT CGACGGCATC CGGCGCGGGG
GCGACGGCAA CGGGTGAAAA CGCCGCAGCC ACGGGCGCGG GCGCGACGGC GACCGGCAAC
AACGCATCGG CATCGGGCAC GAGCAGCACG GCCGGCGGTG CGAATGCAAT CGCGTCGGGC
GAGAACAGCA CGGCCAACGG CGCGAACTCG ACCGCATCCG GCAACGGCAG CTCGGCGTTC
GGCGAGAGCG CGGCGGCAGC CGGCGACGGC AGCACGGCGC TGGGTTCAAA CGCTGTCGCG
TCGGGTGTCG GCAGCGTCGC GACGGGCGCG GGTTCGGTCG CGTCCGGCGC GAACAGTTCG
GCGTACGGTA CGGGCTCGAA CGCGACGGGC GCGGGCAGCG TCGCCATCGG TCAGGGCGCG
ACGGCCTCGG GATCGAACTC GGTCGCGCTT GGCACCGGTT CTGTCGCGTC GGAGGACAAC
ACGGTATCGG TCGGCTCCGC AGGCAGCGAG CGCAGGATCA CCAACGTCGC CGCCGGCGTC
AATGCCACCG ACGCCGTCAA CGTCGGCCAG TTGAACAGCG CCGTGTCGGG CATCCGGAAT
CAGATGGACG GCATGCAAGG CCAGATCGAT ACGCTTGCAC GCGATGCGTA TTCCGGTATC
GCGGCCGCGA CCGCGTTGAC GATGATTCCG GACGTGGATC CGGGCAAGAC GCTGGCCGTG
GGCATCGGCA CGGCCAATTT CAAGGGCTAC CAAGCCTCCG CGCTCGGCGC GACCGCACGT
ATCACCCAGA ACCTCAAGGT GAAGACGGGC GTGAGCTACA GCGGCAGCAA CTACGTGTGG
GGCGCGGGCA TGTCGTATCA ATGGTAA
 
Protein sequence
MNRIFKSIWC EQTRTWVAAS EHAVARGGRA SSVVASAGGL EKVLKLSILG AASLIAMGVV 
GPFAEEAMAA NNAGVCLTYN GSSNNTSGTG GWFADGCKSA GWVQGMVTNS KTDWVGLTAD
DTQIVLDGSA GSIYFRTGGI NGNVLTMSNA TGGVLLSGLA AGVNPTDAVN MSQLTSLSTS
TATGITSLST STATSIASLS TSMLSLGVGV VTQDASTGAI SVGANSPGLT VDFAGGQGAR
TLTGVAAGVN ATDAVNVGQL ASLSTSTAAG LSTAASGVAS LSTSLLGAAG DLASLSTSAS
TGLATADSGI ASLSTSLLGT ADNVTSLSTS LSTVNANLAG LQTSVDNVVS YDDPSKSAIT
LGGAGVTTPV LLTNVAAGKI AATSTDAVNG SQLYTLQQEF SQQYDLLTSQ VSSLSTSVSG
LQGSVSANTG TASGDNSTAS GDNATASGTN STANGTNSTA SGDNSTASGT NASASGENST
ATGTDSTASG SNSTANGTNS TASGDNSTAS GTNASATGEN STASGTNASA TGENSTATGT
ASTASGSNST ANGTNSTASG ENSTATGTDS TASGSNSTAN GTNSTASGDN STASGTNASA
TGENSTATGT DSTASGSNST ANGTNSTASG DNSTASGTNA SATGENSTAT GTDSTASGSN
STANGANSTA SGDNSTASGT NASATGENST ATGTDSTASG SNSTANGTNS TASGNNSTAS
GTNASATGEN STATGTDSAA SGTNSTANGT NSTASGDNST ASGTNASATG ENSTATGTAS
TASGSNSTAN GANSTASGAG ATATGENAAA TGAGATATGN NASASGTSST AGGANAIASG
ENSTANGANS TASGNGSSAF GESAAAAGDG STALGSNAVA SGVGSVATGA GSVASGANSS
AYGTGSNATG AGSVAIGQGA TASGSNSVAL GTGSVASEDN TVSVGSAGSE RRITNVAAGV
NATDAVNVGQ LNSAVSGIRN QMDGMQGQID TLARDAYSGI AAATALTMIP DVDPGKTLAV
GIGTANFKGY QASALGATAR ITQNLKVKTG VSYSGSNYVW GAGMSYQW