Gene BURPS1106A_A1327 details


Gene Information

Locus tag: BURPS1106A_A1327
Symbol:
ID: 4904714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1251645
End bp: 1255040
Gene Length: 3396 bp
Protein Length: 1131 aa
Translation table: 11
GC content: 72%
IMG OID: 640144433
Product: serine protease
Protein accession: YP_001075362
Protein GI: 126458173
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
            [TIGR02601] autotransporter-associated beta strand repeat
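
The start, end, gene length and protein length fields in this record agree with each other. The Python sketch below shows the arithmetic. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1251645
end_bp = 1255040

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 3396 0 1131
```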


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGATGA CCCGGCACAA AAAACGGAAA ACGATGAAGC GCAGCGGAGC GAAGCTGCTT 
GCGCCCGTGG TCGTCGCGGC CGCCGCCGCG GTCGCGGCCC GCCCCGGCTG GGCGCAGGCC
GCGCCGTACC CGGATCCGGG CCGGCGCGGC GATCCGGCGA GTTGGCGCAC GCCGGAATTC
ACGAACGCGT GGGGGCTCGG CGCGATGCAC GCCGAGTATG CATACGCGGC CGGCTATACC
GGCGCGAACG TCGCGATCGG CGTGCTGGAC TCCGGCTACT ACGCGCAGCA TCCGGAACTA
CCCGACAGCC GCTTCGTTCC GGTGACGGCC GCGGGCGTGT CCGGCGTGCT GAACCCGAAC
AACAACAATC ACGGCACGCT GGTGAGCGGC GTCGTCGGCG GCGCGCGCGA CGGTGTCGGC
ATGCACGGCG TCGCACCCGA CGCGACGGTG TACGAAGGCA ACACGAACGC GACCGACGGC
TTCCGGTTCG GCGTATCGGA TCCGAAGTTT CCCGCGTCGG ATGCGAAGTA TTTCGCCGAG
GCCTACGATG CGCTCGCCGC AAAGGGCGTG CGGATCATCA GCAACAGCTG GGGCTCGCAG
CCGGCCAACG AGAACTACAG CACCCTGAAC AAACTCACCG ATGCCTACAA GCTGCACGAG
GCGGTGCGCA CGGCGACCGG CCGGGGCACG TGGCTCGACG CGGCGGCGAA GGTGTCGCGC
GACGGCGTGA TCAACAACTT CAGCTCGGGC AACACCGGCT ACGACAACGC GAGCCTGCGC
GGCGCGTATG CGTACTTCCA CCCCGAACTC GAAGGGCACT GGATGACGAC GACGGGCTAC
GACCAGTTGA GCGGCCAGGT CTACAACCAA TGCGGGATCG CGAAGTGGTG GTGCGTGATG
GCGCCCACGG GCGTGCCGTC GACGTCATAT TCGGGCGGCG CGGCGGCGCC GACCGGGGCG
ACCTACGCGA ACTTCAACGG CACGTCGGCC GCCGCGCCGC ACGCGTCGGC GGCGCTCGCG
CTGATCATGG AGCGCTTTCC GTACATGACG AGCGAGCAGG CGCTGTCGGT GCTGTTCACG
ACCGCGCAGA ACATGGAGCC GGACCCGAGC CGGCCGGACT ACACGAACAA CGGGCTGTTC
TCGACCGTGC ATCCGGCGAA GCCCGGCGCG TCGGGCGTGC CGAACGCGTT CGGCGGCTGG
GGGCTCGTCG ACCTGCGCCG GGCGATGAAC GGCCCGGGCC AACTGCTCGG CACGTTCGAC
GCGGCGCTGC CTGCGGGCAC CGCCGACGTG TGGTCGAACG ACATCTCCGA CGTCGCGCTC
GCCGCGCGCA AGCGCGAGGA CGACGCCGAG CACCGCGCGT GGCTCGACAC GCTGAGGACG
AAGGGATGGG AGCACGGGCT GCCCGCCGGC GCGAGCGATG GCGACCGGAT CGACTATGCG
CTCGGCGTCG CGCGCGAAAC CGCGTATCAG GCGCGCGAGT ATCAGGGCAG TCTCGTGAAA
TCGGGCGGCG GCACGCTGAC GCTCGCCGGC GCGAACACCT ATCGCGGGCC CACGACGGTC
GACGGCGGCG AATTGAGGAT CGACGGCTCG ATCGCCGCGC GCGCCGTCGT CAATCCGGCG
GGCCGGCTCA CGGTGAACGG CCGCGCGGCC GACATCGCGG TCAACGGCGG CGTGGCGACG
ATCGCCGGGA CGAGCGCGAA CCTGTCGATC GACCGGCAGG GCCGGGCCGC CGTGACAGGG
ACGACGGCGG ACGTGCGCGT CGCGAGCGGC TTTGCATCGC TCGGCGGCAC GAGCGGCAAC
GTCGCGGTGG GGGCGCTCGG CGTCGCCGCG ATCACGGGCC GCACGGCCGA CGTGGCGGTC
GACGGCGGCC GCGCGTCGCT CGACGGCGCG AGCGGCAACG TCGCGGTCGG CAACGGCGGC
GTCGTGAGCG GCAGCGGCAC GGTGCGCACG CTCACGGCGG CCGCGAACGG CACGGTCGCG
CCCGGCCATT CGGTCGGTAC GCTGACGGTG TCGGGCGACG TGCGCTTCGC GCCGGGTTCG
ATCTACGCGG TCGAGGTGTC GCCGGGCGGC GCGGGCGACC GGATCGTCGC GGGCGGCCGC
GCGCAAATCG ACGGCGGCGC GTTGGCGCTC GCGCTCGAGA ACACGCCGCC GCCGCTCACG
CCCGAGCAGT CGCGCTCGGT GCTCGGCCGT CGCTTCGAGA TCCTGAATGC GGCGGGCGGC
GTGGCCGGCC GTTTCGATGC GCCGAGCGGC TATCTGTTCG TCAATCCGGT GCTCGCCTAT
GGCCCGACGA CCGTGAGCCT CACGATCGAT CGTAACGCGA CGCCGTTTGC AAGCGTCGCG
CGGACCGCGA ACGAGCGCGG CGTCGCCGAT GCGCTCGAAA CGGCGGACCC GGGCAGCGCG
GTTTACAACA GCGTGCTGTT CGCGGCGTCC GCGCAGGCGC CGCAGGCGAC GCTCGCGCAA
CTGACGGGCG AGATCTATCC GGCCGCCTAC GCGGCGCTCG TCAACGAAAG CCGGCAAGTG
CGCGAAGCGG CGCTCGAGCG CCTGTGGACG GCGCGCGGCG CGCCGGGCCG CGCCGGCGCC
TGGGCGCGGC TGCTCGGCGC GTGGGGCAGC GCGCGCGGCG GCGACGTGAA CGGCTACACG
AGCTCGACGG GCGGCTTCCT CGCGGGCGCG GACGCGGCGC TGCTCGACAG CGTGCGGGCG
GGCGGCTTCG CCGGCTACAG CCACACCGGC GTGAACCTGA GGAATCAGCC GTCGTCCGCG
TCGTTCGACA GCTTCCATCT CGGCGCATAC GCGGGGTGGC AGCCCGGCGC ACTCGGCGTG
CGAATCGGCG CGGCGCATGC GTGGCATCGC GGCGGTGTCG ATCGCGCGGT GCAATATGGC
GCGGTTGCCG AGAACGAAAC GACGGCGCTG CACGCGGAAA CGACGCAGGT GTTCGGCGAG
GCCGGCTATC GGTTCGCGCT CGATGGCGCC GCGACGCTCG AGCCGTTCTT CGGCGTCGCG
TATGTGCATC TGAAGAACCA GGGGACGACG GAAACCGGCG GCGCGGCGGC GTTGCGCGTG
CGGCAAGGCA ATCACGACGT GACGTTCTCG ACGCTCGGCG TGCGCGGCGA AACGCGGCTT
GGCCTGACGT CGCGACTGCA GTTGACGCTG CAGGGCAGCG CGGGCTGGCA GCATGCGCTG
ACGGACGGGC AGCCGAGCGG CACGCTCGCG TTCGCGACGG GGAGCGACAC GTTCACCGTG
TCGAGCGTGC CGGTTGCGAA GGATGCGGCG GTGCTGAACG TGGGCGCCGG GCTCGAGCTC
GGCAAGAACG GATGGCTGCG CGTCGGCTAT TCCGGCTCGC TCGCGAGCCG TCAGTCCGAG
CACGCGGTGC AAGGCAGCCT GCACTGGAAG TTCTGA
 
Protein sequence
MLMTRHKKRK TMKRSGAKLL APVVVAAAAA VAARPGWAQA APYPDPGRRG DPASWRTPEF 
TNAWGLGAMH AEYAYAAGYT GANVAIGVLD SGYYAQHPEL PDSRFVPVTA AGVSGVLNPN
NNNHGTLVSG VVGGARDGVG MHGVAPDATV YEGNTNATDG FRFGVSDPKF PASDAKYFAE
AYDALAAKGV RIISNSWGSQ PANENYSTLN KLTDAYKLHE AVRTATGRGT WLDAAAKVSR
DGVINNFSSG NTGYDNASLR GAYAYFHPEL EGHWMTTTGY DQLSGQVYNQ CGIAKWWCVM
APTGVPSTSY SGGAAAPTGA TYANFNGTSA AAPHASAALA LIMERFPYMT SEQALSVLFT
TAQNMEPDPS RPDYTNNGLF STVHPAKPGA SGVPNAFGGW GLVDLRRAMN GPGQLLGTFD
AALPAGTADV WSNDISDVAL AARKREDDAE HRAWLDTLRT KGWEHGLPAG ASDGDRIDYA
LGVARETAYQ AREYQGSLVK SGGGTLTLAG ANTYRGPTTV DGGELRIDGS IAARAVVNPA
GRLTVNGRAA DIAVNGGVAT IAGTSANLSI DRQGRAAVTG TTADVRVASG FASLGGTSGN
VAVGALGVAA ITGRTADVAV DGGRASLDGA SGNVAVGNGG VVSGSGTVRT LTAAANGTVA
PGHSVGTLTV SGDVRFAPGS IYAVEVSPGG AGDRIVAGGR AQIDGGALAL ALENTPPPLT
PEQSRSVLGR RFEILNAAGG VAGRFDAPSG YLFVNPVLAY GPTTVSLTID RNATPFASVA
RTANERGVAD ALETADPGSA VYNSVLFAAS AQAPQATLAQ LTGEIYPAAY AALVNESRQV
REAALERLWT ARGAPGRAGA WARLLGAWGS ARGGDVNGYT SSTGGFLAGA DAALLDSVRA
GGFAGYSHTG VNLRNQPSSA SFDSFHLGAY AGWQPGALGV RIGAAHAWHR GGVDRAVQYG
AVAENETTAL HAETTQVFGE AGYRFALDGA ATLEPFFGVA YVHLKNQGTT ETGGAAALRV
RQGNHDVTFS TLGVRGETRL GLTSRLQLTL QGSAGWQHAL TDGQPSGTLA FATGSDTFTV
SSVPVAKDAA VLNVGAGLEL GKNGWLRVGY SGSLASRQSE HAVQGSLHWK F
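
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the Python below translates the first three 10-base blocks and the last two blocks of the gene sequence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons, which this sketch ignores because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative and are not part of the record.

```python
# Standard codon-to-amino-acid assignment (shared by translation table 11),
# with bases ordered T, C, A, G for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
head = translate("ATGCTGATGA CCCGGCACAA AAAACGGAAA")
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
tail = translate("CACTGGAAG TTCTGA")

print(head)  # prints: MLMTRHKKRK
print(tail)  # prints: HWKF
```

Both results match the start (`MLMTRHKKRK`) and end (`HWK F`) of the protein sequence listed above.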