Gene BURPS668_A1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1408 
Symbol 
ID4886687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1317154 
End bp1320549 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content72% 
IMG OID640131347 
Productserine protease 
Protein accessionYP_001062405 
Protein GI126445568 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0123331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATGA CCCGGCACAA AAAACGGAAA ATGATGAAGC GCAGCGGAGC GAAGCTGCTT 
GCGCCCGTGG TCGTCGCGGC CGCCGCCGCG GTCGCGGCCC GCCCCGGCTG GGCGCAAGCC
GCGCCGTACC CGGATCCGGG CCGGCGCGGC GATCCGGCGA GTTGGCGCAC GCCGGAATTC
ACGAACGCGT GGGGGCTCGG CGCGATGCAC GCCGAGTATG CATACGCGGC CGGCTATACC
GGCGCGAACG TCGCGATCGG CGTGCTGGAC TCCGGCTACT ACGCGCAGCA TCCGGAACTG
CCCGACAGCC GCTTCGTTCC GGTGACGGCC GCGGGTGTGT CCGGCGTGCT GAACCCGAAC
AACAACAATC ACGGCACGCT GGTGAGCGGC GTCGTCGGCG GCGCGCGCGA CGGTGTCGGC
ATGCACGGCG TCGCACCCGA CGCGACGGTG TACGAAGGCA ACACGAACGC GACCGACGGC
TTCCGGTTCG GCGTATCGGA TCCGAAGTTT CCCGCGTCGG ATGCGAAGTA TTTCGCCGAG
GCCTACGATG CGCTCGCCGC AAAGGGCGTG CGGATCATCA GCAACAGCTG GGGCTCGCAG
CCGGCCAACG AGAACTACAG CACCCTGAAC AAACTCACCG ATGCCTACAA GCTGCATGAG
GCGGTGCGCA CGGCGACCGG CCGGGGCACG TGGCTCGACG CGGCGGCGAA GGTGTCGCGC
GACGGCGTGA TCAACAACTT CAGCTCGGGC AACACCGGCT ACGACAACGC GAGCCTGCGC
GGCGCGTATG CGTACTTCCA CCCCGAACTC GAAGGGCACT GGATGACGAC GACGGGCTAC
GACCAGTTGA GCGGCCAGGT CTACAACCAA TGCGGGATCG CGAAGTGGTG GTGCGTGATG
GCGCCCACGG GCGTGCCGTC GACGTCATAT TCGGGCGGCG CGGCGGCGCC GACCGGGGCG
ACCTACGCGA ACTTCAACGG CACGTCGGCC GCCGCGCCGC ACGCGTCGGC GGCGCTCGCG
CTGATCATGG AGCGCTTTCC GTACATGACG AGCGAGCAGG CGCTGTCGGT GCTGTTCACG
ACCGCGCAGA ACATGGAGCC GGACCCGAGC CGGCCGGACT ACACGAACAA CGGGCTGTTC
TCGACCGTGC ATCCGGCGAA GCCCGGCGCG TCGGGCGTGC CGAACGCGTT CGGCGGCTGG
GGGCTCGTCG ACCTGCGCCG GGCGATGAAC GGCCCGGGCC AACTGCTCGG CACGTTCGAC
GCGGCGCTGC CTGCGGGCAC CGCCGACGTG TGGTCGAACG ACATCTCCGA CGTCGCGCTC
GCCGCGCGCA AGCGCGAGGA CGACGCCGAG CACCGCGCGT GGCTCGACAC GCTGAGGACG
AAGGGATGGG AGCACGGGCT GCCCGCCGGC GCGAGCGATG GCGACCGGAT CGACTATGCG
CTCGGCGTCG CGCGCGAAAC CGCGTATCAG GCGCGCGAGT ATCAGGGCAG TCTCGTGAAA
TCGGGCGGCG GCACGCTGAC GCTCGCCGGC GCGAGCACCT ATCGCGGGCC CACGACGGTC
GACGGCGGCG AATTGAGGAT CGACGGCTCG ATCGCCGCGC GCGCCGTCGT CAATCCGGCG
GGCCGGCTCA CGGTGAACGG CCGCGCGGCC GACATCGCGG TCAACGGCGG CGTGGCGACG
ATCGCCGGGA CGAGCGCGAA CCTGTCGATC GACCGGCAGG GCCGGGCCGC CGTGACAGGG
ACGACGGCGG ACGTGCGCGT CGCGAGCGGC TTTGCATCGC TCGGCGGCAC GAGCGGCAAC
GTCGCGGTGG GGGCGCTCGG CGTCGCCGCG ATCACGGGCC GCACGGCCGA CGTGGCGGTC
GACGGCGGCC GCGCGTCGCT CGACGGCGCG AGCGGCAACG TCGCGGTCGG CAACGGCGGC
GTCGTGAGCG GCAGCGGCAC GGTGCGCACG CTCACGGCGG CCGCGAACGG CACGGTCGCG
CCCGGCCATT CGGTCGGTAC GCTGACGGTG TCGGGCGACG TGCGCTTCGC GCCGGGTTCG
ATCTACGCGG TCGAGGTGTC GCCGGGCGGC GCGGGCGACC GGATCGTCGC GGGCGGCCGC
GCGCAAATCG ACGGCGGCGC GTTGGCGCTC GCGCTCGAGA ACACGCCGCC GCCGCTCACG
CCCGAGCAGT CGCGCTCGGT GCTCGGCCGT CGCTTCGAGA TCCTGAATGC GGCGGGCGGC
GTGGCCGGCC GTTTCGATGC GCCGAGCGGC TATCTGTTCG TCAATCCGGT GCTCGCCTAT
GGCCCGACGA CCGTGAGCCT CACGATCGAT CGTAACGCGA CGCCGTTCGC AAGCGTCGCG
CGGACCGCGA ACGAGCGCGG CGTCGCCGAT GCGCTCGAAA CGGCGGACCC GGGCAGCGCG
GTTTACAACA GCGTGCTGTT CGCGGCGTCC GCGCAGGCGC CGCAGGCGGC GCTCGCGCAA
CTGACGGGCG AGATCTATCC GGCCGCCTAT GCGGCGCTCG TCAACGAAAG CCGGCAAGTG
CGCGAAGCGG CGCTCGAGCG CCTGTGGACG GCGCGCGGCG CGCCGGGCCG CGCCGGCGCC
TGGGCGCGGC TGCTCGGCGC GTGGGGCAGC GCGCGCGGCG GCGACGTGAA CGGCTACACG
AGCTCGACGG GCGGCTTCCT CGCGGGCGCG GACGCGGCGT TGCTCGACGG CGTGCGGGCG
GGCGGCTTCG CCGGCTACAG CCACACCGGC GTGAACCTGA GGAACCAGCC GTCGTCCGCG
TCGTTCGACA GCTTCCATCT CGGCGCATAC GCGGGGTGGC AGCCCGGCGC ACTCGGCGTG
CGAATCGGCG CGGCGCATGC GTGGCATCGC GGCGGCGTCG ATCGCGCGGT GCAATATGGC
GCGGTTGCCG AGAACGAAAC GACGGCGCTG CACGCGGAAA CGACGCAGGT GTTCGGCGAG
GCCGGCTATC GGTTCGCGCT CGATGGCGCC GCGACGCTCG AGCCGTTCTT CGGCGTTGCG
TATGTGCATC TGAAGAACCA GGGGACGACG GAAACCGGCG GCGCGGCGGC GTTGCGCGTG
CGGCAAGGCA ATCACGACGT GACGTTCTCG ACGCTCGGCG TGCGCGGCGA AACGCGGCTT
GGCCTGACGT CGCGACTGCA GTTGACGCTG CAGGGCAGCG CGGGCTGGCA GCATGCGCTG
ACGGACGGGC AGCCGAGCGG CACGCTCGCG TTCGCGACGG GGAGCGACAC GTTCACCGTG
TCGAGCGTGC CGGTTGCGAA GGATGCGGCG GTGCTGAACG TGGGCGCCGG GCTCGAGCTC
GGCAAGAACG GATGGCTGCG CGTCGGCTAT TCCGGCTCGC TCGCGAGCCG TCAGTCCGAG
CACGCGGTGC AAGGCAGCCT GCACTGGAAG TTCTGA
 
Protein sequence
MLMTRHKKRK MMKRSGAKLL APVVVAAAAA VAARPGWAQA APYPDPGRRG DPASWRTPEF 
TNAWGLGAMH AEYAYAAGYT GANVAIGVLD SGYYAQHPEL PDSRFVPVTA AGVSGVLNPN
NNNHGTLVSG VVGGARDGVG MHGVAPDATV YEGNTNATDG FRFGVSDPKF PASDAKYFAE
AYDALAAKGV RIISNSWGSQ PANENYSTLN KLTDAYKLHE AVRTATGRGT WLDAAAKVSR
DGVINNFSSG NTGYDNASLR GAYAYFHPEL EGHWMTTTGY DQLSGQVYNQ CGIAKWWCVM
APTGVPSTSY SGGAAAPTGA TYANFNGTSA AAPHASAALA LIMERFPYMT SEQALSVLFT
TAQNMEPDPS RPDYTNNGLF STVHPAKPGA SGVPNAFGGW GLVDLRRAMN GPGQLLGTFD
AALPAGTADV WSNDISDVAL AARKREDDAE HRAWLDTLRT KGWEHGLPAG ASDGDRIDYA
LGVARETAYQ AREYQGSLVK SGGGTLTLAG ASTYRGPTTV DGGELRIDGS IAARAVVNPA
GRLTVNGRAA DIAVNGGVAT IAGTSANLSI DRQGRAAVTG TTADVRVASG FASLGGTSGN
VAVGALGVAA ITGRTADVAV DGGRASLDGA SGNVAVGNGG VVSGSGTVRT LTAAANGTVA
PGHSVGTLTV SGDVRFAPGS IYAVEVSPGG AGDRIVAGGR AQIDGGALAL ALENTPPPLT
PEQSRSVLGR RFEILNAAGG VAGRFDAPSG YLFVNPVLAY GPTTVSLTID RNATPFASVA
RTANERGVAD ALETADPGSA VYNSVLFAAS AQAPQAALAQ LTGEIYPAAY AALVNESRQV
REAALERLWT ARGAPGRAGA WARLLGAWGS ARGGDVNGYT SSTGGFLAGA DAALLDGVRA
GGFAGYSHTG VNLRNQPSSA SFDSFHLGAY AGWQPGALGV RIGAAHAWHR GGVDRAVQYG
AVAENETTAL HAETTQVFGE AGYRFALDGA ATLEPFFGVA YVHLKNQGTT ETGGAAALRV
RQGNHDVTFS TLGVRGETRL GLTSRLQLTL QGSAGWQHAL TDGQPSGTLA FATGSDTFTV
SSVPVAKDAA VLNVGAGLEL GKNGWLRVGY SGSLASRQSE HAVQGSLHWK F