Gene BURPS668_A1911 details

Gene Information

Locus tag: BURPS668_A1911
Symbol:
ID: 4887843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1856009
End bp: 1858339
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 68%
IMG OID: 640131849
Product: serine protease
Protein accession: YP_001062906
Protein GI: 126445312
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
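
The start, end, gene length, and protein length fields can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The record does not state either convention explicitly, and the helper names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1856009
END_BP = 1858339


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: total codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2331, matching "Gene Length"
print(protein_length(length_bp))  # 776, matching "Protein Length"
```

Under those assumptions both derived values agree with the lengths reported in the record.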


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.438045
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCGGC GCGCGTGGGC GATAGTCGGG ATCGCGCATG GATACGGGCG AAGCGGAAAA 
CGGCACGGCG TCGGCACCGG CTTCGGACAG TCCCGGCACG ACGAAGGCAC GCTTGCGACG
CGCGCGCCGC GAATGCCCGC GGCGGACGCG TCGCAATCGA CGAGCCGGCA GCCAAGCGCG
AGCGCCGCTC GTCTTACGCT ACCATTACAC GCCATGCCCC TTGCAAAGAA CGCCATGCCT
CATGCTTCCT GGCCCGAACA GGCCGATCCG TATCAATTCC TCGAAGAACT GGACAGCGCC
GCGAGCGTCG GCTGGGTCGA CGCGCAAAAC GCCCGCACGC ACGATGCGCC CTGGCTCGAC
GAAGCGCACT ATCGCGCGCT GGTCGAGCGC TTCACCCGGG CGCTGCTGCC GCGCGAGCGC
CCGGTGATTC CGCAGCGCTG GCAGGACTGG GCGTACGACG TCTGGCAGGA CGAACAGCAT
CCGAAGGGCC TGTGGCGGCG CACGCGATGG ACGAGCTGGC GCAGCGGCCA CGCGGACTGG
CAAACGTTGA TCGACCTCGA TGCGCTTGGT GAAGCGCAAG GCGTGCAGTG GGTATTCGAC
GATCAGCTCA TCCTCGAGCC TGACGGCGAT CGTGCGCTGA TCGTGCTGTC CGACGGCGGC
GCCGACGCGG TGGTCGTCCG CGAGTTCGAC ATCGCGCAAT GCCGGTTCGT CGACGACGGC
TTCTCGATCG AAGCGGCCGG CAAGCATTCG GTCGAATGGA TCGATCGCGA CACGATCTAC
GTCGGCTGGG ACGACGGCGG CGCCACCGTC ACGCGCTCCG GCTATCCGCG CGAAGTCCGG
CGCTGGACGC GCGGCACGCC GCTGTCCAGC GCGCCCGTGG TGTTTCGCGG CGCGCGCGGC
GACATCTCGG TCGATGCGCA ATACGATCCG CTCGACCGGC ATCACGCGAT CGAGCAGGCG
ATCAATTTCT ACGACGCGAA CACGTATCGC CTCGCCGAGG ACGGCGCGTG GGCGCGCTAC
GACGTGCCAC CGCACGTCGA AGTCGGTTAC TGGAGCGGGT GGCTGCTGCT TCAGCCGCGG
CTCGACTGGA CTTGCGGCGG CGCGCGCTAC GCGGGCGGCA GCCTGCTCGC GATCCGCGAG
GACGCGTTCG TCGCCGGTGA GCGCGCGTTC GCCGCGCTGT TCGAGCCGAA CGAGCGCACG
TCCGCATGCG GCTGGACGCA CACGCGCCGC TACGTGCTGG TGTCGTGGCT CGACGACGTG
CTCACGCGCA CGATGCTCTG GCTTCCCGAA CGTCAGGATG ACGGAGCATG GCGCTGGCAT
GCTCGTCCGT TCCCCGTGCG AGGGCTCGCG CAAGTGGACG TGTCGCCCGT CGAGCCCACG
TTCGACGACG AGGTGTACGT GAGCGTCGAC GATTACCTGA AGCCGCCCGA GTATTCGCTC
GCGAATCTCG CCAGCGACGA CCTGTCCGCC TGGACGCTGC TCGACCGCTG GCCGACGCAG
TTCGACGCGT CCGAACTGAC GGTGCGGCGC GAACACGCGC GCTCGCGCGA CGGCACGCTC
GTGCCTTATA CGCTGGTCGG GCCGCGCGAC GTGCTGGACA ATGCGGCGCG CGCGCCGCGC
CCCTGCCTGT TGAACGGCTA CGGCGGCTTC GCGATTGCGC TCACGCCCGA TTACGATCCG
CTGCTCGGCA TCGGCTGGCT CGAGAAAGGC GGCATCGCGG TGTTCGCCCA TATTCGCGGC
GGCGGCGAGT TCGGCACGCA GTGGCACGAA TCGGCGCGGC AAACGCAACG GCAGCGATCG
TTCGACGATT TCATCGCGGT CGCCGAAAAA CTCGTCGCGG ACGGCGTGAC GAGCGCCGCG
CAACTGGGTA TTCGCGGCGG CAGCAACGGC GGGCTGCTGG TCGCGGCATG CATGATTCAG
CGCCCGGACC TGTTCGGCGC GGTGGTGAGC GACGTGCCGC TTCTCGACAT GCAGCGCTAT
GCGCTGCTGC ACGCGGGCGC ATCGTGGCTG GACGAATTCG GCGATCCCGA CGATCCGGCG
CATGCGTCGG CGCTCGCGGC CTACTCGCCG TATCACCGGG TCGCGCGCGA CATCGCGTAT
CCGCCCGCGC TGTTCACGAC ATCGACGAGC GACGACCGCG TGCATCCCGC CCATGCGAGA
AAAATGGTCG CGCGCATGCA GGCGCAAGGG CACCGGAACG TATGGCTGAT CGAGAAAACC
GATGGCGGCC ACGGCAGCGC GGACGCGATC GATACCGCCG AGCACGAAGC GATCGGCTAT
GTGTTTCTGT GGACTCACTT GTCCCGCGGC GCGCATGACG CGCGCGAGTG A
 
Protein sequence
MTRRAWAIVG IAHGYGRSGK RHGVGTGFGQ SRHDEGTLAT RAPRMPAADA SQSTSRQPSA 
SAARLTLPLH AMPLAKNAMP HASWPEQADP YQFLEELDSA ASVGWVDAQN ARTHDAPWLD
EAHYRALVER FTRALLPRER PVIPQRWQDW AYDVWQDEQH PKGLWRRTRW TSWRSGHADW
QTLIDLDALG EAQGVQWVFD DQLILEPDGD RALIVLSDGG ADAVVVREFD IAQCRFVDDG
FSIEAAGKHS VEWIDRDTIY VGWDDGGATV TRSGYPREVR RWTRGTPLSS APVVFRGARG
DISVDAQYDP LDRHHAIEQA INFYDANTYR LAEDGAWARY DVPPHVEVGY WSGWLLLQPR
LDWTCGGARY AGGSLLAIRE DAFVAGERAF AALFEPNERT SACGWTHTRR YVLVSWLDDV
LTRTMLWLPE RQDDGAWRWH ARPFPVRGLA QVDVSPVEPT FDDEVYVSVD DYLKPPEYSL
ANLASDDLSA WTLLDRWPTQ FDASELTVRR EHARSRDGTL VPYTLVGPRD VLDNAARAPR
PCLLNGYGGF AIALTPDYDP LLGIGWLEKG GIAVFAHIRG GGEFGTQWHE SARQTQRQRS
FDDFIAVAEK LVADGVTSAA QLGIRGGSNG GLLVAACMIQ RPDLFGAVVS DVPLLDMQRY
ALLHAGASWL DEFGDPDDPA HASALAAYSP YHRVARDIAY PPALFTTSTS DDRVHPAHAR
KMVARMQAQG HRNVWLIEKT DGGHGSADAI DTAEHEAIGY VFLWTHLSRG AHDARE
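
The protein sequence and the GC content field can, in principle, be re-derived from the gene sequence. The following is a minimal sketch rather than the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGACGCGGC GCGCGTGGGC GATAGTCGGG ATCGCGCATG GATACGGGCG AAGCGGAAAA"
print(translate(first_line))  # MTRRAWAIVGIAHGYGRSGK
```

The 20 residues printed correspond to the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.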