Gene BURPS1710b_A1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1101 
Symbol 
ID3693577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1378015 
End bp1379517 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content70% 
IMG OID637731355 
Productserine metalloprotease 
Protein accessionYP_336259 
Protein GI76818474 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0573282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATAT TGATTCGTAC TGCTTCGTTC AAGGCGACCG TCCTGTGTGC CGCGCTGGCC 
GGCCTCGTTT CGGCCGCGCA AGCGGAAACC GCGGCCGCGC CCCAGGTGCC GGGGCCCGCC
GACGCGGTCA ATCAGTTGAT CGTCAAGTTG CGCGCGGTGA AGACGCCGCC CGGTGCGACG
GCCGCGAAGG CCGAGCGCGC GGACGTTCAG GCCGTCATCG ATCGCGTGCT CGCCGCGCGC
AATGCGCGGG CGGCGGGGCG TGCGTTCGGC GCGGCCGCCG CATCCGCGCC CGGCAATCCG
GACGACCCCG CCGCGGGCAT TCGCATCAAG CGCGACATGT CGGGCGGCGC GACCGTGCTG
TCGCTGCAGC GCCACGTGTC GCTCGCGCAG GCCGAGGCGC TCGCGCGCGA CTTCGCGGCG
GACGGCGCGA TCGAATATGC GGAGCCCGAT GCGCGGATGC ATCCGTTCGT CGTGCCGAAC
GATACGCGCT ATTCGGAGCA ATGGGGCTAC TTCAATCCGA CCGCCGGCGC GAATCTGCCG
AAGGCTTGGG ATCGCACGAC CGGCTCCGCG CGCGTCGTCG TCGCCGTCAT CGATACCGGC
TACCGTCCGC ATGCGGATCT CGCCGCGAAC CTGCTGCCGG GCTACGACTT CATCTCCGAT
ATCCCGAGCG CGAACGACGG CAATGGCCGC GACAGCGACG CATCGGATCC CGGCGACTGG
GTGAGCGCGC AGGAAGACGG CGATCCGAGC GGCCCATTCT ATGGCTGCGG CGCGAGCGAC
AGCTCATGGC ACGGCACGCA CGTCGCGGGC ACGATCGGCG CGGTGACGAA CAACGGCGTC
GGCGTGGCGG GCATCTCGTG GGTCGGCAAG GTGCTGCCCG TGCGCGTGCT CGGCAAGTGC
GGCGGGATGC TGAGCGACAT CGCCGACGGC ATGCGCTGGG CGGCGGGCCT GCCGGTGCCG
GGCGCGCCGT CGAATCCGAA CCCGGCGAAG GTGCTGAACC TGAGCCTCGG CGGATACGGC
CGCACATGCA GCTCGACGTA CCAGAACGCG ATCAACGAAA TCACGTCGCG CGGCGCGAAC
GTCGTTGTCG CCGCGGGCAA TAACGGCGGC TCGGTGTCGA CGACTCAGCC GGCGAATTGC
CGGGGCGTGA TCGCGGTTGG CGCGATCGAC AGCCGCGGTG TGCGCGCGAG CTTCAGCAAC
ACCGGCGCCG CGGTGAAGAT CTCCGCGCCG GGCGTCGGCA TTCTGTCGAC GCTCAATGCG
GGCAAGACCT CGCCGGGCGC GGACAGCTAC GCGAGCTATA GCGGCACGAG CATGGCAACG
CCGCATGTCG CGGGCACGGT CGCGCTGATG CTCGCCGTCA ACTCGACGCT GTCGCCTTCG
CAGATCTTGC AGCGGCTGCA ATCGAGCGCG CGGCCGTTCT CGAGCGGATC GAGCTGCTCG
ACGAGCACGT GCGGCGCAGG GCTGCTCGAC GCAGGCAACG CGGTCGACGC CGCCGCGCAG
TGA
 
Protein sequence
MSILIRTASF KATVLCAALA GLVSAAQAET AAAPQVPGPA DAVNQLIVKL RAVKTPPGAT 
AAKAERADVQ AVIDRVLAAR NARAAGRAFG AAAASAPGNP DDPAAGIRIK RDMSGGATVL
SLQRHVSLAQ AEALARDFAA DGAIEYAEPD ARMHPFVVPN DTRYSEQWGY FNPTAGANLP
KAWDRTTGSA RVVVAVIDTG YRPHADLAAN LLPGYDFISD IPSANDGNGR DSDASDPGDW
VSAQEDGDPS GPFYGCGASD SSWHGTHVAG TIGAVTNNGV GVAGISWVGK VLPVRVLGKC
GGMLSDIADG MRWAAGLPVP GAPSNPNPAK VLNLSLGGYG RTCSSTYQNA INEITSRGAN
VVVAAGNNGG SVSTTQPANC RGVIAVGAID SRGVRASFSN TGAAVKISAP GVGILSTLNA
GKTSPGADSY ASYSGTSMAT PHVAGTVALM LAVNSTLSPS QILQRLQSSA RPFSSGSSCS
TSTCGAGLLD AGNAVDAAAQ