Gene BURPS668_A2861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2861 
Symbol 
ID4887023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2721808 
End bp2723301 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content71% 
IMG OID640132797 
Productserine metalloprotease 
Protein accessionYP_001063853 
Protein GI126442962 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0448584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTCGTA CTGCTTCGTT CAAGGCGACC GTCCTGTGTG CCGCGCTGGC CGGCCTCGTT 
CCGGCCGCGC AAGCGGAAAC CGCGGCCGCG CCCCAGGTGC CGGGGCCCGC CGACGCGGTC
AATCAGTTGA TCGTCAAGTT GCGCGCGGTG AAGACGCCGC CCGGTGCGAC GGCCGCGAAG
GCCGAGCGCG CGGACGTTCA GGCCGTCATC GATCGCGTGC TCGCCGCGCG CAATGCGCGG
GCGGCGGGGC GTGCGTTCGG CGCGGCCGCC GCATCCGCGC CCGGCAATCC GGACGACCCC
GCCGCGGGCA TTCGCATCAA GCGCGACATG TCGGGCGGCG CGACCGTGCT GTCGATGCAG
CGCCACGTGT CGCTCGCGCA AGCCGAGGCG CTCGCGCGCG ACTTCGCGGC GGACGGCGCG
ATCGAATATG CGGAGCCCGA TGCGCGGATG CATCCGTTCG TCGTGCCGAA CGATACGCGC
TATTCGGAGC AATGGGGCTA CTTCAATCCG ACCGCCGGCG CGAATCTGCC GAAGGCTTGG
GATCGCACGA CCGGCTCCGC GCGCGTCGTC GTCGCCGTCA TCGATACCGG CTACCGTCCG
CATGCGGATC TCGCCGCGAA CCTGCTGCCG GGCTACGACT TCATCTCCGA TATCCCGAGC
GCGAACGACG GCAATGGCCG CGACAGCGAC GCATCGGATC CCGGCGACTG GGTGAGCGCG
CAGGAAGACG GCGATCCGAG CGGCCCGTTC TACGGCTGCG GCGCGAGCGA CAGCTCATGG
CACGGCACGC ACGTCGCGGG CACGATCGGC GCGGTGACGG ACAACGGCGT CGGCGTGGCG
GGCATCTCGT GGGTCGGCAA GGTGCTGCCC GTGCGCGTGC TCGGCAAGTG CGGCGGGATG
CTGAGCGACA TCGCCGACGG CATGCGCTGG GCGGCGGGCC TGCCGGTGCC GGGCGCGCCG
TCGAATCCGA ACCCGGCGAA GGTGCTGAAC CTGAGCCTCG GCGGATACGG CCGCACATGC
AGCTCGACGT ACCAGAACGC GATCAACGAA ATCACGTCGC GCGGCGCGAA CGTCGTTGTC
GCCGCGGGCA ACAACGGCGG CTCGGTGTCG ACGACTCAGC CGGCGAACTG CCGGGGCGTG
ATCGCGGTCG GCGCGATCGA CAGCCGCGGT GTGCGCGCGA GCTTCAGCAA CACCGGCGCC
GCGGTGAAGA TCTCCGCGCC GGGCGTCGGC ATTCTGTCGA CGCTCAATGC GGGCAAGACC
TCGCCGGGCG CGGACAGCTA CGCGAGCTAC AGCGGCACGA GCATGGCAAC GCCGCATGTC
GCGGGCACGG TCGCGCTGAT GCTCGCCGTC AACTCGACGC TGTCGCCCTC GCAGGTCTTG
CAGCGGCTGC AATCGAGCGC GCGGCCGTTC TCGAGCGGAT CGAGCTGCTC GACGAGCACG
TGCGGCGCAG GGCTGCTCGA CGCAGGCAAC GCGGTCGACG CCGCCGCGCA GTGA
 
Protein sequence
MIRTASFKAT VLCAALAGLV PAAQAETAAA PQVPGPADAV NQLIVKLRAV KTPPGATAAK 
AERADVQAVI DRVLAARNAR AAGRAFGAAA ASAPGNPDDP AAGIRIKRDM SGGATVLSMQ
RHVSLAQAEA LARDFAADGA IEYAEPDARM HPFVVPNDTR YSEQWGYFNP TAGANLPKAW
DRTTGSARVV VAVIDTGYRP HADLAANLLP GYDFISDIPS ANDGNGRDSD ASDPGDWVSA
QEDGDPSGPF YGCGASDSSW HGTHVAGTIG AVTDNGVGVA GISWVGKVLP VRVLGKCGGM
LSDIADGMRW AAGLPVPGAP SNPNPAKVLN LSLGGYGRTC SSTYQNAINE ITSRGANVVV
AAGNNGGSVS TTQPANCRGV IAVGAIDSRG VRASFSNTGA AVKISAPGVG ILSTLNAGKT
SPGADSYASY SGTSMATPHV AGTVALMLAV NSTLSPSQVL QRLQSSARPF SSGSSCSTST
CGAGLLDAGN AVDAAAQ