Gene BURPS1106A_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2749 
Symbol 
ID4903054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2714243 
End bp2715277 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content69% 
IMG OID640135976 
ProductU32 family peptidase 
Protein accessionYP_001067000 
Protein GI126454060 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.254012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAA GCAGCCACTT CGCGACGGGC GCCGCGCCGA TCGAACTCGT GTGCCCGGCG 
GGCAGCCTGC CCGCGCTGAA GGCCGCGGTC GACAACGGCG CGGACTGCGT GTATCTCGGT
TTTCGCGACG CGACGAACGC GCGCAACTTC GCCGGCCTGA ACTTCGACGC GCAGGCGATC
GCGGCCGGCA TCCGCTATGC GCGCGAGCGC GGCCGCAAGG TGCTCGTCGC GCTCAACACG
TATCCGCAGC CGGACGGCTG GGCCGCGTGG CGGGAGGCGG TGGGCCGCGC GGCCGACGCG
GGCGTGGACG CGATCATCGT CGCCGATCCG GGGCTCATGC GCTTCGCGCG CGAGCGCTAC
CCGGAGCTGC GACTGCACCT GTCGGTGCAG GGCTCGGCGA CGAACTACGA GGCGATCAAC
TTCTATCACG AGCACTTCGG CGTTTCGCGC GCGGTGCTGC CGCGCGTGCT GTCGCTCGCG
CAGGTCGAAC AGGTGGCCGA GAACACGCCG GTCGAAATCG AGGTGTTCGG CTTCGGCAGT
CTGTGCGTGA TGGTCGAGGG GCGCTGCGCG CTGTCGTCGT ATGCAACGGG CGAATCGCCG
AACACGCGCG GCGTGTGCTC GCCCGCGAAG GCGGTGCGCT GGCAGAAGAC GCCGGACGGC
CTCGAATCGC GGCTGAACGG CGTGCTGATC GACCGCTACG AAGACGGCGA GAACGCCGGC
TATCCGACGC TCTGCAAGGG GCGCTTCACG GTGGCCGACG AGAGCTACTA CGCGATCGAG
GAACCGACGA GCCTGAACAC GCTCGAGCTG CTGCCGAAGC TGATGCAGAT CGGCATACGC
GCGATCAAGA TCGAAGGCCG TCAGCGCAGC CCCGCGTACG TCGCGCAGGT GACGCGCGTG
TGGCGCGATG CGATCGACCA GTGCACGGCG AACCTCGCGC GCTACTACGT GAAGCCCGCG
TGGATGACGG AACTGAACAA GGTCGCGGAA GGGCAGCAGC ATACGCTCGG CGCCTACCAC
CGGCCGTGGA AATGA
 
Protein sequence
MTQSSHFATG AAPIELVCPA GSLPALKAAV DNGADCVYLG FRDATNARNF AGLNFDAQAI 
AAGIRYARER GRKVLVALNT YPQPDGWAAW REAVGRAADA GVDAIIVADP GLMRFARERY
PELRLHLSVQ GSATNYEAIN FYHEHFGVSR AVLPRVLSLA QVEQVAENTP VEIEVFGFGS
LCVMVEGRCA LSSYATGESP NTRGVCSPAK AVRWQKTPDG LESRLNGVLI DRYEDGENAG
YPTLCKGRFT VADESYYAIE EPTSLNTLEL LPKLMQIGIR AIKIEGRQRS PAYVAQVTRV
WRDAIDQCTA NLARYYVKPA WMTELNKVAE GQQHTLGAYH RPWK