Gene BTH_II2193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II2193 
Symbol 
ID3845692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2694759 
End bp2696003 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content70% 
IMG OID637839494 
Productproline iminopeptidase 
Protein accessionYP_440381 
Protein GI83717984 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGCGGCG CTACATCGCG CCGACGGGCC GCACGCTGGT GTGCATGCCG ATCGAGCGGA 
TCGCATACGC GCGGAAGGCG TAGCGGCGGC GACGCGGCCG GCACGGCCGA CGGCCGGCCG
CGTCCGAGGC GAACGCGCGC GCCGGTTTTC CGTCGACTCG GCATAATGAT GAGGCAGTCG
TTGCTTTCGA CGCCGTGCCG GATCGGCGCG GCCGGGCGCG TGCGGCGCAT CGCACGTGCC
GCATGCGCGC GTTGTCCGCG CGCGATCGCC CCAGCCGTTT TCCTCCATTC AACCGGAGTG
CCTCTCTTGT ACCCACCAAT CGAACCTTAC GCACACGGCT TGCTCGATAC CGGCGACGGC
CATCGCGTGT ATTGGGAGCT GTGCGGCAAC CCCGACGGCA AGCCGGCCGT CTTCCTGCAC
GGCGGCCCGG GCAGTGGCTG CAGCGCCGAG CACCGCCGCC TCTTCGACCC CGCGCGCTAC
AACGTGCTGC TGTTCGACCA GCGCGGCTGC GGGCGCTCCG CGCCGCACGC GAGCCTCGAG
AACAACACGA CATGGCATCT CGTCGACGAC ATCGAGCGAT TGCGCGAGAT GCTCGGCGTC
GAGCGCTGGC TCGTGTTCGG CGGCTCGTGG GGCAGCGCGC TCGCGCTCGC GTATGGCGAG
ACGCATCCGG CGCGCGTGAC CGAGCTCGTC GTGCGCGGCG TCTTCACGGT GCGCCGCTCA
GAGCTCCTCT GGTATTACCA GGAAGGCGCG TCGTGGCTGT TTCCGGACCT GTGGGAAGAC
TTCGTCGCGC CCATTGCGCC CGCCGAGCGC TCGGACCTGA TCGCCGCGTA TCGCCGCCGG
CTGACGGGCG GCGACGAAGC GGCGAAGCGC GAAGCGGCGC GCGCGTGGAG CATCTGGGAA
GGCCGGACGA TCACGCTGCT GCCGAATGCC GCGCACGAAG CGCATTTCGG CGACGCGCAT
TACGCGCTCG CGTTCGCCCG CATCGAAAAC CACTACTTCG TTCATCAAGG TTTCATGGAA
GACGGGCAGT TGCTGCGCGA CGCGCATCGT CTGGCGGACA TTCCAGGCGT GATCGTTCAG
GGGCGCTACG ACGTCGCGAC GCCCGCGCGC ACCGCGTGGG AGCTCGCGAA GGCGTGGCCG
CGCGCGTCGC TCGAGATCGT GCCGGACGCG GGCCACGCGT ACGACGAGCC GGGCATCCTG
CGCGCGCTGA TCGCGGCGAC CGACCGCTTC GCGCGCGAAC GCTGA
 
Protein sequence
MRGATSRRRA ARWCACRSSG SHTRGRRSGG DAAGTADGRP RPRRTRAPVF RRLGIMMRQS 
LLSTPCRIGA AGRVRRIARA ACARCPRAIA PAVFLHSTGV PLLYPPIEPY AHGLLDTGDG
HRVYWELCGN PDGKPAVFLH GGPGSGCSAE HRRLFDPARY NVLLFDQRGC GRSAPHASLE
NNTTWHLVDD IERLREMLGV ERWLVFGGSW GSALALAYGE THPARVTELV VRGVFTVRRS
ELLWYYQEGA SWLFPDLWED FVAPIAPAER SDLIAAYRRR LTGGDEAAKR EAARAWSIWE
GRTITLLPNA AHEAHFGDAH YALAFARIEN HYFVHQGFME DGQLLRDAHR LADIPGVIVQ
GRYDVATPAR TAWELAKAWP RASLEIVPDA GHAYDEPGIL RALIAATDRF ARER