Gene BTH_I2044 details

Gene Information

Locus tag: BTH_I2044
Symbol:
ID: 3847442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007651
Strand:
Start bp: 2315739
End bp: 2317823
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 70%
IMG OID: 637841713
Product: serine protease
Protein accession: YP_442568
Protein GI: 83718906
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
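
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2315739
end_bp = 2317823
gene_length_bp = 2085
protein_length_aa = 694


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2317823 - 2315739 + 1 = 2085 bp, and 2085 / 3 - 1 = 694 aa, which agrees with the listed lengths.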


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000458004
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATTGGGG AGGGCGGCGC CGGCTTTCGC GAGAGCCGGC GGCGATATTC GTTCACAGGT 
CGTCTGGCGG CCGGGGCGGG AGAGCGGCCT CGTTCGCGCC TGGGCAGCAG TCATCGCGGC
ACTCGTCCGC AAGCAGTTCG GCGGTTCGCT TTCATGGTGT TTCGACTGTA TTCAAACCGA
ATGAGGAACG ACAAAATGAC GTCAAGAAAA TGGGCCGGGC TTCGTGCTCC GCAAACGAAG
CACGCGATTT GCGCGGCGAC GCTTTTTGCC GCGACGACGT TGAGCGCGCA CGCGGCGGCG
CCGGCGTGGG TCGACACGCA GACCCGCGCT TATCCGGCAT TTCCGCAGCA GGCGCGTGCC
GCGTCGCAGG CTTCGGCAGC GGCGTCTGCG GCTGGAAAGG CGATCGACGC GGCGCCCGGC
GAGCCGGTGC GCGTTGTCGT CAGTCTCAAT CTCAACGACG AAGCAAAGCT CGATCGCTTC
CTGCTCGATC TGCATACCCC CGGCAGCGCC GCTTACGGCC GGCCCCTCAC GCCCGCCGAA
TTCACCGCGC GGCATGCGCC GACGCCTCAA CAGGTCGCGC TCGTCGAAGC GCATCTGCGC
CGGGCCGGGT TCCGCGACAT CGAGGTGTCG CCGAACCGGC TGCTGATCTC GGCGACGGGC
ACCGCGGCCG CGGTCAAGAC GGCGTTCAAC ACGCGGCTCA AGCGCTTCAC GCTCGAGGGC
CGGCGCGTCT ACGCGAACCA GGACGCGGCG CAGGTGCCCG CCGAGCTCGG CCGGATCGTC
GGCGCCGTGC TCGGGCTCGA CAACGCGACG CTCGCGCGCA CGTACAACCG CCAGGCGGCG
GTGACGGGCA CGGTCGGCGG CGCGAAGGCG TCGCTTGCCG CGCGCGCGAG CGACGCGACG
GCGGCCGCGA GCGGCACGCC CGTGCTGACG GGCCACGATC CGCTCGAATT CTCGCGAATC
TACCGCGCGG GCGCGACGCC GACGGCTTCA CTGACGACAG TCGGCGTGAT CATGGCGGGC
GACGCGGCAC CCGTGCTGCG GGATCTCGAC ACGTTCGCGG CGAAGGCGGG GCTCGCGCGC
GTCGCGGCGA CCGTCACACG CACCGGGCCG CCGGGCAGCG ACTACAACGA CAATTCGGGC
CTGAGCGAAT GGGATATGGA CAGCCAGGCG ATCGTCGGCG CGGCGGGCGG CGAAGTGAAG
GGAATCGTGT TCTACGCGGC GCCTTCGATG CTGCTCTCCG ACATCACCGA AGCGTACAAC
CGCGCAGTCG CGGACAATGT CGCGAAGGTG ATCAACGTGT CGCTCGGCGT GTGCGAGGCG
GATGCGCGCG CATCCGGCAC GCAGGCGGCG GATGACCGGA TCTTCAAGAG CGCGGTCGCG
CAGGGGCAGA CGTTCGTCGT CGCGGCGGGC GACGCGGGCG CGTACGAATG CAGCGTGAGC
CGCGTGTCGG GTGGCCAGGG CGTGCCGGCG CGCTCGAACT ACTCGGTCAG CGAGCCCGCG
ACGTCGCCGT ACGTCGTCGC GGTCGGCGGC ACGACGCTGT CGACCGACAA GACGACGCTC
GCGTATGCGG GCGAAGTCGC GTGGAACGAG GGCTTGCAGC CGATCGGCGT GTACGACGCG
TACGGCAGCT ACGACGGCAC GCAGCGTCTT TGGGCGACGG GCGGCGGTTA CAGCAAGAAC
GAAGCGGTGC CGGCGTGGCA GCGAAGCGTG CTCGGCGCGT CGGCGAGAAC GCGCGCGCTG
CCCGACGTCG CGTTCGATGC GGACGGCCGC AGCGGCGCGC ACGTCTATGT GAACGGCCGG
ACTGAGCAAT GGGGCGGCAC GAGCCTCGCG GCACCGATCT TCACGGGCAT CTGGGCGCGC
GTGCAATCCG ACAACGGCAA CCGGCTCGGC TTTCCGCTCG CGAGCCTCTA TCGCTACGTG
CCGTCAAACC GCGCGCTTGC GCGCGACGTG AAATCCGGCC ACAACGGTTC GGGCGGCTAC
GGCTACAAGG CGGGCGCGGG CTGGGACCCG GTGACGGGCT TCGGCAGCCT CGACGTCGCG
AACTTCGCCG CGTTCGTGAA GAAGACGGCC GATTTCGCGC GATAA
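
The GC content field can be recomputed from the gene sequence as printed here, in space-separated blocks of 10 bases. The sketch below is illustrative only; the helper name `gc_fraction` is hypothetical, and the page does not say how the database rounds its percentage.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above: 5 G, 0 C, 10 bases.
first_block = "TTGATTGGGG"
print(gc_fraction(first_block))  # 0.5
```

Passing the full 2085 bp sequence, with line breaks and spaces left in, gives a whole-gene value that can be compared against the listed GC content.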
 
Protein sequence
MIGEGGAGFR ESRRRYSFTG RLAAGAGERP RSRLGSSHRG TRPQAVRRFA FMVFRLYSNR 
MRNDKMTSRK WAGLRAPQTK HAICAATLFA ATTLSAHAAA PAWVDTQTRA YPAFPQQARA
ASQASAAASA AGKAIDAAPG EPVRVVVSLN LNDEAKLDRF LLDLHTPGSA AYGRPLTPAE
FTARHAPTPQ QVALVEAHLR RAGFRDIEVS PNRLLISATG TAAAVKTAFN TRLKRFTLEG
RRVYANQDAA QVPAELGRIV GAVLGLDNAT LARTYNRQAA VTGTVGGAKA SLAARASDAT
AAASGTPVLT GHDPLEFSRI YRAGATPTAS LTTVGVIMAG DAAPVLRDLD TFAAKAGLAR
VAATVTRTGP PGSDYNDNSG LSEWDMDSQA IVGAAGGEVK GIVFYAAPSM LLSDITEAYN
RAVADNVAKV INVSLGVCEA DARASGTQAA DDRIFKSAVA QGQTFVVAAG DAGAYECSVS
RVSGGQGVPA RSNYSVSEPA TSPYVVAVGG TTLSTDKTTL AYAGEVAWNE GLQPIGVYDA
YGSYDGTQRL WATGGGYSKN EAVPAWQRSV LGASARTRAL PDVAFDADGR SGAHVYVNGR
TEQWGGTSLA APIFTGIWAR VQSDNGNRLG FPLASLYRYV PSNRALARDV KSGHNGSGGY
GYKAGAGWDP VTGFGSLDVA NFAAFVKKTA DFAR
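
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases and the last 15 bases of the gene sequence. The codon table is built from the standard NCBI base order (TCAG); the set of table 11 start codons and the function name `translate_cds` are stated here as assumptions of the sketch, not as the method used by the database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq):
    """Translate a CDS; a recognized first codon is read as M, and
    translation stops at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        codon = bases[pos:pos + 3]
        if pos == 0 and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGATTGGGG AGGGCGGCGC CGGCTTTCGC"))  # MIGEGGAGFR
# Last 15 bases of the gene sequence, ending in the TAA stop codon.
print(translate_cds("GATTTCGCGC GATAA"))  # DFAR
```

The two outputs match the first ten and last four residues of the protein sequence listed above.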