Gene Information |
Locus tag | BTH_I1711 |
Symbol | |
ID | 3847115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1921486 |
End bp | 1922487 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841380 |
Product | peptidase, U7 family protein |
Protein accession | YP_442246 |
Protein GI | 83718579 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
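The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1921486
end_bp = 1922487
stop_codon_len = 3  # assumption: one terminal stop codon

# With 1-based inclusive coordinates the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Every codon except the terminal stop encodes one residue.
protein_length = (gene_length - stop_codon_len) // 3

print(gene_length, protein_length)  # 1002 333
```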
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.354653 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGATC AAATCAACCC GCCCGATTCG TCTTCGTCCC CCGCTGCGAA CGCCGCCTCC CGAGAACCGA ACTGGGAGCG CGCCGTGCTC GAGCGCGTCG CGCTCGCGGC GATCAAGGAG CAGCGCGCCG CGCGACGCTG GAGGATCTTC TTCCGCTTCG CGTTCCTCGC CGTGCTCGGC GCGCTCGCGT TCGCGTTCCT CAGCGTGTCC GGCGACGGCA GCAAGCTCGC GAGCGGCCGT CACACGGCGG TCGTGACGAT CGACGGCGAG ATCGCCGCGA GCACCAACGC GAACGCCGAG GACATCAACA CGGCGCTCGA CAGCGCGTTC GAGGATTCGG GCACGGTCGG CGTCGTGCTG AAGATCAACA GCCCGGGCGG CAGCCCGGTG CAGGCGGGCA TCGTCTACGA CGAGATCCGC AGGCTGCGCA AGAAGTATCC GGCGAAGCCG CTTTACGTCG TCGTCTCCGA CATGTGCGCG TCGGGCGGCT ATTACATCGC GGCGGCGGCG GACAAGATCT ACGTCGACAA GGCGAGCATC GTCGGCTCGA TCGGCGTGCT GATGGACGGC TTCGGCTTCA CCGGCCTGAT GGACAAGCTG GGCGTCGAGC GGCGCCTGCA CACGTCGGGC GAGAACAAGG GCTTCTTCGA TCCGTTCTCG CCGGAGACGC CGAAGATGGA CGCGCACGCG CAGGAAATGC TCGACGAGAT CCATGCGCAG TTCATCAAGG CGGTGAAGGA CGGCCGCGGC GCGCGGCTGC ACGAATCGCC GGACATCTTT TCCGGACTCT TCTGGACGGG CGCGAAGAGC ATCGAGCTCG GCCTCGCCGA CGATTACGGG ACGACCGATT CTGTCGCGCG CGACGTGCTG AAGGCGCCGG ATCTCGTCGA CTACACGGTC AAGGAAAGCC TGACGGACCG CGTCGCGCGC CGCTTCGGCG CGGCGGTCGG CAAGGCCGCG CTGAAGGCGG CCGTCGCCGG CGCCGAGCTG AAGCTGCGCT GA
|
Protein sequence | MSDQINPPDS SSSPAANAAS REPNWERAVL ERVALAAIKE QRAARRWRIF FRFAFLAVLG ALAFAFLSVS GDGSKLASGR HTAVVTIDGE IAASTNANAE DINTALDSAF EDSGTVGVVL KINSPGGSPV QAGIVYDEIR RLRKKYPAKP LYVVVSDMCA SGGYYIAAAA DKIYVDKASI VGSIGVLMDG FGFTGLMDKL GVERRLHTSG ENKGFFDPFS PETPKMDAHA QEMLDEIHAQ FIKAVKDGRG ARLHESPDIF SGLFWTGAKS IELGLADDYG TTDSVARDVL KAPDLVDYTV KESLTDRVAR RFGAAVGKAA LKAAVAGAEL KLR
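The GC content and protein sequence reported above can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline: the helper names `clean`, `gc_content` and `translate` are hypothetical, and only the amino acid assignments of translation table 11 are implemented (they are the same as in the standard code). Alternative start codons, which table 11 reads as Met at the first position, are not handled; that is sufficient here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGTCCGATC AAATCAACCC GCCCGATTCG"
print(translate(fragment))  # MSDQINPPDS
```

Passing the full gene sequence to `gc_content` and `translate` in the same way allows a comparison with the GC content and protein sequence fields of the record.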