Gene Information |
Locus tag | BTH_I1519 |
Symbol | |
ID | 3848991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 1719555 |
End bp | 1720775 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841191 |
Product | serine protease |
Protein accession | YP_442064 |
Protein GI | 83721436 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
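The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon; neither assumption is stated in the record. The function names are illustrative and do not come from any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1719555
END_BP = 1720775
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the listed gene length (1221 bp) and protein length (406 aa) agree with the coordinates.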
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000201764 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA AATCCCTGCT GTCCATCCTG ACAACGGCCG CCTGCATTCA GGCATTCGCG GCGACGGCCT CGCTCGCGCA AGGCTCCGCG CATCCGCCGT CGTACGTCGA AGGCACTCGC GCGCCGAAAG GCTTCGCGCG ACCTCCGTTC CACACGAATC CGGCGCGCTT CTCGGCAACG ACCGCCTCGG GCCAGTCGCC CGCCGCCGTG CGGCACGCGT ACGGCTTCGA CTCGATCGCG AACCAGGGCG ACGGCATGAT CGTCGCGATC GTCGACGCAT ACGACGATCC GAAGATCGAA TCCGATCTCG GCGTCTTCAG CAAGCATTTC TCGCTGCCGC CCTGCACGAC GTCGAACGGC TGCTTCAAGA AGCTCTACGC GAACGGCAAG AAGCCGCGCG CCGACGCCGG CTGGTCGCTC GAGATGTCGC TCGACGTCGA ATGGGTGCAC GCGATCGCGC CGAAGGCGAA GATCGTGCTC GTCGAAGCGG CGTCGAACAG CTTCAACGAT CTGATGACCG CGGTGGACGC GGCGGTCGGC GCCGGCGCCT CCATCGTGTC GATGAGCTTC GGCGGCAGCG AATTCAGCTC GGAAACCGGC TTCGACAGCC ACTTCAGCGC GCCGTCGCAT GTCACGTTCG TCGCATCGTC CGGCGACAGC GGCAACGGCA CCGAATATCC GGCCGCGTCG CCGTACGTCG TCGCGGTCGG GGGCACGACG CTCGCCGTCG ACGCATCCGG CAACTACATC GGCGAAACCG CGTGGAGCGG CAGCGGCGGC GGCGTCAGCA CGTACGAACC GGAGCCGTCG GGCCAGGCGC TGTGGCCGAT TCCGTACGCC GGCAGCCGCG GCGTGCCCGA CGTCGCGTAC GACGCGAATC CGAGCTCGGG CTTCGCCGTG TACGACTCCG TCACCTATCA AGGGCAATCG GGCTGGTTCG TCGTCGGCGG CACGAGCGCG GGCGCGCCGC AGTGGGCGGC CCTCTTCGCG ATCGCGAACT CGATGCGCGC CGCGGCCGGC AAGGCGACGC TCGCCGGCCC GTACAACCAG CTCTATACGG TCGGCAAGCT GGCGTACGGC AGCGACTATC ACGACATCAC GTCGGGCACC AATGGCGGTT GCGGGACGAT CTGCACCGCG AGCGGCAGCT ACGACTATGT GACGGGATTG GGCTCGCCGC AGGCGCTCAA CCTGGTTCAG GCGCTCGTCG CGCAGCCCTG A
|
Protein sequence | MKNKSLLSIL TTAACIQAFA ATASLAQGSA HPPSYVEGTR APKGFARPPF HTNPARFSAT TASGQSPAAV RHAYGFDSIA NQGDGMIVAI VDAYDDPKIE SDLGVFSKHF SLPPCTTSNG CFKKLYANGK KPRADAGWSL EMSLDVEWVH AIAPKAKIVL VEAASNSFND LMTAVDAAVG AGASIVSMSF GGSEFSSETG FDSHFSAPSH VTFVASSGDS GNGTEYPAAS PYVVAVGGTT LAVDASGNYI GETAWSGSGG GVSTYEPEPS GQALWPIPYA GSRGVPDVAY DANPSSGFAV YDSVTYQGQS GWFVVGGTSA GAPQWAALFA IANSMRAAAG KATLAGPYNQ LYTVGKLAYG SDYHDITSGT NGGCGTICTA SGSYDYVTGL GSPQALNLVQ ALVAQP
|
| |
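The gene sequence above is printed in space-separated blocks of ten bases, so the spaces must be removed before any analysis. The following is a minimal sketch of how the record's GC content and protein sequence could be recomputed from it. It is not the database's own pipeline. It assumes the gene sequence is given 5' to 3' in coding orientation (it begins with ATG although the gene is on the minus strand) and uses the standard codon assignments, which translation table 11 shares for internal codons. Special handling of alternative start codons is omitted, and all function names are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAAAAACA AATCCCTGCT GTCCATCCTG"))  # MKNKSLLSIL
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields; the example deliberately uses only the first 30 bases, whose translation matches the start of the listed protein.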