Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0616 |
Symbol | |
ID | 3844659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 720487 |
End bp | 721527 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637837921 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_438816 |
Protein GI | 83717849 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGTTC TCGGCATCGA AAGCTCCTGC GACGAAACCG GCCTCGCGCT CTACGACACC GGGCGCGGCC TGCTCGCGCA CGCGCTTCAC TCGCAGATTG CGATGCACCG CGAATACGGC GGCGTCGTCC CCGAGCTCGC GTCGCGCGAC CACATTCGCC GCGCGCTGCC GCTGCTCGAG GAAGTGCTCG CCGCTGGCGG CGCACGCCGC GAGGACATCG ACGCGATCGC GTTCACGCAA GGGCCCGGCC TCGCGGGCGC GCTGCTCGTC GGCGCGAGCA TCGCGAACGC GCTCGCGTTC GCGTGGGACA AGCCGACCAT CGGCATCCAT CACCTCGAAG GGCATCTGCT GTCGCCGCTC CTCGTTGCCG AGCCGCCGCC GTTTCCGTTC GTCGCGCTGC TCGTGTCGGG CGGCCACACG CAGTTGATGC GCGTGACCGA CGTCGGCGTC TACGAGACGC TCGGCGAAAC GCTCGACGAC GCCGCCGGCG AAGCGTTCGA CAAGACCGCG AAGCTGCTCG GCCTTGGCTA TCCCGGCGGA CCGGAAGTGT CGAAGCTCGC CGAAGCCGGC ACGCCGGGCG CGGTCGTGCT GCCGCGGCCG ATGCTCCATT CGGGGGATCT CGACTTCAGC TTCAGCGGGC TGAAGACCGC CGTGCTCACG CAAATGAAGA AGCTCGAAGC CGCGCACGCG AGCGGCGCCG AGCTCGATCG CGCGAAGGCC GATCTCGCGC GCGGCTTCGT CGACGCGGCC GTCGACGTGC TCGTCGCGAA GTCGCTCGCC GCGCTGAAGA AGACGCGGCT CAAGCGGCTC GTCGTCGCGG GCGGCGTCGG CGCGAACCGG CAGTTGCGCG CGGCGCTGTC GGCCGCCGCC GGGAAGCGCG GCTTCGACGT CCATTATCCC GACCTCGCGC TCTGCACCGA CAACGGCGCG ATGATCGCGC TCGCGGGCGC GCTGCGGCTC GCGCGCTGGC CGTCGCAGGC GAGCCGCGAC TACGCATTCA CGGTGAAACC GCGCTGGGAC CTCGCGTCGC TCGCGCGATA G
|
Protein sequence | MLVLGIESSC DETGLALYDT GRGLLAHALH SQIAMHREYG GVVPELASRD HIRRALPLLE EVLAAGGARR EDIDAIAFTQ GPGLAGALLV GASIANALAF AWDKPTIGIH HLEGHLLSPL LVAEPPPFPF VALLVSGGHT QLMRVTDVGV YETLGETLDD AAGEAFDKTA KLLGLGYPGG PEVSKLAEAG TPGAVVLPRP MLHSGDLDFS FSGLKTAVLT QMKKLEAAHA SGAELDRAKA DLARGFVDAA VDVLVAKSLA ALKKTRLKRL VVAGGVGANR QLRAALSAAA GKRGFDVHYP DLALCTDNGA MIALAGALRL ARWPSQASRD YAFTVKPRWD LASLAR
|
| |