Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2153 |
Symbol | |
ID | 3847080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2431738 |
End bp | 2432757 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841822 |
Product | hypothetical protein |
Protein accession | YP_442674 |
Protein GI | 83718544 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.692536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTAC TGAAGAAGTG CGCCGCGCTC GCGGCGCTCG CCGTCGTATT GCTGGGCGCC GCGTGCGCGG GCGGCGCCTA TTATTGGGCC ACCCGGCCGC TCGCGCTCGC CGCGCCGACC CTCGATGTCA CGATCAAGCC CCGCAGCAGC GTGCGCAGCG TCGCGCAGCA GCTCATGCAC GGCGGCGTGC CTGTCGAGCC GCGCCTGTTC GTCGCGATGA CGCGCGCGCT GTTCCTGTCG AGCCGCCTCA AGTCAGGCAA CTACGAGTTC AAGACGGGCG TGACGCCTTA CGACGTGCTG CAGAAGGTTG CGCGCGGGGA CGTCAACGAA TACGTCGTGA CCGTGATCGA GGGCTGGACG TTCAGGCGCA TGCGCGCGGA GCTCGACGCG AATGCGGCGC TCACGCATTC GAGCGCGGGG ATGAGCGACG CGGCGCTGTT GCGCGCAATC GGCGCGTCCG ACGAGGCTGT CGCGCGCGGC GCGGGCGAGG GGCTGTTCTT TCCGGATACC TACCTGTTCG ACAAGGGCAC GAGCGATCTG AACGTCTATC GGCGCGCATA CCGGCTGATG CAGACGCGCC TTGCCGACGC GTGGACCGCG CGCCGGCCCG GCCTGCCGTT CAAGACGCCT TACGAGGCGC TGACGGTCGC GTCGCTCATC GAAAAGGAGA CGGGGCACGC GGCCGATCGC GCGTTCGTGT CGGGCGTGTT CGCGAACCGC CTGCGGGCCG GGATGCCGCT GCAGACCGAT CCTTCGGTGA TCTACGGAAT GGGCGACGCG TACGCGGGGC GGTTGCGCAA GCACGATCTG CAGACCGACA CTCCGTACAA TACCTACACG CGCCGCGGGC TGCCCCCGAC GCCGATCGCG CTGCCGGGCG AGGCGGCGCT CTACGCCGCA GTGAACCCGG CGGCGACGTC CGCGCTCTAT TTCGTCGCGA AGGGCGACGG CACGAGCGTC TTTTCCGACA CGCTCGGGGA TCACAACAAG GCCGTGGACA AATACATACG AGGTCAATGA
|
Protein sequence | MSLLKKCAAL AALAVVLLGA ACAGGAYYWA TRPLALAAPT LDVTIKPRSS VRSVAQQLMH GGVPVEPRLF VAMTRALFLS SRLKSGNYEF KTGVTPYDVL QKVARGDVNE YVVTVIEGWT FRRMRAELDA NAALTHSSAG MSDAALLRAI GASDEAVARG AGEGLFFPDT YLFDKGTSDL NVYRRAYRLM QTRLADAWTA RRPGLPFKTP YEALTVASLI EKETGHAADR AFVSGVFANR LRAGMPLQTD PSVIYGMGDA YAGRLRKHDL QTDTPYNTYT RRGLPPTPIA LPGEAALYAA VNPAATSALY FVAKGDGTSV FSDTLGDHNK AVDKYIRGQ
|
| |