Gene Information |
Locus tag | BTH_I0096 |
Symbol | |
ID | 3849225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 102222 |
End bp | 104012 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637839769 |
Product | hypothetical protein |
Protein accession | YP_440656 |
Protein GI | 83721360 |
COG category | |
COG ID | |
TIGRFAM ID | |
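The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(102222, 104012)
print(length)                     # 1791
print(protein_length_aa(length))  # 596
```

Both values match the Gene Length and Protein Length rows of the record.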
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000382235 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCGAAC AACGGTACGC GCCGCATCAC ATCGCAGACC TCATCACAGT GCGCAATGTC GTCACACAGA TGAAGGAAAT GGCGGCAATG AGCCGTTATC GCCCACCCTT CACGCTCGTA CAACGGCCTG TGCCCCGTAT TCAGTTAGGA GCTCACGCTA GCCCCATCCT TGCACTGCTA GGACTGCGTG AACCGGCCAG ACAGATCAAG GCCCGCGCCG GTGAGGTGCA TGAATTTCAC CCGTATTTCG AAGCGTTTCT CGAAGTGGCC GAAATTGGGC TGCGCTATGA AGACGCGACC GGTGAAATGC GTACACTCGA ATACGATGAA TTCGACGGGG CAAGTTTCAG CAATCACCCC AAAGTGGCAC TGTACTGCGC TACCTTGAAC GATTTTTTTG TCCGGCTTGG AAAACGACTA GACAAAAAAT CCCGGGCGGC CGCCAAGGCA TTTGAGCGCA CGCCTTCGGA CAACCGCCAG CACGCAATGC GGTACGCGGC CCAGCTAATT GACCGCGCAC CTAGAACGCG CATCGTTCAC GTCACAATCC GTCGGCATCT CGATCTCTAC GGCAACCCAG TGTCGCAGGA TGAAATACGC GCATGCCGGC AAATGCTGAG CGATTATCTG GACGAGCCAA GCCCCAAACG CTCGCATCTC GGTCACTGCT CTTTTCTCAG AAGGTTTTCA GACGTCGGCT ATTGCCTCGA TGCTTTTGTT TTCCTGAGTG ATTCATTTCT CAAGCCCGCG GCAGACATTG CGCGAGATCT GACGGAATGG TGGGAAGCGT TCGCGCCAAA ATCCACAGCC TGTGTGACCC GCGTATTGCC CCCCGACAAG CTTGTAAAGG AGACGCTGGA CAGGATGACG CTCGTGACAG AACCGGATTT CTACGTCCGC CCAGCCCGTC TCAGTGGTGC GCCGAAAAAG CTCCGTCACT TCTGGTGCAC CCAGTTCCCG GTGAACCTGC GCGCGGCGCG CCGCAGAGGC AAAGCCGGTT CGTACCGGCC GACCGCAGAC CAGTCCCACA CCGTAGATCC GCTGCTTGAA GCGCTCCGGG AGGAAGAAAT AGAAGATAAC GGCGTTCAGA AGGAGCTGCG CTGGCGCCAG GCGCGGGAGA AAGAGCAGGC ACGACGTTCC GCGAGCCACA AGAAGGGAGC TCGCACGCGC GCGAAGAACC GGAAGACAGC GGAGCAGGAG CGCGCCTGGC AAAGCGCTCG TTTTGAGGCG TGCAAGAGCC GCATCATGTA CGACGATGAG ATCATCGATG TAGAGGCGGC CGAGGTAATT GACGTCGCTC AGGCGTTGCG CCGACATTCA CGCCGACGCA CGGTCCTCCC GTCAGCTAGC GCAACGCCGC GCCACGCCGA CAACCTGGCG CCCTGTTCGC CGGCGGCAGG CATCACGCCC AACGCACCGG AAATGGCTGC GGCGAGCTCG GATCGGGGAG TTGCAGCAGG GTCGCTTACC GAACCGCGCA TTTTCGACGA ACAGCCTGGT GTAATCACCC CTGCACCCGG TCTCGGCGAG ACTGAAGGCG CGACAGCCGC AACGTCCGGG CCCGCAACCA ATGCCGGACC AAAGACTCGC CGCAGGACTT CGGAAAAACA GGAGCGCGAC AAGCACGGGA GACTCCGGAC CATACAGGTT GAGGTCCGGA GGGCATCGCC CCTTACCAGG AGCAGTACGG CGATGAAAAA GGCGCCGAAC AATGCCGAAG CGAACACGCC CAGCACAGAG AGCGGTTTCG ACGCATCCAG CCCGCACACG ACTTCAGACC CGACGGAGTA G
|
Protein sequence | MAEQRYAPHH IADLITVRNV VTQMKEMAAM SRYRPPFTLV QRPVPRIQLG AHASPILALL GLREPARQIK ARAGEVHEFH PYFEAFLEVA EIGLRYEDAT GEMRTLEYDE FDGASFSNHP KVALYCATLN DFFVRLGKRL DKKSRAAAKA FERTPSDNRQ HAMRYAAQLI DRAPRTRIVH VTIRRHLDLY GNPVSQDEIR ACRQMLSDYL DEPSPKRSHL GHCSFLRRFS DVGYCLDAFV FLSDSFLKPA ADIARDLTEW WEAFAPKSTA CVTRVLPPDK LVKETLDRMT LVTEPDFYVR PARLSGAPKK LRHFWCTQFP VNLRAARRRG KAGSYRPTAD QSHTVDPLLE ALREEEIEDN GVQKELRWRQ AREKEQARRS ASHKKGARTR AKNRKTAEQE RAWQSARFEA CKSRIMYDDE IIDVEAAEVI DVAQALRRHS RRRTVLPSAS ATPRHADNLA PCSPAAGITP NAPEMAAASS DRGVAAGSLT EPRIFDEQPG VITPAPGLGE TEGATAATSG PATNAGPKTR RRTSEKQERD KHGRLRTIQV EVRRASPLTR SSTAMKKAPN NAEANTPSTE SGFDASSPHT TSDPTE
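The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The sketch below is a minimal illustration of that rule, not the database's own pipeline: it assumes the sequence is already given in coding orientation (as it appears to be here, despite the gene lying on the minus strand), and `START_CODONS` lists only a subset of the table 11 initiators.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G);
# table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Subset of table 11 start codons (assumption: sufficient for this example).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks in the record) is ignored.
    If `initiator` is true, a recognised start codon at position 0 is
    read as methionine.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and initiator and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("GTGGCCGAAC AACGGTACGC GCCGCATCAC"))  # MAEQRYAPHH
# Last 15 bases, which are in frame and end with the TAG stop codon.
print(translate("GACCCGACGGAGTAG", initiator=False))  # DPTE
```

The two outputs agree with the first ten and last four residues of the protein sequence in the record.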