Gene Information |
Locus tag | BTH_I0517 |
Symbol | |
ID | 3847084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 581786 |
End bp | 584056 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637840190 |
Product | hypothetical protein |
Protein accession | YP_441074 |
Protein GI | 83718548 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
| |
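The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length; the function names are illustrative and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete, unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(581786, 584056)
print(length)                     # 2271
print(protein_length_aa(length))  # 756
```

Both values match the Gene Length and Protein Length fields of this record.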
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.11147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGACG ACTTCTCGCG CGTGCTGCGT TTCAAGCCGC ATCTGCTGGT GCGCGACACC GGCTCCGGCG CGTTGTATGT GGTGGACGAA TTCAGACGCT CGGTGTTGCC GGGCGACGTG TTTCCCGCGC TCGCCGCGTG CATGCGCGAT CGCCTGACGA TCGCGCAGAC GTTCGCGGCG CTCGCCGCGC GGTTTTCTCA ATGGGAGGTG CTCGCCGCGC TCGATCACCT GGTGCGGCGC GGTTACGTGC GCGCCGACGC GCCCGGCGAG CGCGACGCGG CGATCGGCTT CTATGAGCGC ACGGGCGTCG ACGGCGATGC GGCGAGCGAC ATCGCGTCGC GCCTCGCGGT CGCCGTCGAC GCGTTCGGCG TGGACCCGCG CGCGCAGCTC GACGCATTCG CCGCGTGCGG CATCGGCGTC GCGCCCGACG CGCCGCTGAC GGTCGCGCTC GCGGACGGCT ACGAGCGCGC CGAATTGATC GACGCCGCCG AACGCGCGGC GGCGCGCGGC GACGCACTGC TCGTCGTCGT TCCCGATCGC GTGGAGCCGC TGCTGGGCCC GCTGCTCGGC CCGCCGGCCG GCGCGCGGTC GGCATCGGCC TCGACGGAGG AGGCGGGCGC CGACGCGCCG CCCTGCATCG AATGCGTGCG CTACTGGACC GCGCTGAACC GCCCGGTCGA GACGCTGCTC GCGCGCCTGC ACGGCAGCGA CGCGGCCCGC TTGCCGGGCG CGCATAGCCG TGCGAGCGAA GCCGCCGTCG CGGCCGTCGT CGCGTCGTTC GTCGAGCAGA TCGCGGTGAA CGCGCAGCGC CGCCGCCATG CGAGCGCGCA CATCGTGTCG CTGCGTGTCG ACACGCTCGC CACCGCCGCG CATCGCGTCG TCAGGCGGCC GCAATGTCCG CGCTGCGGCA ACGCGCGATG GATGCGCGAG CAGGCCGAGC GCCCGGTGAC GCTCGCATCG GCGGATGCGG GCGCGCGCCG CGAGGGCGGC TATCGGACAC TCGCCGCCGC CGAGCTCGTC AAACGCTACG GACATCTGAT TTCCCCGGTG AGCGGCCCGA TCGCCTATCT GCATCCGATG CCCGGGCGCA ACGCCGGCCT GCGGCACGTC TACGTCGCCG GCTACCTGGT GTGTCCGCCG AGCGCGCCGC GCGAGAACCG TTTCGACAAG CTGTGCTCGG GCAAGGGCGC GAGCGACGCG CAGGCGCGCG CGAGCGCGCT CGCCGAGGCG CTCGAACGCT TCAGCGGCGT TCATCAGGGC GACGAGGCGA CGCTGCGCGG CAGCCTCGCG GAGCTGTCCG CGCACGCGCC GCCGGGTGGC GGCCCGATCG ACGTCAACGC GCTGCAGCAA TACAGCGATC GCCAGTTCGA GCGGCGCGAG CGCCACAACG CGACGACCGA CGATCCGCGC AAGCAGGTGC CGCAGCGCTT CACGCGCGAC AGCGTGATCG ACTGGACGCC CGCATGGTCG ATCGCGACGG GCGCGCGGCG GCTCGTGCCG CTTGCCTACT GCTATGCGGA AACGCCGGCC GCGAGCGGCG CCGCCTATTG CGTGCACAAC CCGAACGGCT GCGCGGCGGG CGCGTGCATC GAGGAGGCGA TCCTGCAGGG CCTGCTCGAA CTCGTCGAGC GCGATGCCGT CGCGATCTGG TGGTACAACA TGCTGCGCCG GCCGGCCGTC GACATCGAGA GCTTCGGCGA TCCGTACTTC GATGCGCTCG TCGCCGACTA TGCGTCGCTC GGCTGGCGCC TGTGGGCGCT CGACATCACG CACGACCTGC GCATGCCGGT ATTCGTCGCG 
CTCGCGCGCG AAACGGCGAC AGGGCGTTTC TCGATCGGCT TCGGCTGTCA TCTCGACAGC CGGATCGCGT TGCAGCGCGC GCTCACCGAA GTGAACCAAC TGCTCGACGT CGGCGCGTCG GCGCCGCCGC CGTGGGACGC CGACAAGCTG CCGGACGACG CGTTCCTCCA TCCCGATCCC GCGCTGCCGC CGACGCGCGG GCCGTCGCGG GCGTCGCACG GCGCGTGCGA TCTGAAGGGC GACATCGAGA ATGGCGTCGC GCGCTTGTCC GCGGCGGGCA TCGATACGCT CGTCGTCGAC AAGACGCGGC CCGACATCGG CCTGCCGGTC GTGCAGGTGA TCGCGCCGGG CCTGTGTCAC TTCTGGCCGC GCTTCGGCGC GCCGCGGCTG TATTCGGTGC CGGTTGCCGA GCGCTGGTGC GAGCGGCCGC GCGACGAGGA CGAGCTCAAT CGCGCGCTGC TGTTCCTGTA G
|
Protein sequence | MLDDFSRVLR FKPHLLVRDT GSGALYVVDE FRRSVLPGDV FPALAACMRD RLTIAQTFAA LAARFSQWEV LAALDHLVRR GYVRADAPGE RDAAIGFYER TGVDGDAASD IASRLAVAVD AFGVDPRAQL DAFAACGIGV APDAPLTVAL ADGYERAELI DAAERAAARG DALLVVVPDR VEPLLGPLLG PPAGARSASA STEEAGADAP PCIECVRYWT ALNRPVETLL ARLHGSDAAR LPGAHSRASE AAVAAVVASF VEQIAVNAQR RRHASAHIVS LRVDTLATAA HRVVRRPQCP RCGNARWMRE QAERPVTLAS ADAGARREGG YRTLAAAELV KRYGHLISPV SGPIAYLHPM PGRNAGLRHV YVAGYLVCPP SAPRENRFDK LCSGKGASDA QARASALAEA LERFSGVHQG DEATLRGSLA ELSAHAPPGG GPIDVNALQQ YSDRQFERRE RHNATTDDPR KQVPQRFTRD SVIDWTPAWS IATGARRLVP LAYCYAETPA ASGAAYCVHN PNGCAAGACI EEAILQGLLE LVERDAVAIW WYNMLRRPAV DIESFGDPYF DALVADYASL GWRLWALDIT HDLRMPVFVA LARETATGRF SIGFGCHLDS RIALQRALTE VNQLLDVGAS APPPWDADKL PDDAFLHPDP ALPPTRGPSR ASHGACDLKG DIENGVARLS AAGIDTLVVD KTRPDIGLPV VQVIAPGLCH FWPRFGAPRL YSVPVAERWC ERPRDEDELN RALLFL
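The GC content and protein sequence fields can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as it appears in this record (blocks of 10 bases separated by whitespace) and contains only A, C, G and T. The codon table is built from the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs only in the start codons it permits, which does not matter here because this gene starts with ATG. All helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# shares these assignments). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCTCGACG ACTTCTCGCG CGTGCTGCGT"
print(translate(prefix))  # MLDDFSRVLR
print(gc_percent(prefix))
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields above.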