Gene Information |
Locus tag | BTH_II0798 |
Symbol | |
ID | 3846685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 935084 |
End bp | 936649 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637838101 |
Product | hypothetical protein |
Protein accession | YP_438995 |
Protein GI | 83716905 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
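The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 935084
end_bp = 936649

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is assumed
# to be a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1566 0 521
```

Under these assumptions the result agrees with the listed gene length (1566 bp) and protein length (521 aa).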
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.123535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTCT GGAATCTGTA TTTCATTCTG AAGCTCTATC TGTTCGCGGC GGGTCATCTG AAGCCGCTGT GGATCGCGAA CCTCGGTTTC GCGCTGGCGC TTGCGCTGAG CTCGCCGGTG AGGCGGCGCG GCCTGCGGCT GCTGCGCCTC GCGCTCGCGC TCGCGATCGG CGTGCCGCTG ATGTACCGCG AAGCGGACGT GCCGCCGTTC GCGCGGCTCG TCGAGACGCT CGGCAGCCTG CGCTCGTTCA GCGCCGGCTA CTGGATGGAG CTCGTGCCGC GCTTCGTGCC GCCGACGCTC GCTTGGATCG CGCTCGGCGT CGTGATCGGC TATCTGATCG TCAATCGCTG GCTGCGCGTC TCGACGTTCG TGCTGCTCGC GCTGATCGCG CTGCCCGTGT GGCAGGCGGG CAGCGCGGCG CTCGCGCGCC TCGACGCGGC GGCCGTGGCC GGCGTGGCCG TGCCCGGGCC GCTCGGAACG GGGCCCGCCG CGCAGCCGCA AGACCACAAC GCGGCGCTCG CCGCGTTCCG CTCGCAGGAA TCGCAGCGGC AGGTGACGTT CGGCCGCCCG AGCGCCGATC CGGCGACGCA GTTCGACGTG ATCGTGCTGC ATGTGTGCTC GCTGTCGTGG GACGACCTCG ATGTCGCGAA GCTGCGCAAT CATCCGCTGC TCGGCCATTT CGACTATCTG TTCACGAACT TCAGCACGGC GGCGAGCTAT AGCGGCCCGG CCGCGATCCG CGTGCTGCGC GCGAGCTGCG GGCAGGAGGC GCATGCGGAC CTGTACAGGC CCGCGCCCGC GCAGTGTCAT CTGTTCGGGC AGCTCGCGGC CGCCGGTTTC GCGCCGCAGA CGCTGCTCAA TCACGACGGC CACTTCGACA ACTTCGTCCA GTTGATCCGC GACAACATCG GCGTGCCGAA CGCGCCGATG ATCTCGAATG CGGACGCGTC GGTCGCGATG CATGCGTTCG ACGGCTCGGC GATCAAGGAC GACTACGCGA CGCTCGCGAA CTGGTACGCG AAGCGCGGCG CGTCGGCGGG CCCGGTTGCG CTGTACTACA ACACGATCAG CCTGCACGAC GGCAACCAGT TGCCGAGCGG CCGGATGTCG AGCCTCGATT CGTATCCGCT GCGCGCGCGC AAGCTGATGG ACGACTTCGA CCGCTTCGCC GACCTGATCG CGTCGTCGGG ACGGCGGGCG GTGATCGTGT TCGTGCCCGA GCACGGTGCG GCGCTGCGCG GCGACGCGAA GCAGATCGCG GGGCTGCGCG AGATCCCGAC GCCGCGGATC GTGCACGGGC CGGTCGGCGT GCGGCTCGTG GGCTTCAAGG GCGACCACGG CGCGACGACC GTGATCGACG CGCCGACGAG CTTCCTCGCG CTCGCGCAGT TGCTGTCGAA TCTCGTGTCG AACAGCCCAT TCAAGCCGGG CGTGACGCTG TCGCAATACG CGGCCGATCT GCCGCAGACG CGGATGATCG GCGAGAACGA AGGCACGGTG ACGATGACGA CGCCGACGGG TTACGCGGTA AAGACGCCGG ACGGCGTATG GATCGACGAA AAATGA
|
Protein sequence | MTFWNLYFIL KLYLFAAGHL KPLWIANLGF ALALALSSPV RRRGLRLLRL ALALAIGVPL MYREADVPPF ARLVETLGSL RSFSAGYWME LVPRFVPPTL AWIALGVVIG YLIVNRWLRV STFVLLALIA LPVWQAGSAA LARLDAAAVA GVAVPGPLGT GPAAQPQDHN AALAAFRSQE SQRQVTFGRP SADPATQFDV IVLHVCSLSW DDLDVAKLRN HPLLGHFDYL FTNFSTAASY SGPAAIRVLR ASCGQEAHAD LYRPAPAQCH LFGQLAAAGF APQTLLNHDG HFDNFVQLIR DNIGVPNAPM ISNADASVAM HAFDGSAIKD DYATLANWYA KRGASAGPVA LYYNTISLHD GNQLPSGRMS SLDSYPLRAR KLMDDFDRFA DLIASSGRRA VIVFVPEHGA ALRGDAKQIA GLREIPTPRI VHGPVGVRLV GFKGDHGATT VIDAPTSFLA LAQLLSNLVS NSPFKPGVTL SQYAADLPQT RMIGENEGTV TMTTPTGYAV KTPDGVWIDE K
|
| |
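The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, which translation table 11 shares for codon-to-amino-acid assignments (the two differ in which start codons are permitted, which this sketch ignores). The sequence is stored in space-separated blocks of ten bases, so whitespace is removed first.

```python
# Standard codon table, built in TCAG order. Assumption: table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = "ATGACGTTCT GGAATCTGTA TTTCATTCTG"
print(translate(prefix))  # MTFWNLYFIL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.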