Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0730 |
Symbol | |
ID | 3845764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 858567 |
End bp | 860987 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637838033 |
Product | lectin repeat-containing protein |
Protein accession | YP_438927 |
Protein GI | 83717900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.584993 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTTCT TTCGATTCGC GTGGCGGCAC GCGCAATATG GGTTTTGCCT GTTGATCGCC GGCATGTTGC TGCCGCAAGG CGCGGCGGCG GCCGTCACGG CATCCTATTC GGACATCGTG GGCTCGGACA GCGGGCTGTG CGTGTCGACG GCCGGCAACT CGAGCGCATC GGGCGCGGGT ATCGTGCTGA TGTCGTGCGC GGGCCTGAGC AACACGACAT GGTCGTTCGT TCCCGTCGGC AACCGCTATC ACATCGTGCT GCAGGGCAGC GGCCTGTGCT TGAACGTGCC CGGCGGCTCG CCGAACAGCG GCACGCAACT GATCCAGTAT GCGTGCCAGG ACAGCAGCCT GACCAACGAT CAATGGACGA TCGTCGCGCT CGGGTCGAGC TACCGGATCG TGTCCGCGTC GAGCGGCATG TGCGTGAACG TGAGCGGCGG ATCGCGCGCG AGCGGCGCGG CGCTGATCCA GTATCCGTGC CAGAGCGCGA GCGCGCTCAA CGATCAGTTC AACCTGTACC TGCCTGTCGT TGCGTTCACC AACGTCGCGG CGGCGAACAG CAATCTCTGC TTGAGCGTCA ATGGCGGCTC GACGGCCGCC GGCGCATCGA TCGTCCAGGG GACGTGCTCG AATCAGGGCG GCACGAACTG GTCGCTGCTG CCCGCCGGCA GCGGCTATCA CGTCGTGTCG CAAGCCACCG GACAGTGCCT GAACGTGTAT GGCGGATACA AGACGAGCGC AGCGCCCGTC ATCCAGTATC CGTGCCAGGG CGACGCGCAG ACCAACGATC AATGGACGGT CGCGCCCGTC GGATCGAAGT ACCGGCTGAT TTCGGTGTCG AGCGGCATGT GCCTGAACGT GAGCGGCGGC TCGCTATCGC CGGGCGCGCC GTTGATCCAG TATCCGTGCC AGGACGCGAA CGCGCTCAAC GACCAGTTCT CGCTCGGCCT GCCGCAGTCG TTCCCGGTCA CGCTGCCGTC CGCATGGAGC CCGGTGATTC CTCTGCCCGT CAATCCAATC GGCATCGCGA ACACGCCGAA CGGCAAGCTC GTGATGTGGT CCGCGGATCA GCAGTTCAGC TTCCAGAACG ACGTCGGCGG CAAGGCAACG CAGACGCAGA CCGCGGTGTT CGATCCGGCG ACGAACACGG CGACGCCACA TATCGAAACG TCCGCGGGCT CGGACATGTT CTGCACCGGC ACAGCGATGC TGCCGGACGG CAAGCTGCTC GTGAACGGCG GCGACAGCAG CCCGAAGACG ACGCTGTACG ACTGGGCGAC CAACACGTGG AGCGCGGCGG CGGCAATGAA CATTGCGCGC GGCTATCAGG GCGACACGCT GCTGTCGAAC GGCTCGGTGC TGACGCTAGG CGGCTCGTGG AGCGGCGGCC AGGGCGGCAA GAACGCCGAG GTGTGGATGA ACGGCGGCGC GTGGACGGTG CTGCCCGGCG TGCCCGAGAC GAACATCGTC GGCCCCGATC CGCAAGGAAT CTATCGCGGC GACAACCATC TGTGGCTGTT CGCGCAGGCC GGCGGCACCG TGTTCCATGC GGGGCCGAGC TCGCAGATGA ACTGGATCTC GACCGAAGGC GGCGGCGCGA TCCGGTCGGC GGGCATGCGC GGGGTCGATC CGTTCAGCAT CAACGGCACC GCGTCGCTGT ACGACGTCGG CAAGATCCTG AAGGCGGGCG GCGCGAAATC GTATCAGCAG AACGGCGGCG TCACGACCTA CGCGTCGAAT TCGGTCTACC AGATCGACAT CACGCGCGGG CCGAATCAGC CGGCTGCGGT ACAGCGCCTG AACGGCATGA CTTATCAACG CGCGTTCGCG AACAGCGTGA TCCTGCCGAA CGGCAGCATC GTGATGATCG GCGGCCAGAG CGTGCCGATG CCGTTCACCG ACACGTCCGC AATCATGGTC CCCGAAATCT GGGACCCGGC AACGCAACGC TTCAACCTGC TCAAGCCGAT GCAGACGCCG CGCACCTATC ACAGCACGGC GATCCTGCTG CCGGACGGCC GCGTGTTCGC AGGCGGCGGC GGCCAGTGCG GCGCAGGCTG CGCGATGAAC CACCTGAACG CGGAAATTCT GACGCCGCCG TACCTGCTCA ACACGGACGG CACGCCGGCG CAGCGCCCGG CAATCACGAA TGCGCCTGCG TCGGCGCAGC TCGGGACGTC GATCACAGTA TCGACGCAAG GCCCGGTGAC GTCGTTTGTG CTGATGCGCC TGTCGTCCGT CACGCACACG ACGAACAACG ATCAGCGGCG CATCCCGCTC GCAATCACGT CGTCCGGCGC AACGAGCTAC CGGCTCGCGA TTCCGGCCGA CCCCGGCGTC GTCTTGCCCG GCTATTACAT GCTGTTCGCG CTGAACGCGC AGGGCGTGCC GAGCGTGTCG ACGTCAATCC GGATCTCGTG A
|
Protein sequence | MHFFRFAWRH AQYGFCLLIA GMLLPQGAAA AVTASYSDIV GSDSGLCVST AGNSSASGAG IVLMSCAGLS NTTWSFVPVG NRYHIVLQGS GLCLNVPGGS PNSGTQLIQY ACQDSSLTND QWTIVALGSS YRIVSASSGM CVNVSGGSRA SGAALIQYPC QSASALNDQF NLYLPVVAFT NVAAANSNLC LSVNGGSTAA GASIVQGTCS NQGGTNWSLL PAGSGYHVVS QATGQCLNVY GGYKTSAAPV IQYPCQGDAQ TNDQWTVAPV GSKYRLISVS SGMCLNVSGG SLSPGAPLIQ YPCQDANALN DQFSLGLPQS FPVTLPSAWS PVIPLPVNPI GIANTPNGKL VMWSADQQFS FQNDVGGKAT QTQTAVFDPA TNTATPHIET SAGSDMFCTG TAMLPDGKLL VNGGDSSPKT TLYDWATNTW SAAAAMNIAR GYQGDTLLSN GSVLTLGGSW SGGQGGKNAE VWMNGGAWTV LPGVPETNIV GPDPQGIYRG DNHLWLFAQA GGTVFHAGPS SQMNWISTEG GGAIRSAGMR GVDPFSINGT ASLYDVGKIL KAGGAKSYQQ NGGVTTYASN SVYQIDITRG PNQPAAVQRL NGMTYQRAFA NSVILPNGSI VMIGGQSVPM PFTDTSAIMV PEIWDPATQR FNLLKPMQTP RTYHSTAILL PDGRVFAGGG GQCGAGCAMN HLNAEILTPP YLLNTDGTPA QRPAITNAPA SAQLGTSITV STQGPVTSFV LMRLSSVTHT TNNDQRRIPL AITSSGATSY RLAIPADPGV VLPGYYMLFA LNAQGVPSVS TSIRIS
|
| |