Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1327 |
Symbol | |
ID | 3846572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1580943 |
End bp | 1583903 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637838629 |
Product | phage-related tail transmembrane protein |
Protein accession | YP_439523 |
Protein GI | 83717435 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAACG CCCTGAAACT GCGCGTGATG TTCGACATGA TCGACAACTT CACGAAGCCC CTGAAAAACG TGCTGAACAG CAACAAGGGG CTCGCGCAGG CGCTCAAGCA GACGCGCGGC GAGCTCGCCG AGCTCGGCAA GCAGCAGAAG GCCGTCGCCT CGTTCCGCGA GATGCGCACC GGGCTCGCGG GCACAGCGGA GAAGCTCGGC GAAGCGCGAA CGCGCGTGAA TGGCCTCGCC ACTGCGTTGC GTGCGGCCGA CCAACCCTCG CGCCAGATGA TTGCCGATTT TGAGAAGGCG AAGCAGTCCG CGGCGCGCCT GTCGATCGAG CACGAGAAGC AGTCCGCCCG CGTACGTGAG CTGCGCGCGC AGCTCGCGAG CACGGGCATC GATACACGCC AGCTCGCCGA GCACGAACGC ACGCTGCGCT CGAACATCGC GCAGACCACG GCGGCAATGC AAACGCAGAC GCGCCAGCTC GAAGCCATGG CCGAGCGCGA GAAGAAGCTC GGCGCGGCGC GCGGCAAGAT GCAGGCGCTA CAGGGCGTCG CCGGCGGCAT GGCGATCGGC GGTTACGCGG CGCGCTCGAC CGGCGCGCAC GCGCTCGGCG ATCTGCGCGA GGCGCTCGAC GAGACGAAGA AGATACAGAA CGAGCGCGCG CGCATTACGG CGCTAGGCCT CGGCGACCAG GCGACAAAGG ACGCCGAGAA GTACGTGCGC TCGATGAAGA TGATGGGCGT GAGCACGTCG GACAACATGA CGCTGATGCG CGACGCGCTG TCGATCTTCG CGGACGAGCA CCACGCGCAG ATGGTGATGC CGACGCTCGC GAAAATGAAG TTCGCGAACG AGGCGATGTT CGGCGCGGAA GACGCGCACG CGAACGAAGA GAAGTTCATG AACATGCTGA AGGTCATCGA GCTGCGCGGC GGCACGAAGG ACGAAGCGAC GTTCAGGAAC GAAGCGAACA TGGTACAGAA GGTGCTGTCG GCGACGGGCG GCCGCGTTGG CGGCGACGAG TGGCGCAACT TCATCCAGAC GGGCGGCATC GCGGCAAAGC AGATGCGCCA GGACGCGTTC TATTACCAGA TGGAGCCGCT GATTCAGGAA ATGGGCGGGC ACCAGGTCGG CACCGGGCTC ATGTCCGCAT ACAGCAACGT CTACCAGGGC AAGACGACCG TGCGGGCCGC ACAGGAGATG ATGAAGCTCG GGCTGCTCGA CAAGAAGAAC GTCGAGTACA ACAAAATCGG CATGATCAAG CGGATCAAGC CGGGCGCGCT GCTCGGCGGC GATCTGTTCA AGGCGTCGCC GCTCGAATGG CTCGAAAAGG TGCTGCTGCC GCAGATGGCG AAGAAGGGCG TCACGGACCC CGACAAAGTG AAGGACATGA TTTCGACGAT CTTCACGAAC CGGACGGCCG CGAATCTGTT CTCGACGATG TACATGCAGA GCCAGCAGAT CCACAAGAAC GAGAAGCTGA ACAAGGGCGC GTATGGCATC GACGAAATGC ACGACCTTGC GTCGAAACAG ACGCCCGGAA AGGAGCTCGA CGCACGCGCG AAGCTGCGTG ATCTGCTGAA TGAAATCGGC GAGCGCATCG CGCCGATGTA TAACGCCGCG CTCGACAAGA CGCGCGAGCT CGCCGACAGG CTGCTGACGA CGATTCAGGC ACACCCACAG GCAACGAAGG TAGTCGTCGC GCTCGCGGCC GGCTTCGCCG CGCTGCTCGC GGTGCTCGGC ACGTTCACGA TCGTCCTCGC CGGCGTGCTC GGCCCGCTCG CAGTCGTGCG TTTCAGCATG GCGACGCTCG GCATCCAGGG CGGCATCCTG TCGCGCGCGC TCGGCATCGG CGCGGCCGCA TGGCGGATGT TCGGCACGGC CGCGATGGGT GCCGGCCGCC TGTTGCTCAC GACGCCGATT GGCCTATACA CCGCGGCGTT CGCCGCCGCC GCGCTGCTGA TCTACCGCTA TTGGGGGCCG ATCAAGGCGT TCGTCGGGGG CGCGCTCACG GCGATCGGCG ATGCACTGGC GCCGATCGGC GTCGCGCTTC GGGGCGCATT GCAGCCGGTC GGTCGCGCGC TCGCGGCAGC AAAACCGCTG TGGAACGGGC TGGGCGGTGC GCTCTCGACG GTGGCCGGCT GGCTCGGCAA GCTGTTCGCG CCGGCGCGCG CGAGCGCCGA TGCCCTATCC GCGGCGGCGG CGGCCGGCCG CGGATTCGGT GCGGTGCTCG GCACGGTGTT GCGCGTCGCG CTCGTGCCGC TCACGTGGCT CGGCCGCGCG CTCGGCGGGC TCGCCGGCCT GTTCGTGGAA GCGATGGGCG ACGCGCGCGC GGCATTGAAC GGCGGGCTCG CCGCGCTCGG CACGCTGATT CTGAACTGGT CGCCGCTCGG CATGTTCTAC CGGGCGCTCG CGGGCGTGCT GTCGCTGTTC GGCGTCGAGC TGCCCGCGAA GTTCTCCGAG TTCGGCGGAC ACCTTATCGA CGGGCTCGTC GGCGGCATCA GCAGCGGACT GGGCAAGGTG AAAGACGCGA TTTCGAATAT GGCGAACAGC ACGGTGGGCT GGTTCAAAGA GAAGCTCGGC ATCCATAGCC CGAGCCGTGT ATTCGCGCAG CTCGGCGGCT TCGTCGGTGA AGGCGCCGCG CTCGGTATGC AGGGTGAGCA GCAGCGCATC GCGAAAGCGG CGCTCGGCCT TGCAACCGTA GCCGTCACGT CATTCGGCAC ACCGGCGCTC GCAAAGCCGA TGCCGCCGCT CGTGCAGGCG ACCGTGCCGA TCGATCGCCG CGCGCCGCTC GCCGCGCCAT CCGCGGCTTC ATCGCCGGCC GCGCCGGCGT CGCCGATCGT CATCAACATC TACCCGCAGG CCGGGCAGGA CCCGCACGCG ATCGCACGCG CCGTCGAAGC CGCGCTCGAT CGCCGCGAGC GCGCGAAGCA GTCGCGCATC GGCTCGCGCC TGTCGGACTG A
|
Protein sequence | MDNALKLRVM FDMIDNFTKP LKNVLNSNKG LAQALKQTRG ELAELGKQQK AVASFREMRT GLAGTAEKLG EARTRVNGLA TALRAADQPS RQMIADFEKA KQSAARLSIE HEKQSARVRE LRAQLASTGI DTRQLAEHER TLRSNIAQTT AAMQTQTRQL EAMAEREKKL GAARGKMQAL QGVAGGMAIG GYAARSTGAH ALGDLREALD ETKKIQNERA RITALGLGDQ ATKDAEKYVR SMKMMGVSTS DNMTLMRDAL SIFADEHHAQ MVMPTLAKMK FANEAMFGAE DAHANEEKFM NMLKVIELRG GTKDEATFRN EANMVQKVLS ATGGRVGGDE WRNFIQTGGI AAKQMRQDAF YYQMEPLIQE MGGHQVGTGL MSAYSNVYQG KTTVRAAQEM MKLGLLDKKN VEYNKIGMIK RIKPGALLGG DLFKASPLEW LEKVLLPQMA KKGVTDPDKV KDMISTIFTN RTAANLFSTM YMQSQQIHKN EKLNKGAYGI DEMHDLASKQ TPGKELDARA KLRDLLNEIG ERIAPMYNAA LDKTRELADR LLTTIQAHPQ ATKVVVALAA GFAALLAVLG TFTIVLAGVL GPLAVVRFSM ATLGIQGGIL SRALGIGAAA WRMFGTAAMG AGRLLLTTPI GLYTAAFAAA ALLIYRYWGP IKAFVGGALT AIGDALAPIG VALRGALQPV GRALAAAKPL WNGLGGALST VAGWLGKLFA PARASADALS AAAAAGRGFG AVLGTVLRVA LVPLTWLGRA LGGLAGLFVE AMGDARAALN GGLAALGTLI LNWSPLGMFY RALAGVLSLF GVELPAKFSE FGGHLIDGLV GGISSGLGKV KDAISNMANS TVGWFKEKLG IHSPSRVFAQ LGGFVGEGAA LGMQGEQQRI AKAALGLATV AVTSFGTPAL AKPMPPLVQA TVPIDRRAPL AAPSAASSPA APASPIVINI YPQAGQDPHA IARAVEAALD RRERAKQSRI GSRLSD
|
| |