Gene Information |
Locus tag | BTH_II1886 |
Symbol | |
ID | 3844615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2285699 |
End bp | 2287483 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637839187 |
Product | hypothetical protein |
Protein accession | YP_440080 |
Protein GI | 83717203 |
COG category | [S] Function unknown |
COG ID | [COG3455] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03349] type IV / VI secretion system protein, DotU family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.419438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGC TACGAAACCT CTCCTCCACG CTGCGCCGCG TATCGATGGC ATCCAACATG CTCGATCGCG CGGCGGCGAA TCCGGATCTC GTGACGCGCG TTCCGACGCT GTCGTCGCTG ACCAGCGCGG CGATGACGAG CTCCGCCGTA TCGACCACCG GCACGACGAA CGTGGCTGCG TCGGGTGGCG CTGCGTCGGG CATGGGCGTG CCGGGCGCAT ATCCGGCCGC CGACGTGCCG CATGCCGACG GTGCGCCCCG CAATCCGGCG GTGCTGCAAT TTCCGGTGCC GGGCGGCGCG CATGCGCCTG CCGACGCGCG GGACGCGGCA CCCGTCGTCT ACAGCGCGCA GGGCGAGCAG GCCGCGATCA TGAAGGCGGG CCTGCAGCAG GCGAGCTGGA ACAATCCGTT CGTGTCGCAC GCGCTGCCCG CCGTGCTGCA GTTGCAGCGG CATCTCGCGG CCGGGCCGCT CAATCAGGCC GCGATTCGCA CGCAGCTCGG GCTCGAAGTG CGGCTCTACC GCGAGCGGCT CGCCGGCTCC GGCTGCGAAT GGGAGCAGAT CCGCGATGCG TCGTACCTGT TGTGCACGTA TCTCGACGAG ACCGTCAACG ATTCGGCGCG CGAGCATTCG CAGGTCGTCT ATGACGGCGA GCGCAGTCTG CTCGTCGAGT TTCACGACGA CGCATGGGGC GGCGAGGATG CGTTCGCCGA CCTGTCGCGC TGGATGAAGG CGGACCAGCC GCCGATTCCG CTTCTGTCGT TCTACGAACT GATCCTGTCG CTCGGCTGGC AGGGGCGCTA CCGCGTGCTC GACCGCGGCG ACGTGCTGCT GCAGGATCTG CGCTCGCAAC TGCACGCGCT GATCTGGCAT CACGTGCCGC CCGAGCCGCT CGGCACCGAT CTCGCGACGC CCGCGAAACG GCGCCGTTCG TGGTGGACGG CCGGCCGCGC GGCGGCCGTC GCGCTCGGCG TGCTCGTGCT CGCGTACGGC GCGATCAGCC TCTGGCTCGA TTCGCAGGGG CGGCCGATCC GCAACGCGCT CGCCGCGTGG ATGCCGCCGA CGCGCACGAT CAACATCGCC GAGACGCTGC CGCCGCCGCT GCCGCAGATC CTCACCGAAG GGTGGCTCAC CGCGTACAAG CATCCGCAGG GATGGCTGCT CGTGTTCAAG AGCGACGGCG CGTTCGACGT CGGCAAGGCG AAGGTCCGGC CGGACTTCAT GCACAACATC GAGCGGCTCG GCCTCGCGTT CGCGCCCTGG CCGGGCGACC TCGAGGTGAT CGGCCACACC GATTCGCGGC CGATCCGCAC GAGCGAGTTC CCGGACAACC AGGCGCTGTC GGAGGCGCGG GCGCGCACGG TGGCCGACGA GCTGCGCGCG ACCGCGCTGC CGGGCGGCGC GCGTGCGCCG GAGAACGCGG TGCAGCGCAA CATCGAGTAT TCGGGGCGCG GCGACGCGCA ACCGATCGAC ACCGCGAAGA CGGCCGCCGC GTACGAGCGC AACCGCCGCG TCGACGTGCT GTGGAAGGTG ATTCCAGACG GTGCGTCGCA ACCGGGCCGC AGCCTGAATC TGCAGCAGCC GGAGAAGCCC GGCCAAGTGC CGATGCGTCC GGCGATGCCG GAAGGCGTCG AGATCGCGCC TGAAGGGCAA TTGCCGTATG CGACCACGGA GATCACGACC GCGACGCCGA CCACGATGCC AACCACGAAG CAAGCACCGA CACCGGTGCC GACACCAGCA CCGACGCCAG CAACGGGACC GACCACGGAG GGCCGTCAGC CATGA
|
Protein sequence | MSLLRNLSST LRRVSMASNM LDRAAANPDL VTRVPTLSSL TSAAMTSSAV STTGTTNVAA SGGAASGMGV PGAYPAADVP HADGAPRNPA VLQFPVPGGA HAPADARDAA PVVYSAQGEQ AAIMKAGLQQ ASWNNPFVSH ALPAVLQLQR HLAAGPLNQA AIRTQLGLEV RLYRERLAGS GCEWEQIRDA SYLLCTYLDE TVNDSAREHS QVVYDGERSL LVEFHDDAWG GEDAFADLSR WMKADQPPIP LLSFYELILS LGWQGRYRVL DRGDVLLQDL RSQLHALIWH HVPPEPLGTD LATPAKRRRS WWTAGRAAAV ALGVLVLAYG AISLWLDSQG RPIRNALAAW MPPTRTINIA ETLPPPLPQI LTEGWLTAYK HPQGWLLVFK SDGAFDVGKA KVRPDFMHNI ERLGLAFAPW PGDLEVIGHT DSRPIRTSEF PDNQALSEAR ARTVADELRA TALPGGARAP ENAVQRNIEY SGRGDAQPID TAKTAAAYER NRRVDVLWKV IPDGASQPGR SLNLQQPEKP GQVPMRPAMP EGVEIAPEGQ LPYATTEITT ATPTTMPTTK QAPTPVPTPA PTPATGPTTE GRQP
|
| |
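The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. Both assumptions are consistent with the values listed here, but the record does not state them. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for BTH_II1886.
start_bp, end_bp = 2285699, 2287483

length_bp = gene_length(start_bp, end_bp)
protein_aa = expected_protein_length(length_bp)

print(length_bp)   # 1785, matches "Gene Length"
print(protein_aa)  # 594, matches "Protein Length"
```

Under those assumptions, the computed values agree with the Gene Length (1785 bp) and Protein Length (594 aa) fields of the record.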