Gene Information |
Locus tag | BURPS1710b_A2089 |
Symbol | |
ID | 3692055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 2543176 |
End bp | 2544891 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637732343 |
Product | hypothetical protein |
Protein accession | YP_337240 |
Protein GI | 76819156 |
COG category | [S] Function unknown |
COG ID | [COG3455] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03349] type IV / VI secretion system protein, DotU family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0458657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGC TACGAAACCT CTCCTCCACG CTGCGCCGCG TATCGATGGC ATCCAACATG CTCGATCGCG CGGCGGCGAA TCCGGATCTC GTGACGCGCG TGCCGACGCT GTCGTCGCTG TCGAGCGCGG CGATGACGAG CTCCGCCGTA TCGACCACCG GCACGACGAA CACGGTGGCG TCGGGCGGCG CGGCCGCGGG CGCGCCGAGC TTCGCGCCGC CGGCCGCCGA CGCGTTTCCC CAAGCCGACG GCGCGCCCCG CAACCCCGCG GTGCTGCAAT TCCCGGTGCC GGGCGGCGCG CCCGCGCCCG CCGACGCGCG GGTGGCCGCG CCCGTCGTCT ACAGCGCGCA GGGCGAGCAG GCCGCGATCA TGAAAGCAGG CCTGCAGCAG GCGAGCTGGA ACAACCCGTT CGTGTCGCAC GCGCTGCCCG CGGTGCTGCA ACTGCAGCGC CACCTCGCGG CCGGCCCGCT CAATCAGGCC GCGATCCGCA CGCAGCTCGG CCTCGAGGTG CGGCTCTACC GCGAGCGGCT CGCCGCCTCC GGCTGCGAAT GGGAGCAGAT CCGCGACGCG TCGTACCTGC TCTGCACGTA TCTCGACGAA ACCGTCAACG ACGCGGCGCG CGAGCACGCG CAAGTCGTCT ACGACGGCGA GCGCAGCCTG CTCGTCGAAT TCCACGACGA CGCGTGGGGC GGCGAGGACG CGTTCGCCGA CCTGTCGCGC TGGATGAAGA CCGAGCCGCC GCCGATTCCG CTTCTGTCGT TCTACGAACT GATCCTGTCG CTCGGCTGGC AGGGCCGCTA CCGCGTGCTC GACCGCGGCG ACGTGCTGCT GCAGGATCTG CGCTCGCAAC TGCACGCGCT GATCTGGCAT CACGTGCCGC CCGAGCCGCT CGGCACCGAG CTCGTCGCGC CCGCGAAGCG GCGCCGCTCG TGGTGGACGG CCGGGCGCGC GGCGGCCGTC GCGCTCGGCG TGCTGGTGCT CGCGTACGGC GCGATCAGCT TCTGGCTCGA TTCGCAGGGC CGCCCGATCC GCAACGCGCT CGCCGCGTGG ATGCCGCCCA CGCGCACGAT CAACATCGCC GAGACGCTGC CGCCGCCGCT GCCGCAGATT CTCACCGAAG GGTGGCTCAC CGCGTACAAG CATCCGCAAG GATGGCTGCT CGTGTTCAAG AGCGACGGCG CGTTCGACGT CGGCAAGGCG AACGTGCGGG CGGACTTCAT GCACAACATC GAGCGGCTCG GCCTCGCGTT CGCGCCGTGG CCGGGCGACC TCGAGGTGAT CGGCCACACC GATTCGCGGC CGATCCGCAC GAGCGAGTTC CCGGACAACC AGGCGCTGTC CGAAGCGCGG GCGCGCAACG TCGCCGACGA ACTGCGCAAG ACCGCGCTGC CGGGCGGCGC GCGCGCGCCG GAGAACGCGG TGCAGCGCAA CATCGAGTAC TCGGGGCGCG GCGACGCGCA GCCGATCGAC ACCGCGAAGA CGGCCGCCGC GTACGAGCGC AACCGCCGCG TCGACGTGCT GTGGAAGGTG ATTCCCGACG GCGCGCAGCA ATCGGGCCGC AGCCTGAACC TGCAGCAGCC GGAGAAGCCC GGGCAGGTGC CGATGCGTCC GGCGATGCCG GAGGGCGTGG AGATCGCGCC TGACGGGCAA CTGCCGTATG CGACGTCAAC CACGATGCCA GCAACGAGAC CGACCACGGA GGGCCGTCAG CCATGA
|
Protein sequence | MSLLRNLSST LRRVSMASNM LDRAAANPDL VTRVPTLSSL SSAAMTSSAV STTGTTNTVA SGGAAAGAPS FAPPAADAFP QADGAPRNPA VLQFPVPGGA PAPADARVAA PVVYSAQGEQ AAIMKAGLQQ ASWNNPFVSH ALPAVLQLQR HLAAGPLNQA AIRTQLGLEV RLYRERLAAS GCEWEQIRDA SYLLCTYLDE TVNDAAREHA QVVYDGERSL LVEFHDDAWG GEDAFADLSR WMKTEPPPIP LLSFYELILS LGWQGRYRVL DRGDVLLQDL RSQLHALIWH HVPPEPLGTE LVAPAKRRRS WWTAGRAAAV ALGVLVLAYG AISFWLDSQG RPIRNALAAW MPPTRTINIA ETLPPPLPQI LTEGWLTAYK HPQGWLLVFK SDGAFDVGKA NVRADFMHNI ERLGLAFAPW PGDLEVIGHT DSRPIRTSEF PDNQALSEAR ARNVADELRK TALPGGARAP ENAVQRNIEY SGRGDAQPID TAKTAAAYER NRRVDVLWKV IPDGAQQSGR SLNLQQPEKP GQVPMRPAMP EGVEIAPDGQ LPYATSTTMP ATRPTTEGRQ P
|
| |
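The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2543176
END_BP = 2544891
GENE_LENGTH_BP = 1716
PROTEIN_LENGTH_AA = 571


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def ungrouped(seq: str) -> str:
    """Remove the spaces that split a displayed sequence into blocks of 10."""
    return "".join(seq.split())


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2544891 - 2543176 + 1 gives the listed 1716 bp, and 1716 / 3 - 1 gives the listed 571 aa. `ungrouped` can be applied to the sequence fields before measuring their length, since they are displayed in space-separated blocks of 10.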