Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2040 |
Symbol | |
ID | 4906230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 2012829 |
End bp | 2016833 |
Gene Length | 4005 bp |
Protein Length | 1334 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640145145 |
Product | hypothetical protein |
Protein accession | YP_001076073 |
Protein GI | 126458165 |
COG category | [S] Function unknown |
COG ID | [COG3523] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATCG CGCGGATCGC CGCGCTGCTC GCGCTCGCCG CGCTCAGTTG GGCGAGCGCG CTCTATTTCG ACTGGCCGCT GTGGTGCGCG CCCGCCGTGT TCTGCGCGGC GCTCGCCGGA TGGCTGCTGT GCCGCGTCGC GCGCCGCGCG CTGCGCGCGG TTCGTGCGCG CGCGCAGCTC GCACGGCTCG ACGCATCCGA GCGCCTGCCC GCGGCCGACG CGCCGGAGCT GCGCGTCGCC GCGCGCTGGC GCACCGCGCT CGCCGCGCTC GGGCGCGCGC AGCCCGCGCC GCGCCTGCGC GGCCGGCCGC GCGACGCGCT GCCGTGGTAT CTCGTGATCG GCCCCGAGGG CGCGGGCAAG ACAACCGCGC TCGCACGCGC GCGCCTCGTG TCGCCGCTGC GGCACGCGCG CGACGACGCG GCCATCGCGC CGACCGAGGA CTGCGACTGG TGGTGCTTCG ACGACGCGGT CGTGCTCGAT CTCGCGGGCC GCTTCGCCGC GCCCGGCGCG ACCGACGACG ATCGCCGCGC ATGGCACGCG CTCCTCGAGC AGCTCGGCCG CGCGCGTGCG CGCCGGGGCG TGAACGGCGT CGTGGTCGCG ATCGACGCGC CGCGCCTCAT CGAAGCGAGC CGCGATGCGC TGACGATCGA AGGCAGCGCC ATTCGCGAAC GGCTCGAGCA ACTGATCCGC CTGTTCGACC GGCGCTTTCC GGTGTACGTG CTCGTCACGC AATGCGACCG GCTCTACGGC TTCGACGAAT GGGCCGCGCA GCTCGCGCCG GAACAACGCG AGCGCGCGTT CGGCTATCTC GGCGATCACG ACGCGGATGC GTTCGTCGCG CGCGCGCTCG ACAGCCTCGA CGCACGGCTC GGCGCACTGC GCGTCGCGCT GGCGGCGCGC GGCGAGCCGC CGTCGCCGCG CGCGCTGATG CTGCCGCACG AACTCGCGTG CCTGCGGCCC GCGCTCGAGG CGTTCGCACG CGCGGCGTTC GGCCCGAACG TCTATCAGGA AACACCGTAT CTGCGCGGGC TGCTGTTCGC GAGCGGCCGG CAGGCGGGCG GCGCGCCGTC GCTGACGCTG CCCGACTGGC TCGACGCCGC GCCGGCGCGC ACGCCCGGCG ATGCGGGCCT GTTCCTGCAC GACGTGTTCG CGCGCGTGCT GCCGGGCGAG CGCGACGCGT GGCTGCCCGT CGAGCGGCCG AGCCGCTCGC GGCTCGTGCT GCGGCGCCTC GCGCTCGCCG CGTGGCTCCT CGCGAGCATC GCGGCCGGGC TGCTGATGAG CGCATCGTTC GTCGGCGACA TGCGAACCGT CGAGCTGGTC CGCCGCGACT ATCCCGCGCA TCCGCGCTTC ACCGGCGAAT TCGCGCACGA CACGGCGACG CTCGAGCGGA TCGGCCGCGT GATCGCCGAC GTCGAGCGGC GCGACGAGCG GCGGCTCGTG CGCCCGCTCG CCGGCGCGAC GCCCGTCGGC CGGCTCGAAG CGGAACTCAA GCGCCGCTAC GTCGCGCACT ATCGGCGCTC GATCGAACCC ACCGCCGATC GGCTGTTCTT CGGCGAGCCG GACGGCGCGA GCGACGTGCG CGCGGCCGAC GATGCGCGGC TCGCCGCGCG CATCCGCAAT CTGGTCCGCT ACGTGAACCT GATGCAGGCG CGCCGGCGCG GCGCGGATCG CGAGACGCTC GCGCAAATGC CCGCACCGGC GGTCGCGCGC GCGACGGATG GCGGTGGCGC TCGCGGAGTG GACAGTGTGG GTGTTTCTGG TGTTTCTGGT GTTTCTGGTG TTTCTGGTGT TTCTGGTGTT TCTGGTGTTT CTGGTGTTTC TGGCGGTGTA GGCGGTGTAG GCGGTGTAGG CGGTGCGCGC GCCGCGGAAG GCGCGGGTCG CGCCGTCCGC GCGAGCGGCG GACGGCGCGC AGGCGACGCG CGCGCCGCGC GCGGCATGCG CGGCGCCGCC GGCAACGCCG GCGCCGGCGC TGAAATCGCA CCCGCCGCAG ACGGCGTTCG CGATCGCGAC GAAGGGTTCG CGCGCGTCAG CGCGCTCGCC GTCGACCTGA TCGCGTGGAG CGCGCCCGAC GATCGCGCGC TCGCCGCGCG CGTCGCGGCC GCGCAGGCAG AGCTCGAACG GCTCGCGTAC CGCGATCCGG ACGGCGCGTG GCTGCTCGCG CTGCCGGATG CCTCGATGCC GCGCGACGTG ACGCTCGCCG ATTTCTGGCC CGCCGCCGAG CCGCGCACGC TGCCCGACGC ACAGCCCGCC GCGCTGCGCG ACGTGCGCGT GCCCGCCGCG CTGACGGCCG CGCATCGCCC GGCCATCGAT GCGTTCCTCG ACGAGATGGC GCAGGCGGTC GCGAACCGGC CGAAGTTCGC GTTCCATCGC GACGCGTTCG ACACGTGGTA CCGCGCACGG CGCATCGACG CATGGCGCGA CTTCGTCGCA CGCTTCCCGC AGGGCGAGCA AGGGCTGACG ACGCAAGCGC AATGGCGCGC GGTGATCGAC CGGATCGCCG ATCGCCGCGA TCCGTTCGCG GCATTGCTCG CGCGCGTCGA TCGCGAATTC GAATCCGAGC GCGACGACAC GCTGCCGCCG TGGCTACGGT TCGTGCGCAC GACATCGCAC ATGCTCGCCC CCGCGATGCT CGCGCCCGCG CGCCTGCCCG CGGCGAGCGG CGGCCTGGGC GCCGCGCTCG GCTCGATCTC GCGCAGCGGC GGACGCGCGC TGCGCGAAGC GCTCGGCGGC GCGCCCGAGC AAGGCAGGCT CACGCTCGAG CGCGATGCCG CGCTGCGCGA CGCGCTTGTC GACTACGAGC GGCGCGTCGC CGCGCTCGCC GCCGATGCGC TTGCGGGCCC CGGCGCCGCG TACCGGCTCG CGGCCGATTT CCACGGCTTC GGCGTCGATC CGTCCGTCCA GGCTTCCGCG ATGCGCGCGG CCGACAGCGC GCTGCGCGAC GTCAAGCGGC TCGCGGGCGA TCGCGACGTC GGCGGCGACG TCGTCTGGAG CCTCGTCGGC GGCCCGCTGC GCGCGCTGAT CGCGTATGTC GAGCGGCAGG CGTCGTGCGC GCTGCAGGAC GACTGGGAGC GCGACGTGCT GTGGCCGCTC AGGCGCGCGG CGACGCGCGA CGACGTGGAG GGTCTGCTGT ACGGCCCGCA AGGCGCGATC TGGGCGTTCG TCGACGGCCC GGCGAAGCCG TTCGTGCGTG TCGGCGCGGC GCGTGCATCG GCGCTCGACA CGCTCGGCTA CCGGCTGCCG TTCACCGACG CGTTCCTGCC GCTCATCGAC GACGCGGCCG CCCGGCGCGT CGCGCAAGCG CGGCGCGACG CCGAGCGGCG CGCGCAACAG CAGGCGGCGC TCGAGCTGGA CGAGCGGATC GCATCGCTCG GCAAGCAGAT CGACGCGCTG CGCGCGCAAA CGGTGCGCTT CGGGATCGTC GCGCAGCCGA CCGACGTGAA CCCGGATGCG CAGGCCAAGC CGTTCGAAAC CGTGCTGACG CTGCAATGCG CGCCGCAGGC GCGCACGCTG ACGAACTACA ACCTGCGCGT GTCCGAGCAG ATCGACTGGC AACCCGATCG ATGCGGCGAC GCGACGCTGC GCATCTCGCT CGGCGGCGTC ACGCTCGTGC GCCGTTACGC GGGATCGCTC GGCGTCGCGC GCTTCGTCCA GGACTTCCGC TACGGCGTGC GCCGCTTCAC GCCCCGCGAT TTCCCGGACG CGAAAGCGCA GCTTCAACGT CTCGGCGTGC GCCACATCGA CGTGCGTTAC GACTTCTCCG GGCATGACGC GCTGCTCGCG CACGTCGGTC GGATCGACGC GCTCGAACGC GCGCGCCGCG ACGACCTCGC GCGGCAGCGG CGCGCCGCGA ACCGGCAGGA CGACGACGCG GGCGGCGCGA CGGCGATCGC GCGGGCGGCG GCGTCCGTCG CGAATGCGCC GGGCGCCGCG CTGCCCCGCC GCATCGGCGT GTGTTGGGGG GACCCGGCGC ACGACAGGCC CGATGGCGAC GGCGCGCAAC CGTGA
|
Protein sequence | MMIARIAALL ALAALSWASA LYFDWPLWCA PAVFCAALAG WLLCRVARRA LRAVRARAQL ARLDASERLP AADAPELRVA ARWRTALAAL GRAQPAPRLR GRPRDALPWY LVIGPEGAGK TTALARARLV SPLRHARDDA AIAPTEDCDW WCFDDAVVLD LAGRFAAPGA TDDDRRAWHA LLEQLGRARA RRGVNGVVVA IDAPRLIEAS RDALTIEGSA IRERLEQLIR LFDRRFPVYV LVTQCDRLYG FDEWAAQLAP EQRERAFGYL GDHDADAFVA RALDSLDARL GALRVALAAR GEPPSPRALM LPHELACLRP ALEAFARAAF GPNVYQETPY LRGLLFASGR QAGGAPSLTL PDWLDAAPAR TPGDAGLFLH DVFARVLPGE RDAWLPVERP SRSRLVLRRL ALAAWLLASI AAGLLMSASF VGDMRTVELV RRDYPAHPRF TGEFAHDTAT LERIGRVIAD VERRDERRLV RPLAGATPVG RLEAELKRRY VAHYRRSIEP TADRLFFGEP DGASDVRAAD DARLAARIRN LVRYVNLMQA RRRGADRETL AQMPAPAVAR ATDGGGARGV DSVGVSGVSG VSGVSGVSGV SGVSGVSGGV GGVGGVGGAR AAEGAGRAVR ASGGRRAGDA RAARGMRGAA GNAGAGAEIA PAADGVRDRD EGFARVSALA VDLIAWSAPD DRALAARVAA AQAELERLAY RDPDGAWLLA LPDASMPRDV TLADFWPAAE PRTLPDAQPA ALRDVRVPAA LTAAHRPAID AFLDEMAQAV ANRPKFAFHR DAFDTWYRAR RIDAWRDFVA RFPQGEQGLT TQAQWRAVID RIADRRDPFA ALLARVDREF ESERDDTLPP WLRFVRTTSH MLAPAMLAPA RLPAASGGLG AALGSISRSG GRALREALGG APEQGRLTLE RDAALRDALV DYERRVAALA ADALAGPGAA YRLAADFHGF GVDPSVQASA MRAADSALRD VKRLAGDRDV GGDVVWSLVG GPLRALIAYV ERQASCALQD DWERDVLWPL RRAATRDDVE GLLYGPQGAI WAFVDGPAKP FVRVGAARAS ALDTLGYRLP FTDAFLPLID DAAARRVAQA RRDAERRAQQ QAALELDERI ASLGKQIDAL RAQTVRFGIV AQPTDVNPDA QAKPFETVLT LQCAPQARTL TNYNLRVSEQ IDWQPDRCGD ATLRISLGGV TLVRRYAGSL GVARFVQDFR YGVRRFTPRD FPDAKAQLQR LGVRHIDVRY DFSGHDALLA HVGRIDALER ARRDDLARQR RAANRQDDDA GGATAIARAA ASVANAPGAA LPRRIGVCWG DPAHDRPDGD GAQP
|
| |