Gene Information |
Locus tag | BURPS668_A2103 |
Symbol | |
ID | 4888502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2041988 |
End bp | 2043205 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640132040 |
Product | hypothetical protein |
Protein accession | YP_001063097 |
Protein GI | 126443311 |
COG category | [S] Function unknown |
COG ID | [COG5441] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
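The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length(2041988, 2043205)
print(length, protein_length(length))  # prints "1218 405"
```

Both results agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields in the record.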
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.108299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCCC ACCGTACACG GATCTACGTT GCGGCAACCG TTGATACAAA GGGCGCGGAA GCGCATTTCG TCAAGGACCG GATCGCCGAC GCGGGGCTCG CGGCGGTCGT GGTCGACCTG TCGACCCGCG TGCCGGGGCG CGCGGCGGAC ATCGGCGCCG ACGCCGTCGC CGCGCACCAC CCCGACGGCG CGGCCGCCGT GTTCTGCGGC GACCGCGGCC GCGCGATCGC CGCGATGGCG GTTGCGTTCG AACACTACAT CAAAAGCCGC GACGACGTCG CCGCGCTGAT CGGCATCGGC GGCTCCGGCG GCACGGCGCT TGTCACGCCG GCGATGCAGG CGCTGCCGAT CGGCATGCCG AAGCTGATGA TCTCGACGAT GGCCTCGGGC GACGTGTCCG CGTACATCGG CTCGTCGGAC ATCGCGATGC TCTATTCGGT GGCGGACATC GCCGGCCTGA ACCGGATCTC GCGCCAGGTG CTCGCGAACG GCGCGCACAT GATCGCCGGC GCGGTGCGCG ACATGCAGCC GCCGCACGCC GATCTGAAGC CCGCGCTCGG CCTCACGATG TTCGGCGTGA CGACGCCGTG CATCCAGGCG GTCACCTCGC GGCTCGACGC GCGCTTCGAC TGCATCGTGT TCCACGCGAC GGGCAAAGGC GGCCCCGCGA TGGAAAAGCT CGCCGACAGC GGCCTGCTCG ACGGCGTGCT CGATCTCACC ACCACCGAAG TCTGCGATCT GCTGATGGGC GGCGTGCTCG CGTGCGGCGC GGACCGGTTC GACCTGATCG CGCGCAGCCG GGTGCCGTAC GTCGGCTCGT GCGGCGCGCT CGACATGGTG AACTTCGGCC ACATCGATAC GGTGCCGCCC CGCTACGCGC AGCGGCTGCT GTACAAGCAC AACCCGCAGG TCACGCTGAT GCGCACGACG CCCGACGAGA ACCGCCGGAT CGGCGAATGG ATCGGCGCGA AGCTGAACGC ATGCGACGGC CCGGTGCGCT TCCTGATTCC CGAAGGCGGG GTCTCCGCGC TCGACGCGCC GGGCCAGGCG TTCTGGAACC CGCAAGCCGA CGAGGCGCTG TTCGACGCGC TCGAGGCCAC CGTCGTGCAG ACCGAGCGCC GCCGCCTCGT GCGCGTCCCC GCGCACATCA ACGATGCGCG GTTCGCCGAC GCCGCCGTAG AACACTTCCT ATCGCTTCAC GCAGCACACC GGAATTGA
|
Protein sequence | MVPHRTRIYV AATVDTKGAE AHFVKDRIAD AGLAAVVVDL STRVPGRAAD IGADAVAAHH PDGAAAVFCG DRGRAIAAMA VAFEHYIKSR DDVAALIGIG GSGGTALVTP AMQALPIGMP KLMISTMASG DVSAYIGSSD IAMLYSVADI AGLNRISRQV LANGAHMIAG AVRDMQPPHA DLKPALGLTM FGVTTPCIQA VTSRLDARFD CIVFHATGKG GPAMEKLADS GLLDGVLDLT TTEVCDLLMG GVLACGADRF DLIARSRVPY VGSCGALDMV NFGHIDTVPP RYAQRLLYKH NPQVTLMRTT PDENRRIGEW IGAKLNACDG PVRFLIPEGG VSALDAPGQA FWNPQADEAL FDALEATVVQ TERRRLVRVP AHINDARFAD AAVEHFLSLH AAHRN
|
| |
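The gene and protein sequences above can be related to each other with a small translation routine, and the GC content field can be recomputed from the gene sequence in the same way. The following is a minimal sketch using only the Python standard library. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in its permitted start codons, which does not matter here because this gene begins with ATG. All names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database API.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; assumed to
# match the codon assignments of translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGGTACCCC ACCGTACACG GATCTACGTT"))  # prints "MVPHRTRIYV"
```

Passing the full gene sequence to `translate` should yield a string that can be compared against the Protein sequence field, and `gc_fraction` on the same input can be compared against the reported GC content.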