Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3795 |
Symbol | |
ID | 4883479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 3707956 |
End bp | 3709329 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640129723 |
Product | hypothetical protein |
Protein accession | YP_001060790 |
Protein GI | 126440400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000690611 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCATAG CCGTCGCCCC TTTGTTTGCC GCTTGTAGTG GAGGGGGCGG CGGCACCCCG GCCCCCATCG CCGTGCCGCA ATGCTCCGGT TCGAGCTGCG GCGTTCAGGG TCCGCCCAGC TCGACGGCGG CCAACACTTC GCTGTGCCCC GCCGACGCGA ACATCGGCAG CAGCACCTAT CTCGGCGGCG CCGGCGGCGG CGAGATCGTG AGCCTGAACA TCAACGCGAC CACGATGACG TACACGCTCA AGTGGCTCGA GTCGCCGGTT CCGCTCGCCA CCGGCACCGT CACGCCGACC CGCGCCGGCA CGACGATCAC GGGCAGCGTC GCGCATCCGC CCGCGGGCAC GCTGCCGACC GCCGAGCAGA CGCGCTGCGC GTTCGTGCTG CTGCCCGGTA GCGGCACGGC GCCCGCGACG AATTCGACGT ACTCGACCGC GGCCGACTTC AACCAGGCGA ACCCGCCGAT GATCCTGATC GGCTTCGGCG TCGCGGGCGG CGGCATTCCG GGTGCGACGA TCCAGTACAG CGGCCTCACG ATCATCCCGG GCGTGCTGCA GAACATCGGC CAGGTGCCGC AGCGCCATTT CGACTTCTAT CCGTTCCTCG GCTTCGCGAA CACGACGACC GATCTGTCGA AGCTGCCGGG CACGTACAAC GCCCTCGTCT ATCACACGGT GCCGTCGGGC AACTACGCGG CGAAGGCGAT CGCGTCGAAC GAGACGTTCG ATGCGAACGG CGCGTGCACA TCGACGAGCG CATCGGGCTG CATGACGACC GGCAATCCGT GGACGGCGAG CGGCAACGGC TACTTCAACA GCACGCAGGC GCCGCAGATC CTGCCGCAGA CGCAGTTGCC GCTCATCGGC GCGACCGGCA AATCGGCCGT CGCGCACATG GTGCTCGGCC AGTTGAACGG CGCGACCGTG CCTGTCGTCG TGCGCACGGG CAACGTGAAT CTCGGCACGC CGCCGCTGCA CACCGATGCG CAGGTGGACG ACGAATCGGG CATCGCGGTG CTCGGGCTCG CGCAGGCGAT CGCGTCGGGC GGCATCGACG GCGGCTACGC GGGCGCGGAC TCGAACTTCA AGTACACGGC GACGGTGATC AAGGGCACGA CGGGCACGTT CGTGAACCCG AGCACGCAGC AGGCCGAGAC GGGCTTCACG CTCGACTACG GCCAGTCGAC ACCGGGGCTG CTCGGCGTCA CGACGACCGA CACGTCGGCG CCGGGCTTCG TGATCGCGAG CGGCGGGCTA TATGCGGCGC TGGTCCAGGG CACCGTCAAC GGCGGCATCA CGCAGAGCTC GGCGATCGCC GGCCAGACGC CGTCCGCGCC CTACTTCGGC GTAGGCGCGC AAGTCAGCAA GTAA
|
Protein sequence | MAIAVAPLFA ACSGGGGGTP APIAVPQCSG SSCGVQGPPS STAANTSLCP ADANIGSSTY LGGAGGGEIV SLNINATTMT YTLKWLESPV PLATGTVTPT RAGTTITGSV AHPPAGTLPT AEQTRCAFVL LPGSGTAPAT NSTYSTAADF NQANPPMILI GFGVAGGGIP GATIQYSGLT IIPGVLQNIG QVPQRHFDFY PFLGFANTTT DLSKLPGTYN ALVYHTVPSG NYAAKAIASN ETFDANGACT STSASGCMTT GNPWTASGNG YFNSTQAPQI LPQTQLPLIG ATGKSAVAHM VLGQLNGATV PVVVRTGNVN LGTPPLHTDA QVDDESGIAV LGLAQAIASG GIDGGYAGAD SNFKYTATVI KGTTGTFVNP STQQAETGFT LDYGQSTPGL LGVTTTDTSA PGFVIASGGL YAALVQGTVN GGITQSSAIA GQTPSAPYFG VGAQVSK
|
| |