Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A3068 |
Symbol | |
ID | 4888656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2911238 |
End bp | 2912773 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640133004 |
Product | MlrC domain-containing protein |
Protein accession | YP_001064059 |
Protein GI | 126443475 |
COG category | [S] Function unknown |
COG ID | [COG5476] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.295091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTTCG CCATACAGGA AAGCGACATG AACATTCTGA TTGCAGGCTT TCAGCACGAA ACCAACACGT TCGCACCGAC GCGCGCGTCG TATCAGAGCT TCGTTCGCGG CGAAGGCTTT CCGCCGCTCG CGCGCGGCGC CGGCGTGCTG TCGCTGCGCG ACGTCAACGT GCCGATCGGC GGCTTCATCC GGGCCGCGCA GGCGAGCGGG CACGCGCTGC TGCCTGTCGT GTGGGCGGGC GCGTGCCCGT CGGCGCACGT GACGAGCGGC GCGTTCGAGC GGATCGGCGG CGAGATCGTC GCCGCGGTCC AGGCGGGCGG CTTCGACGCG ATCTATCTCG ACCTGCACGG CGCGATGGTC ACCGAGCAGT TCGACGACGG CGAAGGCGAG CTCCTCGCGC GCGTGCGGCG GATCGTCGGC GAGCGGATGC CGATCGTCGT GTCGCTCGAT CTGCATGCGA ACGTCACCGC GCGAATGGCC GCGCATGCGA GCGCGCTCGT TGCCTACCGC ACGTATCCGC ACGTCGACAT GGCGCAAACC GGCGAGCGTG CCGCGCGGGT GCTCGAGCGG CTCGCGGCCG AGGCCCGGCC GCTGCATTGC GCGATACGCC GGCTGCCGTT CCTGATTCCG GTCAACGGCA TGTGCACGCA CGCGGAGCCG GCGAGCGGCG CGTACCGGCT GCTCGCGCAG CTCGAGCGGG ACGGCGTCGT ATCGATGTCG TTCGCCCCCG GCTTTCCGGC CGCCGATTTC CCGGAGTGCG GGCCGACCGT ATGGGCGCAC GCGTTCGAGG CCGACGCGGC GCAGCGCGCG GCCGACGCGC TGTTCGCGAA GCTCGTCGGC GACGAGGCGC GCTGGAGCGT GCCGTTGCTC GCGCCCGACG CGGCCGCCGC CGAGGCGATC CGCCTGAGCC GCACGGCGAC CAGGCCCGTC GTCATCGCCG ATACGCAGGA CAACCCCGGC GCGGGCGGCG ACGCCGATAC GATGGGCATG GTGCGCGCGC TGCTGCGCAG CGGCGCGCGG GACGCGGCGG TGGGCGTGAT CTGGGATCCC GACGCCGCGG CCGCCGCGCA CCGCGCGGGC GTCGGCGCGC GCATCGGCCT GCGTCTGGGC GGCCGCTCGC GCGTGCGGGG CGACGCGCCG CTCGATGCCG AATTCGAAGT CGAGCATCTG TCCGACGGCC GTTTCCGGTT CGACGGCCCG ATGTTCAACG GCGCGCACGG CGAGCTCGGG CCGGTGGCCT GCCTGCGGAT CGACGGCGTG CGGATCGCGG TGAGCACGAA CAAGATGCAG ACGTTCGAGC GCAACCAGTT CCGCGTGGCG GGCATCGAGC CCGAGCGCAC GAAGATCGTC GTGAACAAGA GCTCGGTGCA TTTTCGCGCG GACTTCGAGG CGATCGCCGA TGCGATCCTC GTCGCGAAAT CGCCGGGGCC GATGGCCGCC GATCCCGCGG ATCTCGCGTG GGCGCGTCTC GATCCCGACA TCCGCGTTCG GCCGAACGGA CCGACCTTGA GGTCGCTTCG CGCAATGGCG CGTTAG
|
Protein sequence | MPFAIQESDM NILIAGFQHE TNTFAPTRAS YQSFVRGEGF PPLARGAGVL SLRDVNVPIG GFIRAAQASG HALLPVVWAG ACPSAHVTSG AFERIGGEIV AAVQAGGFDA IYLDLHGAMV TEQFDDGEGE LLARVRRIVG ERMPIVVSLD LHANVTARMA AHASALVAYR TYPHVDMAQT GERAARVLER LAAEARPLHC AIRRLPFLIP VNGMCTHAEP ASGAYRLLAQ LERDGVVSMS FAPGFPAADF PECGPTVWAH AFEADAAQRA ADALFAKLVG DEARWSVPLL APDAAAAEAI RLSRTATRPV VIADTQDNPG AGGDADTMGM VRALLRSGAR DAAVGVIWDP DAAAAAHRAG VGARIGLRLG GRSRVRGDAP LDAEFEVEHL SDGRFRFDGP MFNGAHGELG PVACLRIDGV RIAVSTNKMQ TFERNQFRVA GIEPERTKIV VNKSSVHFRA DFEAIADAIL VAKSPGPMAA DPADLAWARL DPDIRVRPNG PTLRSLRAMA R
|
| |