Gene Information |
Locus tag | BURPS1106A_1874 |
Symbol | |
ID | 4900278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1830342 |
End bp | 1831607 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640135104 |
Product | hypothetical protein |
Protein accession | YP_001066139 |
Protein GI | 126452511 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | |
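
The length fields above can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions (which is consistent with the reported gene length) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between sequence blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    length = gene_length(1830342, 1831607)
    print(length)                  # 1266
    print(protein_length(length))  # 421
```

With the values from this record, 1831607 - 1830342 + 1 gives 1266 bp, and 1266 / 3 - 1 gives 421 residues, matching the Gene Length and Protein Length fields. Passing the Gene sequence field to `gc_fraction` would give the value to compare with the reported GC content.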
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTCATC AAATCATCGA CCGCAGACTG GCCGGCAAGA ACAAGAGCAT TGCAAACCGC GAGCGCTTCC TGCGCCGCGT CAAGAACTAC ATTCGCCGCG CCGTGTCCGA CGCGGTGCGC GATCGCAGCA TCAAGGACAT CCAGAGCACG CAGAGCATCA CGATTCCCCG CAAGGACATC GCGGAGCCGA CGTTCCGGCA CGGGCCGGGC GGCAAGCGCG AGCTCGTGCA TCCGGGCAAC GCCGACTACG TGCGCGGCGA CAAGATTCCG CGCCCGCCCG GCGGCGCGGG GGGCGGCGGC AGCCAGGCGA GCAACGAAGG CGAAGGTCAG GACGATTTCG TGTTCGAGCT CTCCCGCGAG GAGTTCATGC AGTACTTCTT CGACGATCTC GAGCTGCCGC GCCTCGTCAA GACCCACCTG CTGACCGTGC CGAGCTGGAA GAACGTGCGC GCGGGCTGGG CGGCGGAGGG CACGCCGAAC AACATCGACG TCGTGCGTTC GCTGCGAAGC GCGCTCGGCC GGCGCATCGC GCTCGGCTCG CCGCTCGTCA ACGAACTGCG CGAGCTCGAA GAGAAGCTCG TCGCGCTGAA GGATGAGCCG GGCGACCATC GCGTCGAGAT CGCCCAGCTC GAGGACGCGA TCCATCACCT GAAGGGCCGC ATCTGGCGCA TTCCGTTCAT CGATCCGTTC GATCTGCGCT ACGTGAATCG CGTGAAGATG CCGCAGCCGT CGAGCCAGGC GGTGATGTTC TGCCTGATGG ATGTGTCGGG CTCGATGGAC GAGCAGCGCA AGGATCTCGC GAAGCGCTTC TTCATCCTGC TGTACCTGTT CCTGAAGCGC AACTACGAGC GGATCGAAGT GGTGTTCATC CGTCACCACA CGCGCGCGGA GGAAGTCGAC GAGGACACGT TCTTCCATTC GACCGAAAGC GGCGGCACGG TGGTGTCGAG CGCGCTCGAG CTGATGCGCA AGGTGATGGA GGAGCGCTAT TCGCCGACCG AATGGAACAT CTACGGCGCG CAGGCGTCGG ACGGCGACAA CTGGACCGAC GATTCGCCGA AGTGCCGCAA GATCCTCGAC GAGGACATCC TGACGAAGGT GCGCTACTTC GCGTACATCC AGGTCACGCC CGAGGAGCAG AACCTGTGGC TCGAATACGC GCAACTGGCG TTGTCACAAC CGCATCTCGC GATGAAGAAA GTGGAATCGG CTGCCGACAT CTACCCCGTG TTCCGGGAAC TCTTTGAAAA GCACGTGGAA ACCTGA
|
Protein sequence | MLHQIIDRRL AGKNKSIANR ERFLRRVKNY IRRAVSDAVR DRSIKDIQST QSITIPRKDI AEPTFRHGPG GKRELVHPGN ADYVRGDKIP RPPGGAGGGG SQASNEGEGQ DDFVFELSRE EFMQYFFDDL ELPRLVKTHL LTVPSWKNVR AGWAAEGTPN NIDVVRSLRS ALGRRIALGS PLVNELRELE EKLVALKDEP GDHRVEIAQL EDAIHHLKGR IWRIPFIDPF DLRYVNRVKM PQPSSQAVMF CLMDVSGSMD EQRKDLAKRF FILLYLFLKR NYERIEVVFI RHHTRAEEVD EDTFFHSTES GGTVVSSALE LMRKVMEERY SPTEWNIYGA QASDGDNWTD DSPKCRKILD EDILTKVRYF AYIQVTPEEQ NLWLEYAQLA LSQPHLAMKK VESAADIYPV FRELFEKHVE T
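
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the tool the database used. It relies on two facts about NCBI translation table 11: codon-to-amino-acid assignments are the same as in the standard code, and alternative start codons such as GTG are accepted, with the initiating codon read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical and introduced only for this example.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments and differs in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a plus-strand CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An initiating alternative start codon is read as methionine.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop the terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the Gene sequence field.
    print(translate_cds("GTGCTTCATC AAATCATCGA CCGCAGACTG"))  # MLHQIIDRRL
```

The first 30 bases of the gene translate to MLHQIIDRRL, the first block of the protein sequence. The leading GTG would encode valine at an internal position but is read as M when it initiates translation.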