Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2336 |
Symbol | |
ID | 4901604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 2311429 |
End bp | 2312994 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640135565 |
Product | hypothetical protein |
Protein accession | YP_001066600 |
Protein GI | 126454265 |
COG category | [S] Function unknown |
COG ID | [COG0397] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTT CCAGAAGCGA GGCCGCACCG GCCGCCCCTC TCCCCGATCT CGCCGCGACG CTCGCCGCGC CGCGCGACGA TGCGTTCCAG CAACTCGGCG CCGCGTTCGT CACGCGGCTG CCCGCCGCGC CGCTGCCCGC GCCGTACGTC GTCGGCTTCT CCGACGACGC GGCGCGCATG CTCGGCCTCG AGCCCGCGCT GCGCGACGCG CCCGGCTTCG CCGAGCTGTT CTGCGGCAAC CCGACGCGCG ACTGGCCGCA GGCGTCGCTG CCGTACGCGT CGGTCTATTC GGGCCACCAG TTCGGCGTGT GGGCGGGCCA GCTCGGCGAC GGGCGCGCGC TCACCATCGG CGAACTCGCG CACGACGGCC GCCGCTACGA GCTGCAGTTA AAGGGCGCGG GACGCACGCC GTATTCGCGC ATGGGCGACG GCCGCGCGGT GCTGCGCTCG TCGATCCGCG AGTTCCTCTG CTCGGAGGCG ATGCACCATC TCGGCATACC GACGACGCGC GCGCTCGCCG TGATCGGCTC CGACCAGCCG GTGGTCCGCG AGGAAATCGA GACGTCGGCG GTCGTCACGC GCGTCGCGCA GAGCTTCGTG CGCTTCGGCC ATTTCGAGCA CTTCTTCGCG AACGATCGGC CCGAGCAGTT GCGCGCGCTC GCCGATCACG TGATCGAGCG TTTCTATCCG GCCTGCCGCG ACGCCGACGA TCCGTATCTC GCGCTGCTCG CGGAAGCGAC GCGGCGCACC GCGGAGCTCG TCGCGCAATG GCAGGCGGTC GGCTTCTGCC ACGGCGTGAT GAACACCGAC AACATGTCGA TCCTCGGCCT GACGATCGAC TACGGCCCGT TCGGCTTCAT CGACGCGTTC GACGCGAAGC ACGTGTGCAA CCATTCGGAC ACGCAGGGCC GCTACGCCTA CCGGATGCAG CCGCGCATCG CGCACTGGAA CTGCTTCTGC CTCGCGCAGG CGCTGCTGCC GCTCATCGGC CTGCACCGCG ACGCGCCGAG CGAAGACGCG CGCGCCGAGC GCGCGGTCGA GGACGCGCAC GCGGTGCTCG GCCGCTTTCC CGAGCAATTC GGCCCCGCGC TCGAGCGCGC GATGCGCGCG AAGCTCGGCC TCGCGCTCGA ACGCGAGGGC GACGCGGCGC TCGCGAACCA GTTGCTCGAG ATCATGGATG CGAGCCATGC CGATTTCACG CTGACGTTTC GCCATCTCGC GCGCGTGTCG AAGCACGACG CGCGCGGCGA CGCGCCCGTG CGGGATCTGT TCATCGATCG CGACGCGTTC GATCGCTGGG CGAACCTCTA TCGCGCGCGC CTGTCGGAAG AAGCGCGCGA CGACGCGTCG CGCGCGGCCG CGATGAACCG CGTGAACCCG AAATACGTGC TGCGCAACCA CCTCGCGGAA ACGGCGATCC GCCGCGCGAA GGAGAAGGAT TTTTCGGAGG TCGAGCGCCT CGCGGCCGTG CTGCGGCGCC CGTTCGACGA GCAGCTGGAG CACGACGCGT ATGCGGCGCT GCCGCCCGAC TGGGCGAGCA CGCTCGAGGT GAGCTGCTCG TCGTGA
|
Protein sequence | MSFSRSEAAP AAPLPDLAAT LAAPRDDAFQ QLGAAFVTRL PAAPLPAPYV VGFSDDAARM LGLEPALRDA PGFAELFCGN PTRDWPQASL PYASVYSGHQ FGVWAGQLGD GRALTIGELA HDGRRYELQL KGAGRTPYSR MGDGRAVLRS SIREFLCSEA MHHLGIPTTR ALAVIGSDQP VVREEIETSA VVTRVAQSFV RFGHFEHFFA NDRPEQLRAL ADHVIERFYP ACRDADDPYL ALLAEATRRT AELVAQWQAV GFCHGVMNTD NMSILGLTID YGPFGFIDAF DAKHVCNHSD TQGRYAYRMQ PRIAHWNCFC LAQALLPLIG LHRDAPSEDA RAERAVEDAH AVLGRFPEQF GPALERAMRA KLGLALEREG DAALANQLLE IMDASHADFT LTFRHLARVS KHDARGDAPV RDLFIDRDAF DRWANLYRAR LSEEARDDAS RAAAMNRVNP KYVLRNHLAE TAIRRAKEKD FSEVERLAAV LRRPFDEQLE HDAYAALPPD WASTLEVSCS S
|
| |