Gene Information |
Locus tag | BURPS668_A0669 |
Symbol | |
ID | 4886760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 642733 |
End bp | 643989 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640130609 |
Product | heptosyltransferase |
Protein accession | YP_001061668 |
Protein GI | 126443611 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.822838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATG AGCGCGCGGC GAAGCGATGG CCCGCGCGCG GCGAAGACAT GAAAATACTG TTCATCAAGC TCTCCGCATT GGGCGACGTG CTGGCCAGCA CGCCGCTGTT CGCGTCGACC AAGGCCGAGC ATCCGGACTG GTTCGTCGGG CACGTGGTGG CGCGTCCCTA TGCGGCTGCG ACGCGAAACA ATGCGCACGT CGATGCGCAA TTCATTGTCG ATTCGCCGCT GTCCGGTGGC GCGATTCGGA AAATCAGGGT GGCGGCGCGA ATATGGCGGT ACATGATGCG CGAACGATAC GACATCGCGG TTGTGCTGCA TCGGAGTTTT GTACTACAAC TCATATGTCG CCTCGCATCG GTCAGGAAAA CCATCGGCTA TGAAAGCCGA TTCTCATTCC TATTGAGCCA CTCCATTCCG TTTTCGATGC AGGGAAATCG AAGCGGGCTG GAATTGCGTT TGCTGAAGTC GGCGGGAATC ATCCACGACG AAAAGAAGAA ATTGAGGTTC GACATCGATT TCGGGAACGT GGACCGAAAC CGGCTGCGCG CGTTGCCCGC CGCGTTCATC GCCGTCAACG CGGGCGGCGG CAACGCGGAT GCGCAGGCAG CCAACAAGCT GTGGCCCGCC GAGCGTTACG GTGCATTGAT CAAGCGGTTG CCGTTGCCGG TCGTGATGCT CGGACACGGC GCGGCGGATG AAGACATCAG GGATCGGGTC GCGGCGACGG GGGCGAGGTT CGTCGACATG GTCGGCAAGA CGAATCTCGA CGAGACGGCG GTCATCCTCG AACGCTCGCG TCTGTATGTG GGCAACGACA GCGCGCTTTT GTATCTCGCG GCATCGCTCG GCGTGACGAC GATCGGGATC TACGGGCCTA CCGATCCCGC CGCGTTCAGT CCGTTGGGCG CGAACAATCT GTGGCTGAGT GGCAAGACGT CCTGTGCACC GTGTTATTCG TCGTTCGACG GGATCGGCGG GCGCATGTAC ACGTGCACGA ACAACATTTG CATGCAGGCC GTTACGGTCG AATCCGTCAG CGAGAGAATC CATGCAGCCC TCCATCAAGA TCAGAATCTA CAAGCGGCTG GACCGGATGT TGTCGGATCT GGTCAGGGCC GTGCCGCATC CGAAACGCGC GCTCGGGCGG ACACCGACGC GCGTGCTGAT CATCAAGCTC TCGGCGATGG GGGATTCGCT GTGCCTCTTT CCCACCGTTC GGCAACTGGC GCTCGCGTTC CCGGGGGCGA CGATTGA
|
Protein sequence | MADERAAKRW PARGEDMKIL FIKLSALGDV LASTPLFAST KAEHPDWFVG HVVARPYAAA TRNNAHVDAQ FIVDSPLSGG AIRKIRVAAR IWRYMMRERY DIAVVLHRSF VLQLICRLAS VRKTIGYESR FSFLLSHSIP FSMQGNRSGL ELRLLKSAGI IHDEKKKLRF DIDFGNVDRN RLRALPAAFI AVNAGGGNAD AQAANKLWPA ERYGALIKRL PLPVVMLGHG AADEDIRDRV AATGARFVDM VGKTNLDETA VILERSRLYV GNDSALLYLA ASLGVTTIGI YGPTDPAAFS PLGANNLWLS GKTSCAPCYS SFDGIGGRMY TCTNNICMQA VTVESVSERI HAALHQDQNL QAAGPDVVGS GQGRAASETR ARADTDARAD HQALGDGGFA VPLSHRSATG ARVPGGDD
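The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS.

    Assumes gene_len includes the terminal stop codon, which does not
    contribute a residue to the protein.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record: Start bp 642733, End bp 643989.
length = gene_length(642733, 643989)
print(length, protein_length(length))  # prints "1257 418"
```

Under these assumptions the result agrees with the Gene Length (1257 bp) and Protein Length (418 aa) fields. Passing the full Gene sequence string to `gc_percent` would give a value to compare against the GC content field; the function strips the spaces that separate the 10-base blocks.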
|
| |