Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3603 |
Symbol | |
ID | 4899407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3508538 |
End bp | 3509518 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640136829 |
Product | glycerophosphoryl diester phosphodiesterase family protein |
Protein accession | YP_001067834 |
Protein GI | 126455274 |
COG category | [C] Energy production and conversion |
COG ID | [COG0584] Glycerophosphoryl diester phosphodiesterase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAC TGATCGACAC CCTCGGACGG CGCGCGGTGC TCATCGGCGT CGCGCTCGGC CTGGCCGCCT GCGCGGGCGG CGGCGGCCCG CCGGGCGAGG CGCCGGCGAC GCTGCCGCGC ATCGTCGCGC ATCGCGGCGG CGCGGCCGAT GCGCCGGAGA ACACACTCGA TGCGATCCGG GCGGCGGTCG CGAATCGGGC GGACGCGATT TGGCTGACCG TCCAACTGAG CCGCGACGGC GTGCCGGTGC TGTATCGGCC CGCCGATCTA TCGGCGCTCA CGCGCTCGAG CGGCCCGGTC GCCGGCCACA CGGCCGCGCA GCTCGCGCAG ATGAACGCCG GCTGGCAATT CCGCGATGCG GGCGGGCGGT ATCCGTATCG CGCGCGCCCG GTCGGCATTC CGACGTTGCG CGACGCGCTG CGCGCGATTC CGCCCGCGAT GCCGATCGTG CTCGACATGA AGGCGGTGCC CGCCGCGCCG CAGGCGAAGG CCGTCGCGGA CGTGCTGACG AGCGAGGCCG CGTGGCCGCG CGTGACGATC TATTCGACCG GTGCCGCTTA TCAGACCGCG TTCGCCTCGT ATCCGCAGGC ACGGCTCTTC GAATCGCGCG ATGCGACGCG CGGGCGGCTC GTCGACGTGC TGCTCGGCGG CGCGTGCGAA CGCGCGCCCG AGGCGCCTGC GACGGCGCCC ATATGGACCG GCTTCGAAAT GCATCGAAAC ATGACGGTGA GCGAGCGCTT CACGCTCGGC GAAGGCGTAT CGCCCGTGAA GGCGACGTTG TGGACGCCCG CGACCGTCGC GTGCTTCAGG CGGCGCGCGG ACGTGCGGAT TCTCGCGATC GCGGTGAACG ACGCCGACGA TTACCGCACG GCCGCGTGCC TCGGGCTCGA TGCGGTGCTC GCGGATTCGC CGCGCGAGAT GGCGGAAATC CGGTCGGCGC TGCGGGCGCG GCCGTTGCGG TGCGAGACGG GGGCGCGATA G
|
Protein sequence | MNQLIDTLGR RAVLIGVALG LAACAGGGGP PGEAPATLPR IVAHRGGAAD APENTLDAIR AAVANRADAI WLTVQLSRDG VPVLYRPADL SALTRSSGPV AGHTAAQLAQ MNAGWQFRDA GGRYPYRARP VGIPTLRDAL RAIPPAMPIV LDMKAVPAAP QAKAVADVLT SEAAWPRVTI YSTGAAYQTA FASYPQARLF ESRDATRGRL VDVLLGGACE RAPEAPATAP IWTGFEMHRN MTVSERFTLG EGVSPVKATL WTPATVACFR RRADVRILAI AVNDADDYRT AACLGLDAVL ADSPREMAEI RSALRARPLR CETGAR
|
| |