Gene Information |
Locus tag | BURPS668_A3025 |
Symbol | |
ID | 4886866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2872311 |
End bp | 2874002 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640132961 |
Product | putative lipoprotein |
Protein accession | YP_001064016 |
Protein GI | 126444305 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
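The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2872311
end_bp = 2874002
reported_gene_length = 1692      # bp
reported_protein_length = 563    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under those assumptions, both reported lengths agree with the coordinates: 1692 bp gives 564 codons, one of which is the stop codon.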
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.330625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGGACGA CGAAATCTCC CGGCATCTCC GTGATTCGGC AGCGGCTCGC CGTTGCGCTG CCGCTTGCGT TGACGCTCAC CCTCGCGAGC TGCGGCGGTG ACGATCTGAC GCCCGCCGCG CAGCGCTGGG CGATGCCCGG CACCGAACTG CCGCTCGGGC CGCAGGGCCT CGCGCAGAGC GTGTCGACGC AGACGCTCGC CGCAGGCGTC GCCTATTACC AGATCAAGCG CGGCGCGGCG AGCGCGGCCG ATTTCTGGAC CGTCAACCTC GGCTTCTACG CGACGCAGGC CGCGGCGCAG GCCGATGCGG CGAATCTCGC GGCGGCCGGC TTCGCGACGC GCGTCGACGC GTCGGCGGGC ACCGACCTGC AGGGCAAGGT GCTCGGCTAC TGGCTGTCGG CCGGCCGCTA CGCGACGCAG GCCGAGGCGA CGGCGGCCGC CGCACGCATC GCGCAGGCCA CGCAGAACCG CTACAAGCCG GGCACGCGGC ATACGTCGCT CGCCGGCGCG CCGACGACGG GGCCGTGGAT CGTCAACGTG CTCGCGATCG ACCCGTCGCG CGCCGGCGCG GCGCTGTCGC TCGCGCTGCC GGGCGGCGAC GATCTCGGTG CGGGCGGCGA GACGGTTTCG GCCGCGCGGG CGCGTGTGAA CGCGCTCGCC GGCGTCAACG GCGGCTTTTT CACGAACATC AATCCGTTCG GCGCGCCGCT GCCGCCGCGC TCGCCCGTCG GCGCGACGGT AGTCGACGGG CGGCTCGTCG CGGCAGCGAT CGGCAGGCGC CCCGGCCTGC TGCTCGCGCG CGACGCGAAC GGCCGCCAAC GCGCGACGGT CGTGCGCAAT CTCGCGACGG CGATCACGCT GACCGACGCG CAAGGCAGTG CGATCGCGGT CCAGACGCTG AACCGGCCGA TCCTCGGCAC GGTCGTCAAT TGCGGCGCGC AGGCGCGCAC GCCGACGAGC GAGCCGGCGC AGGACACGGT GTGCACGAAC GACGATGACC TCGTGATGTA CGACTCGCTA TATCTGCGCG GCGGTGCGTC GAACACGCTT GTCGACGCCG GCTACCAGGG CGCGCGATAC GAACTCGTGG TCGACGCGAA CGGCGCCGTC GTCGCCGGCC ATGCGACGCT CGGCGCGCCG CCGCCGCCGA ACGGCTACGT GCTGCAGGGG CTCGGCGCGA GCGCCGCGTG GCTGCAGGCG CATGCGACGC CGGGCACGCG CCTCGCGGTA TCGCGCCGGC TGTCGGCCGA CGGCGCGGAT CTCGCGCTCG CGTCGGGCAC GTCGCTCGTC GAGGCGGGGC CGACGCTGTC CGTGCCGAAT CTCGCGCAAA GCGCCGCGCA AGAGGGCTTC GCGCCGACGG TGGGCGGCGT CGACGCGGGC GAAGGCGCCG CGGCGAACGG CAACTGGTAC AACGGCTGGT ATGTCGCGCG CAATGGGCGC ACCGCGGCGG GCGTCGCGGC GGACGGCACG ATCCTGCTCG TCGAGATCGA CGGCCGGCAG CCCGCGTTGA GCGTCGGCAC GAGCATTCCG GAGACGGCGG CGGTGATGGC ATGGCTCGGT GCGACGTCGG CCGTCAATCT CGACGGCGGC GGCTCGAGCA ACATGGTGGT CGGCGGCAAG ATGGTCGGAC ATCCGTCCGA CGCCGTGGGC GAGCGGGGCG TCGGCGATAC GCTGATGCTG CTGCCGGGCT GA
|
Protein sequence | MRTTKSPGIS VIRQRLAVAL PLALTLTLAS CGGDDLTPAA QRWAMPGTEL PLGPQGLAQS VSTQTLAAGV AYYQIKRGAA SAADFWTVNL GFYATQAAAQ ADAANLAAAG FATRVDASAG TDLQGKVLGY WLSAGRYATQ AEATAAAARI AQATQNRYKP GTRHTSLAGA PTTGPWIVNV LAIDPSRAGA ALSLALPGGD DLGAGGETVS AARARVNALA GVNGGFFTNI NPFGAPLPPR SPVGATVVDG RLVAAAIGRR PGLLLARDAN GRQRATVVRN LATAITLTDA QGSAIAVQTL NRPILGTVVN CGAQARTPTS EPAQDTVCTN DDDLVMYDSL YLRGGASNTL VDAGYQGARY ELVVDANGAV VAGHATLGAP PPPNGYVLQG LGASAAWLQA HATPGTRLAV SRRLSADGAD LALASGTSLV EAGPTLSVPN LAQSAAQEGF APTVGGVDAG EGAAANGNWY NGWYVARNGR TAAGVAADGT ILLVEIDGRQ PALSVGTSIP ETAAVMAWLG ATSAVNLDGG GSSNMVVGGK MVGHPSDAVG ERGVGDTLML LPG
|
| |
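The gene sequence above begins with TTG, not ATG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The sketch below checks the first 30 and last 12 bases of the gene sequence against the two ends of the protein sequence. It assumes the sequence shown is already the coding strand (the record lists the gene on the minus strand). The function name `translate_cds`, its `has_start` parameter, and the `ALT_STARTS` set (a subset of the table 11 initiators) are illustrative choices, not part of the record.

```python
from itertools import product

# Amino acid assignments of the standard genetic code, which table 11 shares,
# listed in the usual T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiators are handled here.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate a coding-strand fragment, stopping at the first stop codon.

    Whitespace in the input is ignored. If has_start is True and the first
    codon is a recognised initiator, it is translated as methionine.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in ALT_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "TTGCGGACGA CGAAATCTCC CGGCATCTCC"  # start of the gene sequence
last_12 = "CTGCCGGGCT GA"                      # end of the gene sequence

print(translate_cds(first_30))                 # MRTTKSPGIS
print(translate_cds(last_12, has_start=False)) # LPG
```

Both fragments match the corresponding ends of the protein sequence (`MRTTKSPGIS` at the N-terminus, `LPG` at the C-terminus). Passing the full 1692 bp gene sequence to the same function would allow a complete comparison.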