Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1256 |
Symbol | |
ID | 4886774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1186892 |
End bp | 1187863 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640131195 |
Product | IUNH family nucleoside hydrolase |
Protein accession | YP_001062253 |
Protein GI | 126442423 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGC ACAAGATCAT CTACGACACG GATCCGGGCG TTGACGATTC GATGGCGCTC GTGTTCCAGG CGCTGCATCC GGACATCGAG CTGCTCGGTG TGACGAGCGT GTTCGGCAAC GCGACGATCG ACACGACGAC CCGCAACGCG CTGTATCTCG CCGGCCGCTT CGCGCCCGGC GTGCCCGTCG CGCGCGGCGC GGCCGCGCCG CTGCGGCGGC CGGCGCCCGA GCCGCTCGGC GGCATTCACG GCGACGACGG CCTCGGCAAC ACGGGCCTGA GCGTGTCCGT CGATGTCGCG GCCGCGCCGA ACCTCGACGC GCGGCCCGCG CATCGCTTCA TCATCGACAC GGTGCGCGCG CATCCGCACG AAATCACGCT GCTTGCGGTG GGGCCGCTCA CGAATCTCGC GCACGCGCTC GCCGAGGATC CGCAAGTCGC GATGCTCGTC AAGCAGGTCG TCATCATGGG CGGGGCGTTC GGCACCGCGG GCGTGCTCGG CAACGTATCG CCCGCCGCCG AGGCGAACAT CGCGGGCGAT CCCGATGCGG CCGACATCGT GATGTCGGCG CCATGGCCGC TCGCGGTCGT CGGGCTCGAC GTCACGCAAG CCACGATCAT GACGACCGAG TATCTCGCCG CGCTGCGCGA CGACGCGGGC GAGGCCGGCC GCTTCGTATG GGACGTGTCG CGCCACTACG AGGCGTTCCA TCGCGCGAGC GCGGGGCTCG CCGGGATCTA CGTGCACGAT TCGTCGGCGG TCGCGTATGT GGTCGCGCCT CAGCTGTACA GGACGCGCAC GGGCCCGGTG CGCGTGCTGA CGAGCGGCAT CGCGGTCGGC GAGACGATCC AGAAGCCGGC GGCGATGACC GTGCCCGCGC CCGACTGGGA CGGACGGCCG CCGCGCGACG TGTGCGTCGG CGTCGACGCG AACGCGCTGC TCGCGCTCTA CCGCAAGACG CTCGTCGGCT GA
|
Protein sequence | MSLHKIIYDT DPGVDDSMAL VFQALHPDIE LLGVTSVFGN ATIDTTTRNA LYLAGRFAPG VPVARGAAAP LRRPAPEPLG GIHGDDGLGN TGLSVSVDVA AAPNLDARPA HRFIIDTVRA HPHEITLLAV GPLTNLAHAL AEDPQVAMLV KQVVIMGGAF GTAGVLGNVS PAAEANIAGD PDAADIVMSA PWPLAVVGLD VTQATIMTTE YLAALRDDAG EAGRFVWDVS RHYEAFHRAS AGLAGIYVHD SSAVAYVVAP QLYRTRTGPV RVLTSGIAVG ETIQKPAAMT VPAPDWDGRP PRDVCVGVDA NALLALYRKT LVG
|
| |