Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_2791 |
Symbol | hutF |
ID | 3691398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 3096697 |
End bp | 3098079 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637729247 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_334175 |
Protein GI | 76812004 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAAA TCGATTCGAT GTTGTTTGCC GAGCAGGCGT ATCTGCCGGG CGGCTGGCGG CGCGACGTGC TGCTGCGCTG GAACGCGGCG GGCGCGCTCG TCGACGTGAG CGCGGATGCG GCGGCGCCCG CGGGCGTCGC GCGCGCGAAC GGGCCGCTGT TGCCGGGCAT GCCGAACCTG CACTCGCACG CGTTTCAGCG CGCGATGGCG GGGCTCACCG AATATCGCGC GAACCCGTCC GACACGTTCT GGAGCTGGCG CGACCTGATG TACCGCTTCG CGCTGAAGAT CACGCCCGAC GCGCTCGCCG CGGTCGCGCG CTGGCTCTAT GTCGAGATGC TGAAAAGCGG CTACACGTCG GTGTGCGAAT TCCATTACGT TCATCACGCG CCGGACGGCG CGCGCTATGC GCGGCCCGCG GAGCTGGCGG CGCGCGTGGT CGGCGCGGCG CGGGACGCGG GCATCGGCAT CACGATGCTG CCGGTCGCGT ATCAGTACAG CGGCTTCGGC GAGCGCGCGC CGCGCGACGA CCAGCGTCGT TTCATCAATA CGCCCGACGC GCTGCTCGCG CTGGTCGACG CGTTGCGCGG CGAGTTGCCC GAGCATGGCG GGCTGCGCTA CGGCATCGCG CCGCACTCGC TGCGCGCGGT GTCGGAAAGC GGGCTGCGAG CGCTCGTTGG CGCGATGCCG GCCGATGCGC CGGTGCACAT CCATATCGCC GAGCAGACCG CGGAAGTCGA CGATTGCATG CGCGCGTACG GCGCGCGGCC GGTGCGGTGG CTCCTCGAGC GCTTCGACGT GAACGCGCGC TGGTGCCTCG TGCACGCGAC GCATCTCGAC GCCGCCGAGA CGCAGGCGCT CGCACGCAGC GGCGCGACCG CGGGCCTGTG CCCGACGACC GAGGCGAATC TCGGCGACGG CATTTTTCCG GCGGTCGACT ATCTCGCGGC GGGCGGCGCG ATCGGCATCG GTTCGGACAG CCACGCGAGC GTCGACTGGC GCGCCGAGTT GCGGCTCTTC GAATACGGGC AGCGGCTCGT GCGGCGCGAG CGCAACGTGC TTGCCGACGC GGCGCAGACG CGCGTGGCGG ACCGGCTGTT CGCGGCGTCG CTCGCGGGCG GCGCGCGCGC GGCCGGGCGT GCCGTCGGCG CGCTCGAGGC GGGCCGACGC GCCGACTGGA TCGTGCTCGA TCCCGCGCAT CCGTCGATCG CCGAGCACGG CAGCGACACG TGGCTGTCGG GCGCGGTGTT CGCCGAGCAC GGCGACACGC CGGTGCTCGA CGTGTACGTC GGCGGCGAGC GGGTGGTGAA CGCGCGCAGG CATCGCGACG AAGAAGCGGC GTATGCCGGA TATCGCGCGG CGCTCGCGCA ACTATTGAGC TGA
|
Protein sequence | MTKIDSMLFA EQAYLPGGWR RDVLLRWNAA GALVDVSADA AAPAGVARAN GPLLPGMPNL HSHAFQRAMA GLTEYRANPS DTFWSWRDLM YRFALKITPD ALAAVARWLY VEMLKSGYTS VCEFHYVHHA PDGARYARPA ELAARVVGAA RDAGIGITML PVAYQYSGFG ERAPRDDQRR FINTPDALLA LVDALRGELP EHGGLRYGIA PHSLRAVSES GLRALVGAMP ADAPVHIHIA EQTAEVDDCM RAYGARPVRW LLERFDVNAR WCLVHATHLD AAETQALARS GATAGLCPTT EANLGDGIFP AVDYLAAGGA IGIGSDSHAS VDWRAELRLF EYGQRLVRRE RNVLADAAQT RVADRLFAAS LAGGARAAGR AVGALEAGRR ADWIVLDPAH PSIAEHGSDT WLSGAVFAEH GDTPVLDVYV GGERVVNARR HRDEEAAYAG YRAALAQLLS
|
| |