Gene Information |
Locus tag | BTH_I1825 |
Symbol | hutF |
ID | 3847451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2049403 |
End bp | 2050785 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637841494 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_442357 |
Protein GI | 83718915 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
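The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2049403
END_BP = 2050785


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1383 460
```

Under these assumptions the result agrees with the Gene Length (1383 bp) and Protein Length (460 aa) fields in the record.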
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.256581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGAA TCGATTCGAT GTTGTTTGCC GAGCACGCGT ACCTGCCCGG CGGCTGGCGG CGCGACGTGC TGCTGCGCTG GGACGCGACG GGCGCGCTCG TCGACGTGAG CGCGAACGCG CAAGCGCCCG CGGGCGTCGC GCGTGCGAAC GGGCCGCTCC TGCCCGGCAT GCCGAACCTG CATTCGCACG CGTTCCAGCG CGCGATGGCG GGGCTCACCG AATATCGCGC GAATCCGTCC GACACGTTCT GGAGCTGGCG CGACCTGATG TACCGCTTCG CGCTGAAGAT CACGCCCGAC GCGCTCGCCG CGGTCGCGCG CTGGCTCTAT GTCGAGATGC TCAAGTGCGG CTATACGTCG GTGTGCGAAT TCCATTACGT TCACCACGCG CCGGACGGCG CGCGCTATCC GCGGCCGGCG GAACTGGCGG CGCGCGTCGC CGGCGCGGCG CGGGATGCGG GCATCGGCAT CACGATGCTG CCCGTCGCGT ATCAGTACAG CGGCTTCGGC GAGCGCGCGC CGCGCGACGA TCAGCGCCGC TTCATCAACA CGCCGGACGC GCTGCTCGCG CTTCTCGACG CGCTGCGCGG CGAGTTGCCC GAGCATGGCG GGCTGCGCTA CGGAATCGCG CCGCATTCGC TGCGCGCGGT GTCGGAGAGC GGGCTGCGCG AACTGGTGGG CGCGATGCCG GCCGATGCGC CGGTGCACAT CCACATCGCC GAGCAGACCG CGGAAGTCGA CGATTGCGTG CGCGCGTACG GCGCGCGGCC CGTGCAATGG CTCCTCGATC GCTTCGACGT GGACGCGCGC TGGTGCCTCG TCCATGCGAC GCATCTCGAC GCGACGGAGA CGCAGGCGCT CGCGCGCAGC GGCGCGATCG CGGGCTTGTG TCCGACGACC GAGGCGAATC TCGGCGACGG CATCTTTCCG GCGGTCGAGT ATCTCGCGGC GGGCGGCATG ATCGGCATCG GCTCGGACAG CCACGCGAGC GTCGACTGGC GCGCCGAGTT GCGGCTGTTC GAATACGGGC AGCGGCTGGT GCGGCGCGAG CGCAACGTGC TTGCGGACGC GGCGCAGACT CACGTGGCGG ACCGCCTGTT CGCGGCGTCG CTCGCGGGCG GCGCGCGCGC GGCCGGGCGT CCCGTCGGCG CGCTCGAGGC GGGCCGGCGC GCCGACTGGA TCGTGCTCGA TCCCGCGCAT CCGTCGGTCG CCGAGCACGG CAGCGATACG TGGCTGTCGG GTGCGGTGTT CGCCGAGCAC GGCGACACGC CGGTGCTCGA CGTGTACGTC GGCGGCGAGC GGGTGGTGAG CGCGCGCCGG CATCGCGACG AAGAGGCGGC GTATGCCGGA TATCGCGCGG CGCTTGCGCA GCTATTGAAC TGA
|
Protein sequence | MTRIDSMLFA EHAYLPGGWR RDVLLRWDAT GALVDVSANA QAPAGVARAN GPLLPGMPNL HSHAFQRAMA GLTEYRANPS DTFWSWRDLM YRFALKITPD ALAAVARWLY VEMLKCGYTS VCEFHYVHHA PDGARYPRPA ELAARVAGAA RDAGIGITML PVAYQYSGFG ERAPRDDQRR FINTPDALLA LLDALRGELP EHGGLRYGIA PHSLRAVSES GLRELVGAMP ADAPVHIHIA EQTAEVDDCV RAYGARPVQW LLDRFDVDAR WCLVHATHLD ATETQALARS GAIAGLCPTT EANLGDGIFP AVEYLAAGGM IGIGSDSHAS VDWRAELRLF EYGQRLVRRE RNVLADAAQT HVADRLFAAS LAGGARAAGR PVGALEAGRR ADWIVLDPAH PSVAEHGSDT WLSGAVFAEH GDTPVLDVYV GGERVVSARR HRDEEAAYAG YRAALAQLLN
|
| |
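The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted, which this sketch does not model). All function names are hypothetical, and only the first three blocks of the gene sequence are used as input here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments; alternative start
# codons are NOT handled by this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGACGAGAA TCGATTCGAT GTTGTTTGCC")
print(translate(prefix))  # MTRIDSMLFA
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.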