Gene BURPS1710b_2791 details


Gene Information

Locus tag: BURPS1710b_2791
Symbol: hutF
ID: 3691398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 3096697
End bp: 3098079
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 72%
IMG OID: 637729247
Product: N-formimino-L-glutamate deiminase
Protein accession: YP_334175
Protein GI: 76812004
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02022] formiminoglutamate deiminase
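
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3096697
END_BP = 3098079
GENE_LENGTH_BP = 1383
PROTEIN_LENGTH_AA = 460


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```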


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAAA TCGATTCGAT GTTGTTTGCC GAGCAGGCGT ATCTGCCGGG CGGCTGGCGG 
CGCGACGTGC TGCTGCGCTG GAACGCGGCG GGCGCGCTCG TCGACGTGAG CGCGGATGCG
GCGGCGCCCG CGGGCGTCGC GCGCGCGAAC GGGCCGCTGT TGCCGGGCAT GCCGAACCTG
CACTCGCACG CGTTTCAGCG CGCGATGGCG GGGCTCACCG AATATCGCGC GAACCCGTCC
GACACGTTCT GGAGCTGGCG CGACCTGATG TACCGCTTCG CGCTGAAGAT CACGCCCGAC
GCGCTCGCCG CGGTCGCGCG CTGGCTCTAT GTCGAGATGC TGAAAAGCGG CTACACGTCG
GTGTGCGAAT TCCATTACGT TCATCACGCG CCGGACGGCG CGCGCTATGC GCGGCCCGCG
GAGCTGGCGG CGCGCGTGGT CGGCGCGGCG CGGGACGCGG GCATCGGCAT CACGATGCTG
CCGGTCGCGT ATCAGTACAG CGGCTTCGGC GAGCGCGCGC CGCGCGACGA CCAGCGTCGT
TTCATCAATA CGCCCGACGC GCTGCTCGCG CTGGTCGACG CGTTGCGCGG CGAGTTGCCC
GAGCATGGCG GGCTGCGCTA CGGCATCGCG CCGCACTCGC TGCGCGCGGT GTCGGAAAGC
GGGCTGCGAG CGCTCGTTGG CGCGATGCCG GCCGATGCGC CGGTGCACAT CCATATCGCC
GAGCAGACCG CGGAAGTCGA CGATTGCATG CGCGCGTACG GCGCGCGGCC GGTGCGGTGG
CTCCTCGAGC GCTTCGACGT GAACGCGCGC TGGTGCCTCG TGCACGCGAC GCATCTCGAC
GCCGCCGAGA CGCAGGCGCT CGCACGCAGC GGCGCGACCG CGGGCCTGTG CCCGACGACC
GAGGCGAATC TCGGCGACGG CATTTTTCCG GCGGTCGACT ATCTCGCGGC GGGCGGCGCG
ATCGGCATCG GTTCGGACAG CCACGCGAGC GTCGACTGGC GCGCCGAGTT GCGGCTCTTC
GAATACGGGC AGCGGCTCGT GCGGCGCGAG CGCAACGTGC TTGCCGACGC GGCGCAGACG
CGCGTGGCGG ACCGGCTGTT CGCGGCGTCG CTCGCGGGCG GCGCGCGCGC GGCCGGGCGT
GCCGTCGGCG CGCTCGAGGC GGGCCGACGC GCCGACTGGA TCGTGCTCGA TCCCGCGCAT
CCGTCGATCG CCGAGCACGG CAGCGACACG TGGCTGTCGG GCGCGGTGTT CGCCGAGCAC
GGCGACACGC CGGTGCTCGA CGTGTACGTC GGCGGCGAGC GGGTGGTGAA CGCGCGCAGG
CATCGCGACG AAGAAGCGGC GTATGCCGGA TATCGCGCGG CGCTCGCGCA ACTATTGAGC
TGA
 
Protein sequence
MTKIDSMLFA EQAYLPGGWR RDVLLRWNAA GALVDVSADA AAPAGVARAN GPLLPGMPNL 
HSHAFQRAMA GLTEYRANPS DTFWSWRDLM YRFALKITPD ALAAVARWLY VEMLKSGYTS
VCEFHYVHHA PDGARYARPA ELAARVVGAA RDAGIGITML PVAYQYSGFG ERAPRDDQRR
FINTPDALLA LVDALRGELP EHGGLRYGIA PHSLRAVSES GLRALVGAMP ADAPVHIHIA
EQTAEVDDCM RAYGARPVRW LLERFDVNAR WCLVHATHLD AAETQALARS GATAGLCPTT
EANLGDGIFP AVDYLAAGGA IGIGSDSHAS VDWRAELRLF EYGQRLVRRE RNVLADAAQT
RVADRLFAAS LAGGARAAGR AVGALEAGRR ADWIVLDPAH PSIAEHGSDT WLSGAVFAEH
GDTPVLDVYV GGERVVNARR HRDEEAAYAG YRAALAQLLS
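
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The following is a minimal sketch, not the database's own method: it builds a codon lookup from the standard amino-acid assignments (which translation table 11 shares with the standard code; table 11 differs in which codons may act as starts, and this sketch does not model that), strips the 10-character block spacing used above, and translates until the first stop codon. For brevity it is run only on the first line of each sequence; the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein.
FIRST_DNA_LINE = (
    "ATGACGAAAA TCGATTCGAT GTTGTTTGCC GAGCAGGCGT ATCTGCCGGG CGGCTGGCGG"
)
FIRST_PROTEIN_BLOCKS = "MTKIDSMLFA EQAYLPGGWR"

if __name__ == "__main__":
    print(translate(FIRST_DNA_LINE) == clean(FIRST_PROTEIN_BLOCKS))
    print(f"GC fraction of first line: {gc_fraction(FIRST_DNA_LINE):.1%}")
```

Passing the full gene sequence to `gc_fraction` instead of the first line gives the value to compare against the GC content field; the first line alone is not expected to match the whole-gene figure.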