Gene BURPS668_2662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2662 
SymbolhutF 
ID4883035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2637009 
End bp2638391 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content72% 
IMG OID640128590 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_001059686 
Protein GI126438717 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAAA TCGATTCGAT GTTGTTTGCC GAGCAGGCGT ATCTGCCGGG CGGCTGGCGG 
CGCGACGTGC TGCTGCGCTG GAACGCGGCG GGCGCGCTCG TCGACGTGAG CGCGGATGCG
GCGGCGCCCG CGGGCGTCGC GCGCGCGAAC GGGCCGCTGT TGCCGGGCAT GCCGAACCTG
CACTCGCACG CGTTTCAGCG CGCGATGGCG GGGCTCACCG AATATCGCGC GAACCCGTCC
GACACGTTCT GGAGCTGGCG CGACCTGATG TACCGCTTCG CGCTGAAGAT CACGCCCGAC
GCGCTCGCCG CGGTCGCGCG CTGGCTCTAT GTCGAGATGC TGAAAAGCGG CTACACGTCG
GTGTGCGAAT TCCATTACGT TCATCACGCA CCGGACGGCG CGCGCTATGC GCGGCCCGCG
GAGCTGGCCG CGCGCGTGGT CGGCGCGGCG CGGGACGCGG GCATCGGCAT CACGATGCTG
CCGGTCGCGT ATCAGTACAG CGGCTTCGGC GAGCGCGCGC CGCGCGACGA CCAGCGTCGT
TTCATCAATA CGCCCGACGC GCTGCTCACG CTGGTCGACG CGTTGCGCGG CGAGTTGCCC
GAGCATGGCG GGCTGCGCTA CGGCATCGCG CCGCACTCGC TGCGCGCGGT GTCGGAAAGC
GGGCTGCGAG CGCTCGTTGG CGCGATGCCG GCCGATGCGC CGGTGCACAT CCATATCGCC
GAGCAGACCG CGGAAGTCGA CGATTGCGTG CGCGCGTACG GCGCGCGGCC GGTGCGGTGG
CTCCTCGAGC GCTTCGACGT GAACGCGCGC TGGTGCCTCG TGCACGCGAC GCATCTCGAC
GCCGCCGAGA CGCAGGCGCT CGCACGCAGC GGCGCGACCG CGGGCCTGTG CCCGACGACC
GAGGCGAATC TCGGCGACGG CATTTTTCCG GCGGTCGACT ATCTCGCGGC GGGCGGCGCG
ATCGGCATCG GTTCGGACAG CCACGCGAGC GTCGACTGGC GCGCCGAGTT GCGGCTCTTC
GAATACGGGC AGCGGCTCGT GCGGCGCGAG CGCAACGTGC TTGCCGACGC GGCGCAGACG
CGCGTGGCGG ACCGGCTGTT CGCGGCGTCG CTCGCGGGCG GCGCGCGCGC GGCCGGGCGT
GCCGTCGGCG CGCTCGAGGC GGGCCGACGC GCCGACTGGA TCGTGCTCGA TCCCGCGCAT
CCGTCGATCG CCGAGCACGG CAGCGACACG TGGCTGTCGG GCGCGGTGTT CGCCGAGCAC
GGCGACACGC CGGTGCTCGA CGTGTACGTC GGCGGCGAGC GGGTGGTGAA CGCGCGCAGG
CATCGCGACG AAGAAGCGGC GTATGCCGGA TATCGCGCGG CGCTCGCGCA ACTATTGAGC
TGA
 
Protein sequence
MTKIDSMLFA EQAYLPGGWR RDVLLRWNAA GALVDVSADA AAPAGVARAN GPLLPGMPNL 
HSHAFQRAMA GLTEYRANPS DTFWSWRDLM YRFALKITPD ALAAVARWLY VEMLKSGYTS
VCEFHYVHHA PDGARYARPA ELAARVVGAA RDAGIGITML PVAYQYSGFG ERAPRDDQRR
FINTPDALLT LVDALRGELP EHGGLRYGIA PHSLRAVSES GLRALVGAMP ADAPVHIHIA
EQTAEVDDCV RAYGARPVRW LLERFDVNAR WCLVHATHLD AAETQALARS GATAGLCPTT
EANLGDGIFP AVDYLAAGGA IGIGSDSHAS VDWRAELRLF EYGQRLVRRE RNVLADAAQT
RVADRLFAAS LAGGARAAGR AVGALEAGRR ADWIVLDPAH PSIAEHGSDT WLSGAVFAEH
GDTPVLDVYV GGERVVNARR HRDEEAAYAG YRAALAQLLS