Gene BURPS668_2379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2379 
Symbol 
ID4882534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2349950 
End bp2351380 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content71% 
IMG OID640128307 
Producthydroxydechloroatrazine ethylaminohydrolase 
Protein accessionYP_001059411 
Protein GI126438750 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.750758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACGAC ACCCGAGCGC GCGAGCCGGC GCGCACTCCC TATCCCAGCC CCCCTCCCTT 
TCCCCGAACC GATCGAAGAC GCTCGTCGTC AAGCACGCCG ACGTGCTCGT GACGATGGAC
GGCGCGCGCC GCGAACTGCG CGATGCGGGC CTGTATGTCG AGGACAACCG GATCGTCGCG
GTCGGCCCGA GCGCCGAGTT GCCCGAGCAG GCGGACGAAG TGCTCGATCT GCGCGGGCAT
CTCGTGATCC CGGGGCTCGT CAATACGCAT CATCATATGT ATCAGAGCCT CACGCGCGCG
ATTCCCGCCG CGCAGAACGC CGAGCTGTTC GGCTGGCTCA CGAATCTATA CCGGATCTGG
GCGCATCTGA CGCCGGAGAT GATCGAGGTA TCGGCGCTGA CCGCGATGGC CGAGTTGCTG
CTGTCCGGCT GCACGACGTC GAGCGATCAT CTGTACATCT ATCCGAACGG CAGCCGGCTC
GACGACAGCA TCGCGGCCGC GCGGCGCATC GGCATGCGCT TTCACGCGAG CCGCGGCAGC
ATGAGCGTCG GGCAGCGCGA CGGCGGGTTG CCGCCCGATG CGGTCGTGGA GCGCGAGGCG
GACATCCTGC GCGATACGCA GCGCGTGATC GAGACCTACC ATGACGAAGG CCGCTATGCG
ATGCTGCGTG TCGCCGTCGC GCCGTGTTCG CCGTTCTCGG TGAGCCGCGG CCTGATGCGC
GACGCGGCGG CGCTCGCGCG CGAGCACCGC GTGTCGCTGC ACACGCACCT CGCGGAGAAC
GTGAACGACG TCGCGTACAG CCGCGAGAAG TTCGGGATGA CGCCGGCCGA GTATGCGGAG
GATCTCGGCT GGGTGGGGCG CGACGTGTGG CACGCGCATT GCGTGCGGCT CGACGAGCCC
GGCATCGCGC TTTTTGCGCG CACCGGCACG GGCGTCGCGC ATTGCCCTTG CTCGAACATG
CGGCTGGCGT CCGGGATTGC CCCCATCGCG CGAATGCGGC GCGCGGGCGT GCCGGTCGGG
CTCGGCGTCG ACGGTTGCGC GTCGAACGAC GGCGCGCAGA TGGTGGCCGA GGCGCGGCAG
GCGCTGCTGC TGCAGCGCGT CGGATTCGGG CCGGACGCGC TGAGCGCGCG CGACGCGCTC
GAGATCGCGA CGCTCGGCGG CGCGCGCGTG CTGAACCGCG ACGACATCGG CGCGCTCGCG
CCGGGCATGG CCGCGGATTT CGTCGCGTTC GACCTGCGCA CGCCGCAGTT CGCGGGCGCG
TTGCACGATC CCGTCGCGGC GCTCGTGTTC TGCGCACCGC CGCAGGCGGC GTACAGCGTC
GTCAACGGGC GCGTCGTCGT GCGGGAAGGG CGGCTGACGA CGCTCGAGAT CGAGCCGCTC
GTCGAGCGGC ACAACGCGCT GGCTCGCGCG CTTTGCGACG CGGCGCGCTG A
 
Protein sequence
MERHPSARAG AHSLSQPPSL SPNRSKTLVV KHADVLVTMD GARRELRDAG LYVEDNRIVA 
VGPSAELPEQ ADEVLDLRGH LVIPGLVNTH HHMYQSLTRA IPAAQNAELF GWLTNLYRIW
AHLTPEMIEV SALTAMAELL LSGCTTSSDH LYIYPNGSRL DDSIAAARRI GMRFHASRGS
MSVGQRDGGL PPDAVVEREA DILRDTQRVI ETYHDEGRYA MLRVAVAPCS PFSVSRGLMR
DAAALAREHR VSLHTHLAEN VNDVAYSREK FGMTPAEYAE DLGWVGRDVW HAHCVRLDEP
GIALFARTGT GVAHCPCSNM RLASGIAPIA RMRRAGVPVG LGVDGCASND GAQMVAEARQ
ALLLQRVGFG PDALSARDAL EIATLGGARV LNRDDIGALA PGMAADFVAF DLRTPQFAGA
LHDPVAALVF CAPPQAAYSV VNGRVVVREG RLTTLEIEPL VERHNALARA LCDAAR