Gene BURPS1710b_2526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2526 
SymbolatzB 
ID3688614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2797463 
End bp2798893 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content71% 
IMG OID637728982 
Producthydroxydechloroatrazine ethylaminohydrolase 
Protein accessionYP_333918 
Protein GI76811186 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0262801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACGAC ACCCGAGCGC GCGAGCCGGC GCGCACTCCC TATCCCAGCC CCCCTCCCTT 
TCCCCGAACC GATCGAAGAC GCTCGTCGTC AAGCACGCCG ACGTGCTCGT GACGATGGAC
GGCGCGCGCC GCGAACTGCG CGATGCGGGC CTGTATGTCG AGGACAACCG GATCGTCGCG
GTCGGCCCGA GCGCCGAGTT GCCCGAGCAG GCGGACGAAG TGCTCGATCT GCGCGGGCAT
CTCGTGATCC CGGGGCTCGT CAACACGCAT CATCATATGT ATCAGAGCCT CACGCGCGCG
ATTCCCGCCG CGCAGAACGC CGAGCTGTTC GGCTGGCTCA CGAATCTATA CCGGATCTGG
GCGCATCTGA CGCCGGAGAT GATCGAGGTA TCGGCGCTGA CCGCGATGGC CGAGCTGCTG
CTGTCCGGCT GCACGACGTC GAGCGATCAT CTGTACATCT ATCCGAACGG CAGCCGGCTC
GACGACAGCA TCGCGGCCGC GCGGCGCATC GGCATGCGCT TTCACGCGAG CCGCGGCAGC
ATGAGCGTCG GGCAGCGCGA CGGCGGGTTG CCGCCCGATG CGGTCGTCGA GCGCGAGGCG
GACATCCTGC GCGATACGCA GCGCGTGATC GAGACCTACC ATGACGAAGG CCGCTATGCG
ATGCTGCGTA TCGCCGTCGC GCCGTGTTCG CCGTTCTCGG TGAGCCGCGG CCTGATGCGC
GACGCGGCGG CGCTCGCGCG CGAGCACCGC GTGTCGCTGC ACACGCACCT CGCGGAGAAC
GTGAACGACG TCGCGTACAG CCGCGAGAAG TTCGGGATGA CGCCGGCCGA GTATGCGGAG
GATCTCGGCT GGGTGGGGCG CGACGTGTGG CACGCGCATT GCGTGCGGCT CGACGAGCCC
GGCATCGCGC TTTTTGCGCG CACCGGCACG GGCGTCGCGC ATTGCCCTTG CTCGAACATG
CGGCTGGCGT CCGGGATCGC CCCCATCGCG CGAATGCGGC GCGCGGGCGT GCCGGTCGGG
CTCGGCGTCG ACGGTTGCGC GTCGAACGAC GGCGCGCAGA TGGTGGCCGA GGCGCGGCAG
GCGCTGCTGC TGCAGCGCGT CGGATTCGGG CCGGACGCGC TGAGCGCGCG CGACGCGCTC
GAGATCGCGA CGCTCGGCGG CGCGCGCGTG CTGAACCGCG ACGACATCGG CGCGCTCGCG
CCGGGCATGG CCGCGGATTT CGTCGCGTTC GACCTGCGCA CGCCGCAGTT CGCGGGCGCG
CTGCACGATC CCGTCGCGGC GCTCGTGTTC TGCGCACCGC CGCAGGCGGC GTACAGCGTC
GTCAACGGGC GCGTCGTCGT GCGGGAAGGG CGGCTGACGA CGCTCGAGAT CGAGCCGCTC
GTCGAGCGGC ACAACGCGCT GGCTCGCGCG CTTTGTGACG CGGCGCGCTG A
 
Protein sequence
MERHPSARAG AHSLSQPPSL SPNRSKTLVV KHADVLVTMD GARRELRDAG LYVEDNRIVA 
VGPSAELPEQ ADEVLDLRGH LVIPGLVNTH HHMYQSLTRA IPAAQNAELF GWLTNLYRIW
AHLTPEMIEV SALTAMAELL LSGCTTSSDH LYIYPNGSRL DDSIAAARRI GMRFHASRGS
MSVGQRDGGL PPDAVVEREA DILRDTQRVI ETYHDEGRYA MLRIAVAPCS PFSVSRGLMR
DAAALAREHR VSLHTHLAEN VNDVAYSREK FGMTPAEYAE DLGWVGRDVW HAHCVRLDEP
GIALFARTGT GVAHCPCSNM RLASGIAPIA RMRRAGVPVG LGVDGCASND GAQMVAEARQ
ALLLQRVGFG PDALSARDAL EIATLGGARV LNRDDIGALA PGMAADFVAF DLRTPQFAGA
LHDPVAALVF CAPPQAAYSV VNGRVVVREG RLTTLEIEPL VERHNALARA LCDAAR