Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2379 |
Symbol | |
ID | 4882534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2349950 |
End bp | 2351380 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640128307 |
Product | hydroxydechloroatrazine ethylaminohydrolase |
Protein accession | YP_001059411 |
Protein GI | 126438750 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.750758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGAC ACCCGAGCGC GCGAGCCGGC GCGCACTCCC TATCCCAGCC CCCCTCCCTT TCCCCGAACC GATCGAAGAC GCTCGTCGTC AAGCACGCCG ACGTGCTCGT GACGATGGAC GGCGCGCGCC GCGAACTGCG CGATGCGGGC CTGTATGTCG AGGACAACCG GATCGTCGCG GTCGGCCCGA GCGCCGAGTT GCCCGAGCAG GCGGACGAAG TGCTCGATCT GCGCGGGCAT CTCGTGATCC CGGGGCTCGT CAATACGCAT CATCATATGT ATCAGAGCCT CACGCGCGCG ATTCCCGCCG CGCAGAACGC CGAGCTGTTC GGCTGGCTCA CGAATCTATA CCGGATCTGG GCGCATCTGA CGCCGGAGAT GATCGAGGTA TCGGCGCTGA CCGCGATGGC CGAGTTGCTG CTGTCCGGCT GCACGACGTC GAGCGATCAT CTGTACATCT ATCCGAACGG CAGCCGGCTC GACGACAGCA TCGCGGCCGC GCGGCGCATC GGCATGCGCT TTCACGCGAG CCGCGGCAGC ATGAGCGTCG GGCAGCGCGA CGGCGGGTTG CCGCCCGATG CGGTCGTGGA GCGCGAGGCG GACATCCTGC GCGATACGCA GCGCGTGATC GAGACCTACC ATGACGAAGG CCGCTATGCG ATGCTGCGTG TCGCCGTCGC GCCGTGTTCG CCGTTCTCGG TGAGCCGCGG CCTGATGCGC GACGCGGCGG CGCTCGCGCG CGAGCACCGC GTGTCGCTGC ACACGCACCT CGCGGAGAAC GTGAACGACG TCGCGTACAG CCGCGAGAAG TTCGGGATGA CGCCGGCCGA GTATGCGGAG GATCTCGGCT GGGTGGGGCG CGACGTGTGG CACGCGCATT GCGTGCGGCT CGACGAGCCC GGCATCGCGC TTTTTGCGCG CACCGGCACG GGCGTCGCGC ATTGCCCTTG CTCGAACATG CGGCTGGCGT CCGGGATTGC CCCCATCGCG CGAATGCGGC GCGCGGGCGT GCCGGTCGGG CTCGGCGTCG ACGGTTGCGC GTCGAACGAC GGCGCGCAGA TGGTGGCCGA GGCGCGGCAG GCGCTGCTGC TGCAGCGCGT CGGATTCGGG CCGGACGCGC TGAGCGCGCG CGACGCGCTC GAGATCGCGA CGCTCGGCGG CGCGCGCGTG CTGAACCGCG ACGACATCGG CGCGCTCGCG CCGGGCATGG CCGCGGATTT CGTCGCGTTC GACCTGCGCA CGCCGCAGTT CGCGGGCGCG TTGCACGATC CCGTCGCGGC GCTCGTGTTC TGCGCACCGC CGCAGGCGGC GTACAGCGTC GTCAACGGGC GCGTCGTCGT GCGGGAAGGG CGGCTGACGA CGCTCGAGAT CGAGCCGCTC GTCGAGCGGC ACAACGCGCT GGCTCGCGCG CTTTGCGACG CGGCGCGCTG A
|
Protein sequence | MERHPSARAG AHSLSQPPSL SPNRSKTLVV KHADVLVTMD GARRELRDAG LYVEDNRIVA VGPSAELPEQ ADEVLDLRGH LVIPGLVNTH HHMYQSLTRA IPAAQNAELF GWLTNLYRIW AHLTPEMIEV SALTAMAELL LSGCTTSSDH LYIYPNGSRL DDSIAAARRI GMRFHASRGS MSVGQRDGGL PPDAVVEREA DILRDTQRVI ETYHDEGRYA MLRVAVAPCS PFSVSRGLMR DAAALAREHR VSLHTHLAEN VNDVAYSREK FGMTPAEYAE DLGWVGRDVW HAHCVRLDEP GIALFARTGT GVAHCPCSNM RLASGIAPIA RMRRAGVPVG LGVDGCASND GAQMVAEARQ ALLLQRVGFG PDALSARDAL EIATLGGARV LNRDDIGALA PGMAADFVAF DLRTPQFAGA LHDPVAALVF CAPPQAAYSV VNGRVVVREG RLTTLEIEPL VERHNALARA LCDAAR
|
| |