Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2422 |
Symbol | |
ID | 4901051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2383689 |
End bp | 2385191 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640135650 |
Product | hydroxydechloroatrazine ethylaminohydrolase |
Protein accession | YP_001066682 |
Protein GI | 126454025 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTCGAGC GTTCGAACAT TCGAGCACAA GCGGCCGCCT CGGCGGCGCA ACGACACGGA GATCGAACGA CGATGGAACG ACACCCGAGC GCGCGAGCCG GCGCGCACTC CCTATCCCAG CCCCCCTCCC TTTCCCCGAA CCGATCGAAG ACGCTCGTCG TCAAGCACGC CGACGTGCTC GTGACGATGG ACGGCGCGCG CCGCGAACTG CGCGATGCGG GCCTGTATGT CGAGGACAAC CGGATCGTCG CGGTCGGCCC GAGCGCCGAG TTGCCCGAGC AGGCGGACGA AGTGCTCGAT CTGCGCGGGC ATCTCGTGAT CCCGGGGCTC GTCAACACGC ATCATCATAT GTATCAGAGC CTCACGCGCG CGATTCCCGC CGCGCAGAAC GCCGAGCTGT TCGGCTGGCT CACGAATCTA TACCGGATCT GGGCGCATCT GACGCCGGAG ATGATCGAGG TATCGGCGCT GACCGCGATG GCCGAGCTGC TGCTGTCCGG CTGCACGACG TCGAGCGATC ATCTGTACAT CTATCCGAAC GGCAGCCGGC TCGACGACAG CATCGCGGCC GCGCGGCGCA TCGGCATGCG CTTTCACGCG AGCCGCGGCA GCATGAGCGT CGGGCAGCGC GACGGCGGGT TGCCGCCCGA TGCGGTCGTC GAGCGCGAGG CGGACATCCT GCGCGATACG CAGCGCGTGA TCGAGACCTA CCATGACGAA GGCCGCTATG CGATGCTGCG TGTCGCCGTC GCGCCGTGTT CGCCGTTCTC GGTGAGCCGC GGCCTGATGC GCGACGCGGC GGCGCTCGCG CGCGAGCACC GCGTGTCGCT GCACACGCAC CTAGCGGAGA ACGTGAACGA CGTCGCGTAC AGCCGCGAGA AGTTCGGGAT GACGCCGGCC GAGTATGCGG AGGATCTCGG CTGGGTGGGG CGCGACGTGT GGCACGCGCA TTGCGTGCGG CTCGACGAGC CCGGCATCGC GCTTTTTGCG CGCACCGGCA CGGGCGTCGC GCATTGCCCT TGCTCGAACA TGCGGCTGGC GTCCGGGATT GCCCCCATCG CGCGAATGCG GCGCGCGGGC GTGCCGGTCG GGCTCGGCGT CGACGGTTGT GCGTCGAACG ACGGCGCGCA GATGGTGGCC GAGGCGCGGC AGGCGCTGCT GCTGCAGCGC GTCGGATTCG GGCCGGACGC GCTGAGCGCG CGCGACGCGC TCGAGATCGC GACGCTCGGC GGCGCGCGCG TGCTGAACCG CGACGACATC GGCGCGCTCG CGCCGGGCAT GGCCGCGGAT TTCGTCGCGT TCGACCTGCG CACGCCGCAG TTCGCGGGCG CGCTGCACGA TCCCGTCGCG GCGCTCGTGT TCTGCGCACC GCCGCAGGCG GCGTACAGCG TCGTCAACGG GCGCGTCGTC GTGCGGGAAG GGCGGCTGAC GACGCTCGAG ATCGAGCCGC TCGTCGAGCG GCACAACGCG CTGGCTCGCG CGCTTTGTGA CGCGGCGCGC TGA
|
Protein sequence | MFERSNIRAQ AAASAAQRHG DRTTMERHPS ARAGAHSLSQ PPSLSPNRSK TLVVKHADVL VTMDGARREL RDAGLYVEDN RIVAVGPSAE LPEQADEVLD LRGHLVIPGL VNTHHHMYQS LTRAIPAAQN AELFGWLTNL YRIWAHLTPE MIEVSALTAM AELLLSGCTT SSDHLYIYPN GSRLDDSIAA ARRIGMRFHA SRGSMSVGQR DGGLPPDAVV EREADILRDT QRVIETYHDE GRYAMLRVAV APCSPFSVSR GLMRDAAALA REHRVSLHTH LAENVNDVAY SREKFGMTPA EYAEDLGWVG RDVWHAHCVR LDEPGIALFA RTGTGVAHCP CSNMRLASGI APIARMRRAG VPVGLGVDGC ASNDGAQMVA EARQALLLQR VGFGPDALSA RDALEIATLG GARVLNRDDI GALAPGMAAD FVAFDLRTPQ FAGALHDPVA ALVFCAPPQA AYSVVNGRVV VREGRLTTLE IEPLVERHNA LARALCDAAR
|
| |