Gene BURPS1106A_2422 details


Gene Information

Locus tag: BURPS1106A_2422
Symbol:
ID: 4901051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2383689
End bp: 2385191
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 70%
IMG OID: 640135650
Product: hydroxydechloroatrazine ethylaminohydrolase
Protein accession: YP_001066682
Protein GI: 126454025
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
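
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2383689, 2385191)
print(length)                     # 1503
print(protein_length_aa(length))  # 500
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.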


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTCGAGC GTTCGAACAT TCGAGCACAA GCGGCCGCCT CGGCGGCGCA ACGACACGGA 
GATCGAACGA CGATGGAACG ACACCCGAGC GCGCGAGCCG GCGCGCACTC CCTATCCCAG
CCCCCCTCCC TTTCCCCGAA CCGATCGAAG ACGCTCGTCG TCAAGCACGC CGACGTGCTC
GTGACGATGG ACGGCGCGCG CCGCGAACTG CGCGATGCGG GCCTGTATGT CGAGGACAAC
CGGATCGTCG CGGTCGGCCC GAGCGCCGAG TTGCCCGAGC AGGCGGACGA AGTGCTCGAT
CTGCGCGGGC ATCTCGTGAT CCCGGGGCTC GTCAACACGC ATCATCATAT GTATCAGAGC
CTCACGCGCG CGATTCCCGC CGCGCAGAAC GCCGAGCTGT TCGGCTGGCT CACGAATCTA
TACCGGATCT GGGCGCATCT GACGCCGGAG ATGATCGAGG TATCGGCGCT GACCGCGATG
GCCGAGCTGC TGCTGTCCGG CTGCACGACG TCGAGCGATC ATCTGTACAT CTATCCGAAC
GGCAGCCGGC TCGACGACAG CATCGCGGCC GCGCGGCGCA TCGGCATGCG CTTTCACGCG
AGCCGCGGCA GCATGAGCGT CGGGCAGCGC GACGGCGGGT TGCCGCCCGA TGCGGTCGTC
GAGCGCGAGG CGGACATCCT GCGCGATACG CAGCGCGTGA TCGAGACCTA CCATGACGAA
GGCCGCTATG CGATGCTGCG TGTCGCCGTC GCGCCGTGTT CGCCGTTCTC GGTGAGCCGC
GGCCTGATGC GCGACGCGGC GGCGCTCGCG CGCGAGCACC GCGTGTCGCT GCACACGCAC
CTAGCGGAGA ACGTGAACGA CGTCGCGTAC AGCCGCGAGA AGTTCGGGAT GACGCCGGCC
GAGTATGCGG AGGATCTCGG CTGGGTGGGG CGCGACGTGT GGCACGCGCA TTGCGTGCGG
CTCGACGAGC CCGGCATCGC GCTTTTTGCG CGCACCGGCA CGGGCGTCGC GCATTGCCCT
TGCTCGAACA TGCGGCTGGC GTCCGGGATT GCCCCCATCG CGCGAATGCG GCGCGCGGGC
GTGCCGGTCG GGCTCGGCGT CGACGGTTGT GCGTCGAACG ACGGCGCGCA GATGGTGGCC
GAGGCGCGGC AGGCGCTGCT GCTGCAGCGC GTCGGATTCG GGCCGGACGC GCTGAGCGCG
CGCGACGCGC TCGAGATCGC GACGCTCGGC GGCGCGCGCG TGCTGAACCG CGACGACATC
GGCGCGCTCG CGCCGGGCAT GGCCGCGGAT TTCGTCGCGT TCGACCTGCG CACGCCGCAG
TTCGCGGGCG CGCTGCACGA TCCCGTCGCG GCGCTCGTGT TCTGCGCACC GCCGCAGGCG
GCGTACAGCG TCGTCAACGG GCGCGTCGTC GTGCGGGAAG GGCGGCTGAC GACGCTCGAG
ATCGAGCCGC TCGTCGAGCG GCACAACGCG CTGGCTCGCG CGCTTTGTGA CGCGGCGCGC
TGA
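
A GC content figure like the one in the Gene Information section can be recomputed from a sequence listed in this blocked layout. The following is a minimal sketch, not the database's own method: it assumes GC content is the fraction of G and C bases over all bases, and the function name `gc_content` is hypothetical. Whitespace between the 10-base blocks is ignored.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace is ignored."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First 10-base block of the gene sequence above.
print(gc_content("TTGTTCGAGC"))  # 0.5
```

To apply it to the whole gene, paste all sequence lines into one string and pass that to the function.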
 
Protein sequence
MFERSNIRAQ AAASAAQRHG DRTTMERHPS ARAGAHSLSQ PPSLSPNRSK TLVVKHADVL 
VTMDGARREL RDAGLYVEDN RIVAVGPSAE LPEQADEVLD LRGHLVIPGL VNTHHHMYQS
LTRAIPAAQN AELFGWLTNL YRIWAHLTPE MIEVSALTAM AELLLSGCTT SSDHLYIYPN
GSRLDDSIAA ARRIGMRFHA SRGSMSVGQR DGGLPPDAVV EREADILRDT QRVIETYHDE
GRYAMLRVAV APCSPFSVSR GLMRDAAALA REHRVSLHTH LAENVNDVAY SREKFGMTPA
EYAEDLGWVG RDVWHAHCVR LDEPGIALFA RTGTGVAHCP CSNMRLASGI APIARMRRAG
VPVGLGVDGC ASNDGAQMVA EARQALLLQR VGFGPDALSA RDALEIATLG GARVLNRDDI
GALAPGMAAD FVAFDLRTPQ FAGALHDPVA ALVFCAPPQA AYSVVNGRVV VREGRLTTLE
IEPLVERHNA LARALCDAAR
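
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted initiation codon and is read as methionine at the first position. The sketch below illustrates this under stated assumptions: the codon assignments are those of the standard code, which table 11 shares, the start codon set is the one listed for table 11, and the name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid start codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGTTCGAGC GTTCGAACAT TCGAGCACAA"))  # MFERSNIRAQ
```

The output matches the first ten residues of the protein sequence above.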