Gene Bcep18194_A5267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A5267 
Symbol 
ID3750479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp2323081 
End bp2324493 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content68% 
IMG OID637763566 
Producthydroxydechloroatrazine ethylaminohydrolase 
Protein accessionYP_369505 
Protein GI78066736 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.294261 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTCG AGCAGCACGC TGGCGCACGC GCGCCGAACA CTTCCTCATC CCGCCCGAAA 
ACCCTGCTGG TCAAGCACGC CGACGTGCTC GTGACCATGG ACGACACGCG CCGCGAGCTG
CGCGACGCGG GGCTCTACAT CGAAGACAAC CGGATCGTCG CGGTCGGCCC GACCGCTGAG
CTGCCCGATA CGGCGGACGA AGTGCTCGAC CTGCGCGGCC ACCTCGTGAT TCCGGGCCTC
GTGAACACGC ATCACCACAT GTACCAGAGC CTGACGCGCG CGGTGCCTGC GGCGCAGAAC
GCCGAGCTGT TCGGCTGGCT CACGAACCTC TACAAGATCT GGGCGCACCT GACGCCCGAG
ATGATCGAAG TGTCGACGCT CACCGCGATG GCCGAGCTGC TGCAGTCGGG CTGCACGACG
TCGAGCGATC ATCTGTACAT CTACCCGAAC GGCAGCCGCC TCGACGACAG CATCGGCGCC
GCGCAGCGGA TCGGCATGCG TTTTCATGCG AGCCGCGGTG CGATGAGCGT CGGCCAGCGC
GATGGCGGGT TGCCGCCCGA TTCGGTCGTC GAGCGCGAGC CCGACATCCT GCGCGACACA
CAGCGCCTGA TCGAGACCTA TCACGATGAA GGCCGCTACG CGATGCTGCG CGTGGTCGTC
GCGCCGTGCT CGCCGTTCTC GGTGAGCCGC GACCTGATGC GCGACGCGGC CGTGCTCGCA
CGCGAATACG GCGTGTCGCT GCACACGCAC CTCGCGGAGA ACGTCAACGA CATCGCGTAC
AGCCGCGAGA AATTCGGGAT GACGCCCGCC GAATATGCGG AAGATCTCGG CTGGGTCGGC
CACGACGTGT GGCATGCGCA CTGCGTGCAG CTCGACGATG CGGGCATCAG CCTGTTCGCG
CGCACCGGTA CGGGCGTCGC GCACTGTCCG TGCTCGAACA TGCGGCTCGC GTCGGGTATC
GCGCCGGTGA AGAAGATGCG TCTCGCGGGC GTGCCGGTCG GCCTCGGCGT CGACGGCTCC
GCATCGAACG ACGGCGCGCA GATGGTGGCG GAAGTGCGGC AGGCGCTGCT GCTGCAGCGG
GTCGGTTTCG GGCCCGATGC GATGACCGCG CGTGAAGCGC TCGAAATCGC GACGCTCGGC
GGCGCGAAGG TGCTGAACCG TGACGATATC GGCGCGCTGA AGCCGGGCAT GGCCGCGGAC
TTCGCCGCAT TCGACCTGCG CCAGCCGCTG TTCGCGGGCG CGCTGCACGA TCCGGTCGCG
GCGCTCGTGT TCTGCGCGCC GTCGCAGACG GCGTACACGG TGGTGAACGG GAAGGTGGTG
GTGCGGGAAG GGCGTCTGGC GACGCTCGAC CTGCCGCCCG TTATCGCGCG TCACAACGCG
CTCGCGCAGG CACTGGTCGA GGCATCGCGC TGA
 
Protein sequence
MNLEQHAGAR APNTSSSRPK TLLVKHADVL VTMDDTRREL RDAGLYIEDN RIVAVGPTAE 
LPDTADEVLD LRGHLVIPGL VNTHHHMYQS LTRAVPAAQN AELFGWLTNL YKIWAHLTPE
MIEVSTLTAM AELLQSGCTT SSDHLYIYPN GSRLDDSIGA AQRIGMRFHA SRGAMSVGQR
DGGLPPDSVV EREPDILRDT QRLIETYHDE GRYAMLRVVV APCSPFSVSR DLMRDAAVLA
REYGVSLHTH LAENVNDIAY SREKFGMTPA EYAEDLGWVG HDVWHAHCVQ LDDAGISLFA
RTGTGVAHCP CSNMRLASGI APVKKMRLAG VPVGLGVDGS ASNDGAQMVA EVRQALLLQR
VGFGPDAMTA REALEIATLG GAKVLNRDDI GALKPGMAAD FAAFDLRQPL FAGALHDPVA
ALVFCAPSQT AYTVVNGKVV VREGRLATLD LPPVIARHNA LAQALVEASR