Gene BBta_2050 details

Gene Information

Locus tag: BBta_2050
Symbol:
ID: 5153053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 2121006
End bp: 2122352
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 68%
IMG OID: 640556988
Product: hydroxydechloroatrazine ethylaminohydrolase
Protein accession: YP_001238144
Protein GI: 148253559
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
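
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention, but the numbers agree with both. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(2121006, 2122352)
print(length)                           # 1347
print(expected_protein_length(length))  # 448
```

Both results match the reported Gene Length (1347 bp) and Protein Length (448 aa).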


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.888241
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCA TGCCGATCTG GATCAAGGAC CCACTCGCCA TTCTGGCCGA GGGTGCCGGC 
CGCGGCCTCG TCGTCAAGGA CGGCCGCATC GTCGAGCTGG TGCCGGCCGG CGCCGAGCCC
GCGACCGCGG GCGCGGTTGC ATACGATGCC AGTGGACATG TCGTGATCCC CGGGCTGATC
AACACGCACC ACCATTTCTA CCAGACGCTG ACGCGTGCGC TGCCGGCGGC GATGGACCGC
GAGCTGTTCC CGTGGCTGAA GGCGCTGTAT CCGATCTGGG CGAAGCTGAC GCCGGAGGCG
CTCGACGCGG CGGTCACGGT CGCGATGGCC GAGCTGATGC TGTCGGGCTG CACCACGACG
ACGGATCATC ATTACGTGTT CCCTGCCGGT CTCGATGACG CTGTCGGGAT CGAAGTCGAG
GCCGCGAAGC GGCTCGGCAT CCGGGTGCTG TTGACCCGAG GCTCGATGAA TCTGTCCGAG
CGCGACGGCG GATTGCCTCC GGACAGTGTG GTGCAGGACG AGGATACGAT CCTCGCCGAT
AGCGAGCGCG TGGTCGCGCA GTTTCACCAG CGCGGGCCGG ATGCGATGGT GCAGATCGCG
CTGGCGCCGT GCTCGCCCTT CTCCGTCACG GGATCCTTGA TGCAGCAGAC GGCTGCCTTG
GCGGAGAAGC TCGACGTGCG CCTGCACACG CATCTGGCGG AGACCGAGGA CGAGAACCGA
TTCTGCGAAG CCATGTTCGG TTGTCGTCCG CTCGATTATC TCGAGAAACA TGGCTGGCTC
GGCCCGCGGA CCTGGCTCGC GCACGGCATC TTCTTCAACG CCGACGAAAT GAAGCGCCTC
GGCAAGGCCA AGACGACGAT CAGCCATTGC GCCTGCTCGA ACCAGCTGCT TGCCTCCGGA
GCTTGCCCGG TGTGCGAGAT GGAAGATGCC GGTGTCGGCA TCGGTATCGG CGTCGACGGC
TCTGCCTCCA ATGACGGCTC CAATCTGATG CAGGAGCTGC GGGCCGCGTT CCTGATGCAG
CGCGCCCGCT ACGGCGTCAG CCGGCTCAGC CACAAGGACG CGCTGCGCTG GGCGACGAAG
GGCTCGGCAG CCTGCGTCGG CCGTCCCGAG CTCGGCGAGA TCGCGGTCGG CAACGCCGCC
GATCTCGCGC TGTTCAAGCT CGACGAGCTG CGCTTCTCCG GCGCCAGCGA TCCGATCGCG
GCGCTGGTGC TGTGCGGCGC GCACCGCGCC GACCGCGTCA TGGTCGGCGG CCGCTGGACG
GTGATCGACG GCGCCATTCC GGGCCTCGAC GTCGCCGCGC TGATCCGGCG CCACAGCGCG
GCGGCAGAAC GGATGCGGGC CGGCTGA
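
A GC content figure like the one in the Gene Information section can be recomputed from the sequence text. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the unambiguous bases (A, C, G, T); the helper name `gc_percent` is hypothetical. Whitespace and line breaks from the 10-base block layout are ignored.

```python
def gc_percent(sequence_text: str) -> float:
    """Percentage of G/C among A, C, G, T characters; other characters are skipped."""
    bases = [ch for ch in sequence_text.upper() if ch in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for ch in bases if ch in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence (spaces and newlines included) in place of
# this short fragment, which is the first row of the listing above.
fragment = "ATGAGCACCA TGCCGATCTG GATCAAGGAC CCACTCGCCA TTCTGGCCGA GGGTGCCGGC"
print(round(gc_percent(fragment), 1))
```

Run on the complete sequence, the result can be compared with the GC content reported for the gene.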
 
Protein sequence
MSTMPIWIKD PLAILAEGAG RGLVVKDGRI VELVPAGAEP ATAGAVAYDA SGHVVIPGLI 
NTHHHFYQTL TRALPAAMDR ELFPWLKALY PIWAKLTPEA LDAAVTVAMA ELMLSGCTTT
TDHHYVFPAG LDDAVGIEVE AAKRLGIRVL LTRGSMNLSE RDGGLPPDSV VQDEDTILAD
SERVVAQFHQ RGPDAMVQIA LAPCSPFSVT GSLMQQTAAL AEKLDVRLHT HLAETEDENR
FCEAMFGCRP LDYLEKHGWL GPRTWLAHGI FFNADEMKRL GKAKTTISHC ACSNQLLASG
ACPVCEMEDA GVGIGIGVDG SASNDGSNLM QELRAAFLMQ RARYGVSRLS HKDALRWATK
GSAACVGRPE LGEIAVGNAA DLALFKLDEL RFSGASDPIA ALVLCGAHRA DRVMVGGRWT
VIDGAIPGLD VAALIRRHSA AAERMRAG
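
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI-style string for the standard code; translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons. Since this gene begins with ATG, no special start-codon handling is included, which is a simplifying assumption. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence_text: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence_text.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGAGCACCA TGCCGATCTG GATCAAGGAC"))  # MSTMPIWIKD
```

The printed residues match the first ten residues of the protein sequence above.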