Gene Mnod_1111 details


Gene Information

Locus tag: Mnod_1111
Symbol:
ID: 7304423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 1178751
End bp: 1180127
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 72%
IMG OID: 643598859
Product: hydroxydechloroatrazine ethylaminohydrolase
Protein accession: YP_002496421
Protein GI: 220921120
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
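
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the gene length includes a terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length implied by a CDS length, assuming the CDS
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(1178751, 1180127)
print(length)                              # 1377
print(expected_protein_length_aa(length))  # 458
```

Both results match the Gene Length and Protein Length fields of the record.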


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCAGG ACGAGGCGGT GGCGGGAGCG GCGGGCCCGC GGCTGTGGTT GCGCGATCCG 
CTCGCCATCC TGGCCGAGGA GGCGGGCGGC GGCCTCGTGG TGGAGGGGAC CCGCATCGTC
GAGCGCGTGC CCGGCGGCGG CGCCCCCGCC TCCCCGGTGC ACGAGATCTT CGATGCCTCG
CGCCACGTCA TCCTCCCGGG CCTCGTCAAC ACCCATCACC ACGTCTTCCA GACCCTCACC
CGGGCCCATC CGGCGGCGAT CGACAAGCCG CTCTTCCCCT GGCTGAAGGC GCTCTATCCG
TACTGGGCCC GGCTGACTCC GGAGGCGTTC CGGCTGGCGA CGCGGCTCGC CTACACGGAA
CTGCTCCTCT CCGGCTGCAC CACCGCGGCC GATCATCACT ACCTGTTCCC GAGGGGCCTG
GAGGAGGCGG TCGACATCCA GGTCGCGGAG GCGCGCGCGC TCGGCATCCG GGCCTGCGTC
ACCCGCGGCT CGATGAGCCT GTCCGAGACC GAGGGCGGCC TGCCCCCCGA CAGCGTGACG
CAGGATCACG ACGCGATCCT CGCCGATTGC GAGCGGGTGC TGAACCTCTT CCACGACCGC
AGGCCCGGCG CGATGGTGCA GGTGGCGCTC AGCCCCTGCT CGCCCTTCGT GGTGACGAAG
CGCCTGATGC GCGAGAGCGC GGCGCTCGCC GAGGCGCATG ATTGCCGCCT GCATACGCAT
CTCGCCGAGA CCCGCGACGA GACCGACTAC TGCCTCGCGG CCTTCGGGCA GCGCCCGCTC
GACTATCTGG AGGAGGTCGG CTGGCTGTCG CCCAGGACGT GGCTGGCCCA CGGCATCCAT
TTCGACGATG CCGAGGTGGC ACGGCTCGGC CGCGCCGGCG TCGGCGTGTG CCATTGCCCG
ACCTCCAACA TGACGCTCGC CTCGGGCTTC TGCCGCACCT GCGAGCTCGA AGCGGCCGGA
AGCCCGGTCG GGCTCGGGGT CGACGGCTCG GCCTCAAACG ACGCCTCGAA CCTGATCGAG
GAGGTGCGCC ACGCCCTGAT GCTCAACCGG CTCACCTACG GGGCCGAGGC GGTGACGCAT
CGCGACGCCC TGCGCTGGGC CACCGAAGGC TCCGCCCGCT GCCTCGGGCG CGACGATATC
GGCCGCATCG CGGAGGGGCT GGAGGCCGAC CTCGCCCTGT TCACCCTCGA CGACCTGCGC
TTCTCCGGCA GCCACGATCC CCTGGCCGCG CTCGTCCTGT GCGGCGCGAG CCGGGCCGAC
CGGGTCATGG TGGCGGGCGC TTGGCGCGTC GTCGACGGGC AGCCGCTCGG GATCGACCTG
CGCGCGCTGC GGGAGGCGCA TGGGCGCATC GCCCGGGATC TCTTCGGGAT GGCTTGA
 
Protein sequence
MQQDEAVAGA AGPRLWLRDP LAILAEEAGG GLVVEGTRIV ERVPGGGAPA SPVHEIFDAS 
RHVILPGLVN THHHVFQTLT RAHPAAIDKP LFPWLKALYP YWARLTPEAF RLATRLAYTE
LLLSGCTTAA DHHYLFPRGL EEAVDIQVAE ARALGIRACV TRGSMSLSET EGGLPPDSVT
QDHDAILADC ERVLNLFHDR RPGAMVQVAL SPCSPFVVTK RLMRESAALA EAHDCRLHTH
LAETRDETDY CLAAFGQRPL DYLEEVGWLS PRTWLAHGIH FDDAEVARLG RAGVGVCHCP
TSNMTLASGF CRTCELEAAG SPVGLGVDGS ASNDASNLIE EVRHALMLNR LTYGAEAVTH
RDALRWATEG SARCLGRDDI GRIAEGLEAD LALFTLDDLR FSGSHDPLAA LVLCGASRAD
RVMVAGAWRV VDGQPLGIDL RALREAHGRI ARDLFGMA
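
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch of how the blocks could be joined, the GC content computed, and the gene translated. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for amino acid assignments; the helper names are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = ("ATGCAGCAGG ACGAGGCGGT GGCGGGAGCG "
              "GCGGGCCCGC GGCTGTGGTT GCGCGATCCG")
dna = clean_sequence(first_line)
print(translate(dna))  # MQQDEAVAGAAGPRLWLRDP
```

The translated prefix agrees with the first 20 residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_content` allows a comparison with the GC content field.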