Gene Mext_1697 details


Gene Information

Locus tag: Mext_1697
Symbol:
ID: 5832952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1916284
End bp: 1917648
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 71%
IMG OID: 641367496
Product: hydroxydechloroatrazine ethylaminohydrolase
Protein accession: YP_001639167
Protein GI: 163851124
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
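
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that check follows. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the record states neither convention explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length(1916284, 1917648)
print(length)                  # 1365
print(protein_length(length))  # 454
```

Both results match the Gene Length (1365 bp) and Protein Length (454 aa) fields.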


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.101399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGAAT CGAGCACCCG CCCCCGCCGT CTCTGGCTCC GCGATCCGTT GGCGATCCTC 
GCGGACGGGG CCGGCGGCGG GCTCGTGGTA GAGGGCACCC GCATCGCCGA AGTGGTGGCC
GCGGGCGCCC GGCCCACGAG CCCGGTCGAT GAGACGTTCG ACGCCTCGCG CCACGTCGTC
ATCCCCGGTC TCGTCAACAC GCATCACCAC TTCTTCCAGA CGCTCACCCG CGCGCACCCG
ATCGCGATCA ACAAGCCGCT CTTCCCCTGG CTGAAGGCGC TCTCGACCAT CTGGCCGCGG
CTGACGCCGG ACGCCTTCCG GCTGGCGACG CGGCTCGCCT ACACGGAGCT TCTGCTGTCG
GGCTGCACCA CGGCGGGCGA TCACCACTAC CTGTTCCCGA AAGGACTTGA GGCGGCCGTC
GACATCCAGG TGGAGGAGGC GCGCAGCCTC GGCATCCGCG CCTTCGTCAC CCGCGGCTCG
ATGAGCCTGT CGGAGAAGGA TGGCGGCCTG CCGCCCGAGA CACTGGTGCA GGACGACGAC
ACGATCCTGG CCGACAGCGA GCGGGTGCTC GGCCTGTTCC ACGATCCCGA ACCCGGCGCG
ATGGTGCAGA TCGGGCTGGC TCCGTGCTCG CCGTTCAACG TCACCAAGCG GCTCATGCGC
CAGAGCGCCG CGCTGGCCGA GCGCCACGAT TGCCGGCTGC ACACCCATCT CGGCGAGACG
CTCGACGAGA ACGCCTTCTG CCTGGAGGCG TTCGGGCAGC GCCCGGTCGA TTACCTCGAA
GAGGTCGGCT GGATGGGGCC GCGGGCCTGG CTCGCCCACG GCATCCACTT CAACGACGAC
GAGGTGAGGC GCCTCGGCGC CGCCGGCGTC GGGGTGTGCC ATTGCCCGGC CTCGAACATG
GTGCTGGCCT CGGGCCAGTG CCGGACCTGC GAGCTGGAGG CGGCGGGCTC CCCCGTCGGC
CTCGGTGTCG ACGGCTCGGC CTCGAGCGAC AGCTCGAATC TGATGGAGGG CGTGCGCCAC
GCCCTGATGA TCAACCGCCT GACCTACGGC GCGGAAGCCG TGACCCATCT CGATGCCCTG
CGTTGGGCGA CGGAGGGCTC GGCCGCCTGC CTCGGGCGCA AGGACATTGG CCGGATCGAG
CCCGGCCGCG AAGCGGATCT GGCGCTGTTC ACCCTCGACG AACTCCGCTT CTCGGGCGCC
CACGACCCGC TCGCGGCTTT GGTGCTGTGC GGCGCTCACC GGGCGGACCG GGTGATGGTG
GCCGGGAACT GGCGGGTGAT CGACGGGGAG CCCGTCGGCA TCGAGACCGG GCGCCTGCGC
GAGGAGCACG GCCGGCTGGC ACGCACCCTG TTCGGAACGG CGTGA
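
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The Python sketch below shows one way to strip that layout, compute a GC fraction, and run a basic CDS sanity check. The function names are hypothetical, the demo input is a toy sequence rather than the record's data, and requiring an ATG start is a simplification, since translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Simplified check: whole codons, ATG start, standard stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# Toy input in the same block layout (not the record's sequence).
toy = clean_sequence("ATGGCC GCGTGA")
print(len(toy), looks_like_cds(toy))
print(f"GC fraction: {gc_fraction(toy):.2f}")
```

Pasting the full gene sequence into `clean_sequence` should yield a string whose length and GC fraction can be compared with the Gene Length and GC content fields of the record.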
 
Protein sequence
MMESSTRPRR LWLRDPLAIL ADGAGGGLVV EGTRIAEVVA AGARPTSPVD ETFDASRHVV 
IPGLVNTHHH FFQTLTRAHP IAINKPLFPW LKALSTIWPR LTPDAFRLAT RLAYTELLLS
GCTTAGDHHY LFPKGLEAAV DIQVEEARSL GIRAFVTRGS MSLSEKDGGL PPETLVQDDD
TILADSERVL GLFHDPEPGA MVQIGLAPCS PFNVTKRLMR QSAALAERHD CRLHTHLGET
LDENAFCLEA FGQRPVDYLE EVGWMGPRAW LAHGIHFNDD EVRRLGAAGV GVCHCPASNM
VLASGQCRTC ELEAAGSPVG LGVDGSASSD SSNLMEGVRH ALMINRLTYG AEAVTHLDAL
RWATEGSAAC LGRKDIGRIE PGREADLALF TLDELRFSGA HDPLAALVLC GAHRADRVMV
AGNWRVIDGE PVGIETGRLR EEHGRLARTL FGTA