Gene Information |
Locus tag | Mext_1697 |
Symbol | |
ID | 5832952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1916284 |
End bp | 1917648 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641367496 |
Product | hydroxydechloroatrazine ethylaminohydrolase |
Protein accession | YP_001639167 |
Protein GI | 163851124 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
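The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1916284
end_bp = 1917648
gene_length_bp = 1365
protein_length_aa = 454

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and is not translated.
coding_codons = codons - 1

print(span_bp == gene_length_bp)            # span matches the reported gene length
print(remainder == 0)                       # length is a multiple of three
print(coding_codons == protein_length_aa)   # codons minus stop match the protein length
```

Under these assumptions the three fields agree: 1365 bp is 455 codons, and 455 minus one stop codon gives 454 aa.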
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.101399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGAAT CGAGCACCCG CCCCCGCCGT CTCTGGCTCC GCGATCCGTT GGCGATCCTC GCGGACGGGG CCGGCGGCGG GCTCGTGGTA GAGGGCACCC GCATCGCCGA AGTGGTGGCC GCGGGCGCCC GGCCCACGAG CCCGGTCGAT GAGACGTTCG ACGCCTCGCG CCACGTCGTC ATCCCCGGTC TCGTCAACAC GCATCACCAC TTCTTCCAGA CGCTCACCCG CGCGCACCCG ATCGCGATCA ACAAGCCGCT CTTCCCCTGG CTGAAGGCGC TCTCGACCAT CTGGCCGCGG CTGACGCCGG ACGCCTTCCG GCTGGCGACG CGGCTCGCCT ACACGGAGCT TCTGCTGTCG GGCTGCACCA CGGCGGGCGA TCACCACTAC CTGTTCCCGA AAGGACTTGA GGCGGCCGTC GACATCCAGG TGGAGGAGGC GCGCAGCCTC GGCATCCGCG CCTTCGTCAC CCGCGGCTCG ATGAGCCTGT CGGAGAAGGA TGGCGGCCTG CCGCCCGAGA CACTGGTGCA GGACGACGAC ACGATCCTGG CCGACAGCGA GCGGGTGCTC GGCCTGTTCC ACGATCCCGA ACCCGGCGCG ATGGTGCAGA TCGGGCTGGC TCCGTGCTCG CCGTTCAACG TCACCAAGCG GCTCATGCGC CAGAGCGCCG CGCTGGCCGA GCGCCACGAT TGCCGGCTGC ACACCCATCT CGGCGAGACG CTCGACGAGA ACGCCTTCTG CCTGGAGGCG TTCGGGCAGC GCCCGGTCGA TTACCTCGAA GAGGTCGGCT GGATGGGGCC GCGGGCCTGG CTCGCCCACG GCATCCACTT CAACGACGAC GAGGTGAGGC GCCTCGGCGC CGCCGGCGTC GGGGTGTGCC ATTGCCCGGC CTCGAACATG GTGCTGGCCT CGGGCCAGTG CCGGACCTGC GAGCTGGAGG CGGCGGGCTC CCCCGTCGGC CTCGGTGTCG ACGGCTCGGC CTCGAGCGAC AGCTCGAATC TGATGGAGGG CGTGCGCCAC GCCCTGATGA TCAACCGCCT GACCTACGGC GCGGAAGCCG TGACCCATCT CGATGCCCTG CGTTGGGCGA CGGAGGGCTC GGCCGCCTGC CTCGGGCGCA AGGACATTGG CCGGATCGAG CCCGGCCGCG AAGCGGATCT GGCGCTGTTC ACCCTCGACG AACTCCGCTT CTCGGGCGCC CACGACCCGC TCGCGGCTTT GGTGCTGTGC GGCGCTCACC GGGCGGACCG GGTGATGGTG GCCGGGAACT GGCGGGTGAT CGACGGGGAG CCCGTCGGCA TCGAGACCGG GCGCCTGCGC GAGGAGCACG GCCGGCTGGC ACGCACCCTG TTCGGAACGG CGTGA
|
Protein sequence | MMESSTRPRR LWLRDPLAIL ADGAGGGLVV EGTRIAEVVA AGARPTSPVD ETFDASRHVV IPGLVNTHHH FFQTLTRAHP IAINKPLFPW LKALSTIWPR LTPDAFRLAT RLAYTELLLS GCTTAGDHHY LFPKGLEAAV DIQVEEARSL GIRAFVTRGS MSLSEKDGGL PPETLVQDDD TILADSERVL GLFHDPEPGA MVQIGLAPCS PFNVTKRLMR QSAALAERHD CRLHTHLGET LDENAFCLEA FGQRPVDYLE EVGWMGPRAW LAHGIHFNDD EVRRLGAAGV GVCHCPASNM VLASGQCRTC ELEAAGSPVG LGVDGSASSD SSNLMEGVRH ALMINRLTYG AEAVTHLDAL RWATEGSAAC LGRKDIGRIE PGREADLALF TLDELRFSGA HDPLAALVLC GAHRADRVMV AGNWRVIDGE PVGIETGRLR EEHGRLARTL FGTA
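Both sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first two blocks of the gene sequence, so its GC fraction is not expected to match the 71% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence above, used as a small demo.
sample = clean_sequence("ATGATGGAAT CGAGCACCCG")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.55
```

Passing the full gene sequence through `clean_sequence` and comparing `len(...)` with the Gene Length field, or `gc_fraction(...)` with the GC content field, is one way to check that a copied sequence is complete.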