Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3024 |
Symbol | |
ID | 5835455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3370219 |
End bp | 3371577 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641368824 |
Product | hydroxydechloroatrazine ethylaminohydrolase |
Protein accession | YP_001640484 |
Protein GI | 163852441 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.883439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCGG ATGGCACGGT CGCGCGGCGG CTCTGGATAC GCGATCCCCT CGCGATCCTG GCGGACGGAG CCGGCGGGGG CGTCGTCGTC GAGGGTAGCC GCATCGCCGA ACGCGTTCCG GCCGGCGGGC AGCCCGCCGC GCCGGTGGAC GAGACGTTCG ACGCCTCGCG GCACGTCGTC ATCCCCGGGC TCATCAACAC GCATCACCAC TTCTTCCAGA CGCTGACGCG GGCCCATCCG GCGGCGATCA ACAAGCCGCT CTTTCCCTGG CTGCAGGCGC TCTACGGGAT CTGGGGTCGG CTGACGCCAG AGGCGTTCCG GCTGGCGACG CGGCTCGCCT ATACCGAGCT GCTCCTGTCC GGCTGCACCA CGGCGGGCGA CCACCACTAC CTGTTCCCAA AGGGGCTGGA GAACGCCGTC GATATCCAGG TCGAGGAGGC GCGCTCCCTC GGTATCCGCG CCATCGTCAC CCGCGGCTCC ATGAGCCTGT CGCAGGATGA GGGCGGCCTT CCGCCAAAGG CGCTGGTGCA GGACGACGAC ACGATCCTCG CCGACAGCGA GCGGGTGCTG CGCCTGTTCC ATGATCCCGA ACCCGGCGCG ATGGTGCGGA TCGGGCTCGC TCCCTGCTCC CCCTTCGCCG TCACCAAGCG GCTGATGCGC GAGAGCGCCG TACTGGCCGA GCGCTACGAT TGCCCGCTTC ACACCCATCT CGGCGAAACT CGCGACGAGA ACGCCTTCTG CCTGGAGGCG TTCGGCCAGC GCCCGGTCGA TTACCTCGAA GAGGTCGGCT GGATGACGCG GCGGGCTTGG CTTGCCCACG GCATCCACTT CAACGACGAC GAGGTGCGCC GCCTCGGCGT GGCCGGCGTC GGGGTGTGCC ACTGCCCGAC CTCGAACATG GTGCTGGCCT CGGGCCATTG CCGCACCTGC GAGTTGGAGG CGGCGGGCTC CCCCGTCGGC CTTGGCGTCG ACGGCTCGGC CTCCAACGAC AGCTCGAACC TGATGGAGGG CGTGCGCCAC GCCCTGATGA TCAATCGCCT GACTTACGGC GCGGAGGCGG TGACCCATCT CGACGCCCTG CGTTGGGCGA CGGAGGGCTC GGCCGCCTGC CTCAACCGCA ACGACATCGG CCGGATCGAG CCCGGCCGCG AGGCGGATCT GGCCTTGTTC ATCCTGGACG AGCTGCGCTT CTCCGGCGCC CACGACCCGC TCGCGGCGCT TGTGCTGTGC GGGGCCCACC GGGCGGATCG GGTGATGGTC GCGGGCACGT GGCGGGTGAT CGACGGGCAA CCGCTCGGCA TCGAGACCGG ACGCCTGCGC GAGGAGCATT CGCGGCTCGC TCGGCACCTG TTCGGCTGA
|
Protein sequence | MAADGTVARR LWIRDPLAIL ADGAGGGVVV EGSRIAERVP AGGQPAAPVD ETFDASRHVV IPGLINTHHH FFQTLTRAHP AAINKPLFPW LQALYGIWGR LTPEAFRLAT RLAYTELLLS GCTTAGDHHY LFPKGLENAV DIQVEEARSL GIRAIVTRGS MSLSQDEGGL PPKALVQDDD TILADSERVL RLFHDPEPGA MVRIGLAPCS PFAVTKRLMR ESAVLAERYD CPLHTHLGET RDENAFCLEA FGQRPVDYLE EVGWMTRRAW LAHGIHFNDD EVRRLGVAGV GVCHCPTSNM VLASGHCRTC ELEAAGSPVG LGVDGSASND SSNLMEGVRH ALMINRLTYG AEAVTHLDAL RWATEGSAAC LNRNDIGRIE PGREADLALF ILDELRFSGA HDPLAALVLC GAHRADRVMV AGTWRVIDGQ PLGIETGRLR EEHSRLARHL FG
|
| |