Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3125 |
Symbol | |
ID | 4786638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3325734 |
End bp | 3327314 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640091696 |
Product | L-threonine ammonia-lyase |
Protein accession | YP_001022313 |
Protein GI | 124268309 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01124] threonine ammonia-lyase, biosynthetic, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00487268 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.103539 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCG CCGCGCCGCG CCCCGCCCCC CGCACTTCCG TCGCCAAACC CGCTCTCCGC AAGGCTGGGG CCAAGCCCGA CTACCTGCAG CGCATCCTGA CCGCGCGGGT CTACGACGTG GCCATCGAAT CGGCACTGCA GTTCGCACCG CAACTGTCGG AGCGGATGGG CAACCAGGTC TGGCTCAAGC GCGAGGATGA ACAGGCGGTC TTCAGCTTCA AGCTGCGCGG CGCCTACAAC AAGATGGCGC ACCTGAGCCC GCAGCAGCTG GGGCGCGGCG TGATCTGCGC TTCCGCCGGC AACCACGCTC AGGGGGTTGC CCTGTCGGCC CGCAAGCTCC GCTGCCAAGC GACGGTCGTG ATGCCGGTCA CCACGCCGCA ACTCAAGATC GAGGCTGTGC GGGCGCTCGG CGGCAAGGTC GTGTTGCACG GCGAGAGCTA TTCCGATGCC TATGCCCATG CACTGAAACT CGAACGGGAG CACGGGCAGA CCTTCGTGCA TCCTTTCGAC GATCCCGACG TGATCGCCGG TCAGGGCACC ATCGCCATGG AGATCCTGCG CCAGCACCAG GGGCCGATCG ACGCCGTGTT CGTGGCAGTG GGCGGTGGCG GGCTGATCTC GGGCATCGCT GCATACATCA AGGCGGTCCG GCCGGAGATC CAGGTGATCG GCGTCCAGAC CACCGATTCC GATGCGATGC TGCGTTCGGT GCGCGCTGGC AAGCGCGTGA CGCTGCACGA CGTGGGGCTG TTCTCCGACG GCACCGCCGT CAAGCTGGTG GGCGAAGAGA CCTTTCGCCT GACGAAGCAG TGGGTCGACG ATTTCGAAGT GGTCGACACC GACGCGGTCT GCGCGGCGAT CAAGGATGTG TTCCAGGACA CCCGCTCGAT CCTGGAGCCG GCCGGCGCGC TGGGCGTGGC GGCCATCAAA CAGTACACGG CACGCACCGG CTGCAAGGGC AAGACCTTCG TGGCGATCAC CTGCGGCGCC AACATGAACT TCGACCGATT GCGCTTCGTT GCCGAGCGGG CCGAGGTGGG CGAGGAGCGC GAGGCGCTGT TCGCCGTCAC TATCCCGGAG GAACGGGGTT CCTTCAAGCG CTTCTGCGAG TTGCTCGGCC CGCGCAGCGT CACGGAGTTC AACTACCGCA TCTCGGACGC GCAGACCGCG CAGGTGTTCG TGGGCCTGTC CACGCGGGAA CACGGCGAAT CGGCTCGCAT TGCCGAGCGG TTCGAGAAGC AGGGCTTCGC GACGGTCGAC CTGACACACG ACGAACTGGC CAAGACCCAT ATCCGCCACA TGGTCGGCGG CCGCTCCGAG CTGGCCCGCG AGGAGCGACT GTTCCGGTTC GTCTTCCCGG AGCGTCCGGG AGCGCTGATG CGCTTCCTGA CCCGCATGCA CCCCGACTGG AACATCAGCC TGTTCCATTA CCGGAATCAG GGTGCCGATT ACGGTCGCAT CCTTGTCGGC CTGCAGATTC CCCGCGGCGC GCAGCGCGCA CTGCGAGAGT TCCTGTCCAC ACTGGATTAC CCCTGCGTCG AGGAGACCGA CAACCCGGTC TACCAGCTCT TCCTGCGCTG A
|
Protein sequence | MARAAPRPAP RTSVAKPALR KAGAKPDYLQ RILTARVYDV AIESALQFAP QLSERMGNQV WLKREDEQAV FSFKLRGAYN KMAHLSPQQL GRGVICASAG NHAQGVALSA RKLRCQATVV MPVTTPQLKI EAVRALGGKV VLHGESYSDA YAHALKLERE HGQTFVHPFD DPDVIAGQGT IAMEILRQHQ GPIDAVFVAV GGGGLISGIA AYIKAVRPEI QVIGVQTTDS DAMLRSVRAG KRVTLHDVGL FSDGTAVKLV GEETFRLTKQ WVDDFEVVDT DAVCAAIKDV FQDTRSILEP AGALGVAAIK QYTARTGCKG KTFVAITCGA NMNFDRLRFV AERAEVGEER EALFAVTIPE ERGSFKRFCE LLGPRSVTEF NYRISDAQTA QVFVGLSTRE HGESARIAER FEKQGFATVD LTHDELAKTH IRHMVGGRSE LAREERLFRF VFPERPGALM RFLTRMHPDW NISLFHYRNQ GADYGRILVG LQIPRGAQRA LREFLSTLDY PCVEETDNPV YQLFLR
|
| |