Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2404 |
Symbol | |
ID | 5833846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2658490 |
End bp | 2660910 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641368203 |
Product | tryptophan halogenase |
Protein accession | YP_001639870 |
Protein GI | 163851827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGGGCG ACATCCGCGG AGCCGACAAC TCGATGTTCA GGTCCGAACG CTCCGCGCCT CCGCGCCTGG TCGTCGCCGG CGGCGGTCCC GCCAGCTGGA TGGCCGCGCT CTATCTGAAG CGCGTCCTGC GCCGGGCGGG CTGGACGGTC ACACTCGTCG GGTCGGCCCC GGCCGCGGGG GCCTCCGCAG CCCTGGCGAC GCGCCCGGCC TTCACCCGCT TCCTCAAGGG GTTCGGCATC GACGAGGCGA TCTTCATGCG CCGCTGCGCG GCGACCTATC GGCTCGCCAG CCGCTACGAC GACTGGTTCG CCGAGGGTCA GGGCCACTGG CATCCCTTCG GTGCCTGCGG CCCGCGGATC GCCGGCCGCG ACCTGTTCCA CTACTGGATG AAGCTTCGCG GCGAGGGCGC CGAGGACGAG GCCGGGCGCT ACGCCGATTA CGCGCCCCAG GCGCTGATGG CGGCCGAGGC GCGCGGACCC CGCCCCTTGG CGGACCGGTC CTCGCTCATC GATTCCGGCG ATTACGGCTA TCACCTCGAC CGAAAAGCTC TCGTCCGTTT CCTGCGCGAG TTGGCGCTTT CCGAAGGCGT GCGCGGGGTG ACCGGGCGGG TCCGCGAGAT CGAGCGCAAC CTGTACGGTG ATGTCGGCGC CCTCATCCTC GACGGAGGAC AGACGATCGA GGGCGACATC TTCCTCGATT GTACCGGCGC GGCCGCGCAG CTCATCGGCA CCGCGCTCGA CGAGGCCTGG GTCGCGGGCA GCGGCCCCGG AGACCGCGTG GCCCGCCTGT CCCTGCCGCG ATCGCCCGAG GTGCCGCCCT TCACCGCCTA TCGCGGCCGG GCAGAGGGAT GGACGGCGTC GCTGCCGCTG GCCGACCGCA CGGAGCATCT CTTCGTCTAC GACAGCCGGA CCACGCCGCC GGATGCGGCA GCGGCCACCC TGCGACGCGC CTTCGATGGG CAGGCGGACA TCGTCGATGC GGAACTGCGG CACGGTCGCC GCGCCTCACC GTGGCAGCGC AACGTCGTCG CCCTGGGTGC GGCGGCCGGA GCGGTCGAGC CGCTCCTCGG CTTCGATCTC GATCTCGTCC TGGCCGGACT CGAGACCTTC CTCGCTTACC TGCCGCGCCA GGGGGAGGGC GGTGACGTGC TGCGCCGTGC CTATGCCGCG CGCCTGAACC GCTTCCACGA CGATGCGGCC GAGGCGGTGG CCGCGCATTA CGTTCTCGGC CGCCGGTCAG AACCGTTCTG GGCGGCGGCC CGCGCGGTGA TGATCCCCGA CCGGCTCGCC GACCGCCTCG ACCTCTACGA GTTGGCCGGC CATGTCGCGT TGAGCGAGGA GGCGCCGTTC AGCGAGGGCG ATCACTACCT CCTGTTCTCC GGCGCCGATT TCCTGCCGCG TCGGCCCTTC GCTCCCGTCG ATGTCGCCGA CGGCCGTGAG ATCGCCCGTC TACTCGCCAG CATCCGTGCG CAGTCGGCCC AGATCGCCGG GGCGATGGCG CCGCATGTCC GTCTGATGGA CGCGTTGCAC GGCCAGGCCC CCGTCGCCGC GCCTGCGGTG CGCATCGCGG CGGCCTCGGC GCCGGCCGGC CCGGCGGCTC TGCGCAGAAC GCCGGAGGGC GCCCGTCTCG CCGATCTGGT GGCGGGGCTC GGCCAGCCCT TCGGCTACGA GCGCTCGGTG AAGGCCTCCC CGGCGGGATT GCAGACCGAC CGCTTCCTCA TGAGCCTGCA CCGCGCGAGC CTCGGCCTCA CACCTGAGAC GACGCTCGAC CGGCTGGCCG CCGGGCTCGG GTTGCCGGAG CGGGAGCGCG CGGAGGCCGC CGGCCTGATC GGCGGGGCGG ACATCCTCCA TCTCGGCTAC GAGGACGGCC CGTCGGGCGC GCTCTACAAG CTCTACGTCG AATGGTCGTC GCGCACCGAC GCGGCCTGGA CCGGGGCGGA CGAGGCGGGC GCGGAGCCGA TGCTGGTCCA TCGCGCCTAC AAGTGGAACC CTGGGGGAAC GCATCCGCCC GTCGTCACCC TCTACCACTG GCCATGGGTG CGCGGCCCCG AGGAGATCGA AGCACGGCTG ACCCGGATGA CCGGTGCGTG GGGGCCGGCC GGCGCGCCCG CGCGCGACAC CGCCCATGCG ATCCTGCTTC TGGCGCAGGA GAGGGGGCGC GGTGCGGTCC ACTATCTTGA AGCCCGCGAG GAGCCGGGGC TACGCTTGTC CTACGACCTC AACCTCTATG CCTGCGGCCT GACCGTGGCC GATGCGGAGC CGCTGCTCGG CCAAGCCTTC GTCGATCTCG GCGTGCCGCG CGACGCGGCC GCGATGGTTC TGCGAGAGCG GCTTCAGGAG ACGCTGGGGC ATGTCGCCGG CGGCGTCGGG CGGGACGGCC AGCCCTTCGT CACCGTGTAT TCCGGCGTGG CCGGCGGATG A
|
Protein sequence | MRGDIRGADN SMFRSERSAP PRLVVAGGGP ASWMAALYLK RVLRRAGWTV TLVGSAPAAG ASAALATRPA FTRFLKGFGI DEAIFMRRCA ATYRLASRYD DWFAEGQGHW HPFGACGPRI AGRDLFHYWM KLRGEGAEDE AGRYADYAPQ ALMAAEARGP RPLADRSSLI DSGDYGYHLD RKALVRFLRE LALSEGVRGV TGRVREIERN LYGDVGALIL DGGQTIEGDI FLDCTGAAAQ LIGTALDEAW VAGSGPGDRV ARLSLPRSPE VPPFTAYRGR AEGWTASLPL ADRTEHLFVY DSRTTPPDAA AATLRRAFDG QADIVDAELR HGRRASPWQR NVVALGAAAG AVEPLLGFDL DLVLAGLETF LAYLPRQGEG GDVLRRAYAA RLNRFHDDAA EAVAAHYVLG RRSEPFWAAA RAVMIPDRLA DRLDLYELAG HVALSEEAPF SEGDHYLLFS GADFLPRRPF APVDVADGRE IARLLASIRA QSAQIAGAMA PHVRLMDALH GQAPVAAPAV RIAAASAPAG PAALRRTPEG ARLADLVAGL GQPFGYERSV KASPAGLQTD RFLMSLHRAS LGLTPETTLD RLAAGLGLPE RERAEAAGLI GGADILHLGY EDGPSGALYK LYVEWSSRTD AAWTGADEAG AEPMLVHRAY KWNPGGTHPP VVTLYHWPWV RGPEEIEARL TRMTGAWGPA GAPARDTAHA ILLLAQERGR GAVHYLEARE EPGLRLSYDL NLYACGLTVA DAEPLLGQAF VDLGVPRDAA AMVLRERLQE TLGHVAGGVG RDGQPFVTVY SGVAGG
|
| |