Gene Mext_2404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2404 
Symbol 
ID5833846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2658490 
End bp2660910 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content73% 
IMG OID641368203 
Producttryptophan halogenase 
Protein accessionYP_001639870 
Protein GI163851827 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGGGCG ACATCCGCGG AGCCGACAAC TCGATGTTCA GGTCCGAACG CTCCGCGCCT 
CCGCGCCTGG TCGTCGCCGG CGGCGGTCCC GCCAGCTGGA TGGCCGCGCT CTATCTGAAG
CGCGTCCTGC GCCGGGCGGG CTGGACGGTC ACACTCGTCG GGTCGGCCCC GGCCGCGGGG
GCCTCCGCAG CCCTGGCGAC GCGCCCGGCC TTCACCCGCT TCCTCAAGGG GTTCGGCATC
GACGAGGCGA TCTTCATGCG CCGCTGCGCG GCGACCTATC GGCTCGCCAG CCGCTACGAC
GACTGGTTCG CCGAGGGTCA GGGCCACTGG CATCCCTTCG GTGCCTGCGG CCCGCGGATC
GCCGGCCGCG ACCTGTTCCA CTACTGGATG AAGCTTCGCG GCGAGGGCGC CGAGGACGAG
GCCGGGCGCT ACGCCGATTA CGCGCCCCAG GCGCTGATGG CGGCCGAGGC GCGCGGACCC
CGCCCCTTGG CGGACCGGTC CTCGCTCATC GATTCCGGCG ATTACGGCTA TCACCTCGAC
CGAAAAGCTC TCGTCCGTTT CCTGCGCGAG TTGGCGCTTT CCGAAGGCGT GCGCGGGGTG
ACCGGGCGGG TCCGCGAGAT CGAGCGCAAC CTGTACGGTG ATGTCGGCGC CCTCATCCTC
GACGGAGGAC AGACGATCGA GGGCGACATC TTCCTCGATT GTACCGGCGC GGCCGCGCAG
CTCATCGGCA CCGCGCTCGA CGAGGCCTGG GTCGCGGGCA GCGGCCCCGG AGACCGCGTG
GCCCGCCTGT CCCTGCCGCG ATCGCCCGAG GTGCCGCCCT TCACCGCCTA TCGCGGCCGG
GCAGAGGGAT GGACGGCGTC GCTGCCGCTG GCCGACCGCA CGGAGCATCT CTTCGTCTAC
GACAGCCGGA CCACGCCGCC GGATGCGGCA GCGGCCACCC TGCGACGCGC CTTCGATGGG
CAGGCGGACA TCGTCGATGC GGAACTGCGG CACGGTCGCC GCGCCTCACC GTGGCAGCGC
AACGTCGTCG CCCTGGGTGC GGCGGCCGGA GCGGTCGAGC CGCTCCTCGG CTTCGATCTC
GATCTCGTCC TGGCCGGACT CGAGACCTTC CTCGCTTACC TGCCGCGCCA GGGGGAGGGC
GGTGACGTGC TGCGCCGTGC CTATGCCGCG CGCCTGAACC GCTTCCACGA CGATGCGGCC
GAGGCGGTGG CCGCGCATTA CGTTCTCGGC CGCCGGTCAG AACCGTTCTG GGCGGCGGCC
CGCGCGGTGA TGATCCCCGA CCGGCTCGCC GACCGCCTCG ACCTCTACGA GTTGGCCGGC
CATGTCGCGT TGAGCGAGGA GGCGCCGTTC AGCGAGGGCG ATCACTACCT CCTGTTCTCC
GGCGCCGATT TCCTGCCGCG TCGGCCCTTC GCTCCCGTCG ATGTCGCCGA CGGCCGTGAG
ATCGCCCGTC TACTCGCCAG CATCCGTGCG CAGTCGGCCC AGATCGCCGG GGCGATGGCG
CCGCATGTCC GTCTGATGGA CGCGTTGCAC GGCCAGGCCC CCGTCGCCGC GCCTGCGGTG
CGCATCGCGG CGGCCTCGGC GCCGGCCGGC CCGGCGGCTC TGCGCAGAAC GCCGGAGGGC
GCCCGTCTCG CCGATCTGGT GGCGGGGCTC GGCCAGCCCT TCGGCTACGA GCGCTCGGTG
AAGGCCTCCC CGGCGGGATT GCAGACCGAC CGCTTCCTCA TGAGCCTGCA CCGCGCGAGC
CTCGGCCTCA CACCTGAGAC GACGCTCGAC CGGCTGGCCG CCGGGCTCGG GTTGCCGGAG
CGGGAGCGCG CGGAGGCCGC CGGCCTGATC GGCGGGGCGG ACATCCTCCA TCTCGGCTAC
GAGGACGGCC CGTCGGGCGC GCTCTACAAG CTCTACGTCG AATGGTCGTC GCGCACCGAC
GCGGCCTGGA CCGGGGCGGA CGAGGCGGGC GCGGAGCCGA TGCTGGTCCA TCGCGCCTAC
AAGTGGAACC CTGGGGGAAC GCATCCGCCC GTCGTCACCC TCTACCACTG GCCATGGGTG
CGCGGCCCCG AGGAGATCGA AGCACGGCTG ACCCGGATGA CCGGTGCGTG GGGGCCGGCC
GGCGCGCCCG CGCGCGACAC CGCCCATGCG ATCCTGCTTC TGGCGCAGGA GAGGGGGCGC
GGTGCGGTCC ACTATCTTGA AGCCCGCGAG GAGCCGGGGC TACGCTTGTC CTACGACCTC
AACCTCTATG CCTGCGGCCT GACCGTGGCC GATGCGGAGC CGCTGCTCGG CCAAGCCTTC
GTCGATCTCG GCGTGCCGCG CGACGCGGCC GCGATGGTTC TGCGAGAGCG GCTTCAGGAG
ACGCTGGGGC ATGTCGCCGG CGGCGTCGGG CGGGACGGCC AGCCCTTCGT CACCGTGTAT
TCCGGCGTGG CCGGCGGATG A
 
Protein sequence
MRGDIRGADN SMFRSERSAP PRLVVAGGGP ASWMAALYLK RVLRRAGWTV TLVGSAPAAG 
ASAALATRPA FTRFLKGFGI DEAIFMRRCA ATYRLASRYD DWFAEGQGHW HPFGACGPRI
AGRDLFHYWM KLRGEGAEDE AGRYADYAPQ ALMAAEARGP RPLADRSSLI DSGDYGYHLD
RKALVRFLRE LALSEGVRGV TGRVREIERN LYGDVGALIL DGGQTIEGDI FLDCTGAAAQ
LIGTALDEAW VAGSGPGDRV ARLSLPRSPE VPPFTAYRGR AEGWTASLPL ADRTEHLFVY
DSRTTPPDAA AATLRRAFDG QADIVDAELR HGRRASPWQR NVVALGAAAG AVEPLLGFDL
DLVLAGLETF LAYLPRQGEG GDVLRRAYAA RLNRFHDDAA EAVAAHYVLG RRSEPFWAAA
RAVMIPDRLA DRLDLYELAG HVALSEEAPF SEGDHYLLFS GADFLPRRPF APVDVADGRE
IARLLASIRA QSAQIAGAMA PHVRLMDALH GQAPVAAPAV RIAAASAPAG PAALRRTPEG
ARLADLVAGL GQPFGYERSV KASPAGLQTD RFLMSLHRAS LGLTPETTLD RLAAGLGLPE
RERAEAAGLI GGADILHLGY EDGPSGALYK LYVEWSSRTD AAWTGADEAG AEPMLVHRAY
KWNPGGTHPP VVTLYHWPWV RGPEEIEARL TRMTGAWGPA GAPARDTAHA ILLLAQERGR
GAVHYLEARE EPGLRLSYDL NLYACGLTVA DAEPLLGQAF VDLGVPRDAA AMVLRERLQE
TLGHVAGGVG RDGQPFVTVY SGVAGG