Gene Mjls_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_2237 
Symbol 
ID4877957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp2332127 
End bp2333503 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content63% 
IMG OID640139534 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_001070514 
Protein GI126434823 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG AAACAACCGA AACCACCGGC ACAGCTGACG CGACCGATCC CTACCTGCGG 
CGCGCGCTGC GCGACGTAGC GGACGGGCTC AAGGTCGGGC GCTTACCGGC CCGCGTCGTC
AGCGATCCCG CGCTACACAC GATCGAGATG GAGCGGATCT TCGGGCGCGC CTGGGTGTTT
CTCGGACACG AGTCGGAGTT GGCCAAGTCC GGCGACTTCG TCGTGCGGCA CATCGGGGCC
GATTCGGTGA TCGTTTGCCG GGACAACTCC GGCCGCATCC AGGCGCTGTC CAATTCTTGT
CGCCACCGTG GTGCGCTCGT GTGCCGCGCT GAGATGGGAA ACACCGCGCA CTTCCAATGC
CCGTACCACG GCTGGGTGTA CAGCAACACC GGAGAGCTCG TCGGCGTGCC GGCGATGACG
GAGGCCTATC CCGGCGGCTT CGACAAGTCG CAGTGGGGAT TACGTCACAT CCCCCATGTC
GACTCGTACG CCGGATTCAT CTTCGGCAGC GTCGATCCGA AGGCGCCGAG CCTGACCGAC
TACCTCGGCG ACACGACGTT CTACCTCGAC CTCATTGCGA AGAAGACAGC GGGCGGTCTG
GAGGTGATAG GGGCACCGCA TCGATGGGTG ATGTCAGCGA ACTGGAAGAC AGCCGCCGAC
AATTTTGTCG GCGACTCCTA CCACACCCTC TTTGCTCACC GCTCGATGGT CGAGCTAGGC
ATGGCGCCCG GTGACCCAAA CTTCGCGAGC GCACCAGCGG AAATCTCGCT GCAGAACGGC
CACGGCGTCG GCGTACTCGG CTTTCCGCCC ACGCTCGCCG ATTTTCCCGA GTACGAGGGA
TACCCCGACG AAGTCGTCGA CCAGATGGCG ACGTCCTACC CGTCGCCGGT ACACAAGGAC
CTGATGCGAC GCTCATCCTT TATTCACGGC ACCGTGTTCC CGAATTTGTC GTTCATCAAC
GTGACCCTCG CGCAGGACCA CATGTCGCCC CCTACCCCCT TCATCACGTT CCGGGTATGG
CATCCGCTCT CCCATGATCG GATGGAGATC CTCTCCTGGT TCCTGGTCGA ACGCGATGCT
CCGGAATGGT TGCGCGATGC GTCCCAGGCG TCCTACGTCA ACAACTTCGG CCCAGGTGGG
GTTTTCGAAC AGGACGACGC CGAGGCATGG AAGGCCATCA CCGAATCTGT CCAGGGCCCG
TTCGCCGGTG AAGGCCTGCT GAACTACGAA ATGGGCATGG ACTTGACTCC GCTCACCGAC
TGGCCAGGGC CGGGAGAGGC CCTCCCGAGC GGGTACGCCG AGCAGAATCA GCGGCGGTTT
TGGGGGAGAT GGCTGGAATA CATGGGTCAG CCTCCCGCAT TCGGCGGGCG TGCTTGA
 
Protein sequence
MTTETTETTG TADATDPYLR RALRDVADGL KVGRLPARVV SDPALHTIEM ERIFGRAWVF 
LGHESELAKS GDFVVRHIGA DSVIVCRDNS GRIQALSNSC RHRGALVCRA EMGNTAHFQC
PYHGWVYSNT GELVGVPAMT EAYPGGFDKS QWGLRHIPHV DSYAGFIFGS VDPKAPSLTD
YLGDTTFYLD LIAKKTAGGL EVIGAPHRWV MSANWKTAAD NFVGDSYHTL FAHRSMVELG
MAPGDPNFAS APAEISLQNG HGVGVLGFPP TLADFPEYEG YPDEVVDQMA TSYPSPVHKD
LMRRSSFIHG TVFPNLSFIN VTLAQDHMSP PTPFITFRVW HPLSHDRMEI LSWFLVERDA
PEWLRDASQA SYVNNFGPGG VFEQDDAEAW KAITESVQGP FAGEGLLNYE MGMDLTPLTD
WPGPGEALPS GYAEQNQRRF WGRWLEYMGQ PPAFGGRA