Gene Mjls_1639 details

Gene Information

Locus tag: Mjls_1639
Symbol:
ID: 4877366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 1742312
End bp: 1743703
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 59%
IMG OID: 640138940
Product: ring hydroxylating dioxygenase, alpha subunit
Protein accession: YP_001069923
Protein GI: 126434232
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
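
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1742312
end_bp = 1743703


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints: 1392 463
```

Both results agree with the Gene Length (1392 bp) and Protein Length (463 aa) fields.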


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.548645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCTC ACGTTCTAGG GGCACAAATC GATAGGAAGG TGCGCCCGGT GGATTCGATG 
GCGCCTGATG CGACGACAAT GCGAACCTTA GAGAATGCGC GCGGCTCCAT CCTAAAGGGT
CGCCTCCCTG CGTCTCTCAT CGCTAATGCA GCGCTTTACG AGCTTGAATT GAAGCGAGTA
TTTGGTAGGA CCTGGCAGTT TCTCTGCCAC GAAGACGAGA TCCCCAATGC GGGTGACTAT
GTAGTGCGCT ACATCGCTGA TAACTCAATT ATTGTCGCGC GGCAGCAGGA TATGACGATT
CGGGCGATGT CGAACTCGTG TCGGCACCGC GGCACGCTGC TTTGCCGAAC CGAGTCTGGG
AATGAGTCGG CGTTCCAGTG TCCGTACCAC GGTTGGACCT ATCGAAACAA CGGTGATCTC
ATCGCGATAC CTGCGCAGCA GGCAGTGTAC GGTGCTGCGT TCGACAAGAG TCGGCTAGGG
TTGCGCGCTC TGCCGATGCT GGACTCGTAC GCGGGCCTTG TCTTCGGGTG TGTGTCGGAT
GAGGCGCCGG GACTGGATGA GTACCTCGGG GACATGCGCT GGTATCTCGA CTTGATGATG
AAGAAGAGCC CGACCGGCCT TGAGGCGTGG GGTGCCCCGC AGCGTTGGGT GATTGACGCG
AACTGGAAGA CCGGCGCCGA TAACTTTGTT GGGGACGGCT ATCACACGGT CATGACGCAC
CGTTCGATGT GTGAGCTGGG GTTGTTACCG CCCGATAATG TGGCCGTTTC GCCGGCCCAC
GTCAGCCTAT CGGGCGGGCA CGGGGCGGGC GTTCTAGGCG CACCACCCGG CATACCCGCA
CCGCCGTACA TGGGCTATCC GGAGGAAGTC GTCTCCGGTC TCAGCGAGGG TTACGGCGAT
GACGTCCATG GCGAGTTGCT GAAACGGACG ATGTTCATTC ATGGCAATGT GTTCCCGAAC
TTGTCCTTCT TGAACGCCTT CATCGCCAAG GACGGGGAGT CTATGCCGGT GCCCATTCTG
ACCTTGCGGC AATGGCGTCC CTTGGACGCA GCGCGTATGG AGGTGTGGTC GTGGTTCTTC
GTGGAGCGCA ACGCGCCCGA AGAGTTCAAG CAGCAGTCGT TTGAGACTTA TGTTCGGACG
TTCGGGGTCG GGGGTGTCTT CGAGCAGGAT GACGCCGAGA TATTCCAGGC TATTACCAAG
GGAACACGCG GCGAGTTGGC TGGTGGTGTG GAGCTGAACC TGGAGATGGG ACTGGACAAT
CTGGCTCCTG ATCCAACGTG GCTGGGCCCG GGACGACCGT TGGCCAGTGG CTACGCCGAA
CAGAATCAGC GCGAGTACTG GAAGCAGTAC TTCGACTATC TGGCCACACC GAGAAGGGAT
GAGAACGTAT GA
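
The gene sequence is printed in space-separated blocks of ten bases, so the grouping has to be stripped before it can be measured. As a minimal sketch (the helper names are hypothetical), the following removes the whitespace and computes the GC fraction. Only the first line of the listing is used here to keep the example short; pasting the full listing in its place should allow a comparison with the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGAGTGCTC ACGTTCTAGG GGCACAAATC GATAGGAAGG TGCGCCCGGT GGATTCGATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```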
 
Protein sequence
MSAHVLGAQI DRKVRPVDSM APDATTMRTL ENARGSILKG RLPASLIANA ALYELELKRV 
FGRTWQFLCH EDEIPNAGDY VVRYIADNSI IVARQQDMTI RAMSNSCRHR GTLLCRTESG
NESAFQCPYH GWTYRNNGDL IAIPAQQAVY GAAFDKSRLG LRALPMLDSY AGLVFGCVSD
EAPGLDEYLG DMRWYLDLMM KKSPTGLEAW GAPQRWVIDA NWKTGADNFV GDGYHTVMTH
RSMCELGLLP PDNVAVSPAH VSLSGGHGAG VLGAPPGIPA PPYMGYPEEV VSGLSEGYGD
DVHGELLKRT MFIHGNVFPN LSFLNAFIAK DGESMPVPIL TLRQWRPLDA ARMEVWSWFF
VERNAPEEFK QQSFETYVRT FGVGGVFEQD DAEIFQAITK GTRGELAGGV ELNLEMGLDN
LAPDPTWLGP GRPLASGYAE QNQREYWKQY FDYLATPRRD ENV
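
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a hand-rolled illustration rather than the database's own procedure: it builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). The `translate` helper is a hypothetical name.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGAGTGCTC ACGTTCTAGG GGCACAAATC GATAGGAAGG TGCGCCCGGT GGATTCGATG"))
print(translate("GAGAACGTAT GA"))
```

The two printed fragments correspond to the first twenty residues (MSAHVLGAQI DRKVRPVDSM) and the last three residues (ENV) of the protein sequence.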