Gene Mjls_3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3503 
Symbol 
ID4879214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3695017 
End bp3696048 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content70% 
IMG OID640140807 
Product2OG-Fe(II) oxygenase 
Protein accessionYP_001071771 
Protein GI126436080 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00994559 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.800786 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAGCG ATCGTGAACG CCGCTCGCCT CACCGGCGTC GACGGCCGAC AATGGCGCAC 
GTGCTGCCCG TCCTCGACCT CACCGACGCC GACACCGACC CGGCGGGTTT CCGCGCCCGG
TTGCGGGAGG CCGCCCACGA CGCCGGGTTC TTCTACCTCG TCGGGCACGG TGTGCCGGTC
GAGGGCTTCG AGCGGGTGCT GCGCCTGGCG CGTGACTTCT TCACCCAACC GCCGGAGCGA
AAGAACGAGA TCAGCCAACT GCTCAGCCCG CAGTTCCGCG GATACTCCCG GCTCGGTGGG
GAACTGACCA ACGGCACCGT GGACTGGCGC GAGCAGATCG ACATCGGACC GGAGCGCGAC
GTCATCGAGG GCGCCGAAGG CTACTGGCGG CTGCAGGGGC CGAACCTGTG GCCGGCGCAG
CCGCCGGGAT TCCGTGCCGC ATTCGAAGAG TGGGGGGCCG CGCTGTCGGA GGTGGGTGTG
CGGCTGCTGC GGCACTGGGC GGTGTCGCTC GGTGCGGCCG AGGACACCTT CGACGCAGCC
TTCGCCGACC GGCCCGCCAC GTTGATGAAG GTGGTGCGCT ATCCCGGCAC GACCCAGACG
GCGCAGGGTG TGGGCGCGCA CAAGGACTCC GGGGTGTTGA CGCTGCTGCT CGTCGAACCG
GGATCGGTCG GGCTGCAGGT CGAGTCGGGC CCCGACGAGT GGATCGACGT ACCGCCCCTT
CCCGGAGCCT TCATCGTCAA CATCGGGGAA CTGCTGGAGG TGGCGACGGG TGGGTACCTG
CGTGCCACCC GCCACCGTGT GCTCGCCCCG CCACCCGGCA CGGACCGCAT CTCGATCCCG
TTCTTCCTCA ACCCGGCCCT CGACGCGCTG ATCCCCATCC TGCCGTTGCC TCCGGAGCTG
GCTGTGCGCT CGCGGGGGGT GGAAACCGAC CCGGACAACC CGATCTTCAA CACTTACGGG
GAGAACGCGT GGAAGTCGCG CACCCGGGCG CATCCCGACG TCGCCGAACT GCATCACGGC
ATCACCCGGT GA
 
Protein sequence
MSSDRERRSP HRRRRPTMAH VLPVLDLTDA DTDPAGFRAR LREAAHDAGF FYLVGHGVPV 
EGFERVLRLA RDFFTQPPER KNEISQLLSP QFRGYSRLGG ELTNGTVDWR EQIDIGPERD
VIEGAEGYWR LQGPNLWPAQ PPGFRAAFEE WGAALSEVGV RLLRHWAVSL GAAEDTFDAA
FADRPATLMK VVRYPGTTQT AQGVGAHKDS GVLTLLLVEP GSVGLQVESG PDEWIDVPPL
PGAFIVNIGE LLEVATGGYL RATRHRVLAP PPGTDRISIP FFLNPALDAL IPILPLPPEL
AVRSRGVETD PDNPIFNTYG ENAWKSRTRA HPDVAELHHG ITR