Gene Mjls_5022 details


Gene Information

Locus tag: Mjls_5022
Symbol:
ID: 4880720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5261140
End bp: 5262297
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 65%
IMG OID: 640142332
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001073277
Protein GI: 126437586
COG category: [P] Inorganic ion transport and metabolism
              [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
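
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5261140
end_bp = 5262297

# Assumption: coordinates are inclusive, so length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1158
print(gene_length_bp % 3)       # 0 (whole number of codons)
print(expected_protein_length)  # 385
```

Both results agree with the reported gene length (1158 bp) and protein length (385 aa).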


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTACCG ACACCGCCCA CAGCGGCATT CGCGAGATCG ACACCGGAAC CCTGCCCGAC 
CGGTACGCCA GGGGCTGGCA CTGCCTCGGC CCGGTCAACG ACTACCTCGA CGGCGAACCG
CACTCCGTCG AGGCGTTCGG CACCAAACTC GTGGTGTTCG CCGATTCGAA GGGCGACGTC
AAAATCCTCG ACGGCTACTG CCGGCACATG GGCGGTGACC TGTCCCAGGG CACCATCAAG
GGTGACGAGG TCGCCTGCCC CTTCCACGAC TGGCGCTGGG GCGGCGACGG CAAGTGCAAG
CTCGTGCCCT ACGCCAAGCG GACGCCGCGG CTGGCCCGCA CCCGCGCCTG GACCACCGAC
GTGCGCAGCG GTCTGCTGTT CGTCTGGCAC GACCACGAGG GCAACCCGCC TCCCCCCGAG
GTGCGCATCC CCGAGATCCC GGAGTTCGCC AGCGACGAGT GGACCGACTG GCGGTGGAAC
TCGATCCTGA TCGAGGGCGC GAACTGCCGC GAGATCATCG ACAACGTCAC CGACATGGCG
CACTTCTTCT ACATCCACTT CGGGCTGCCC ACGTACTTCA AGAACGTGTT CGAGGGCCAC
ATCGCCAGCC AGTACCTGCA CAATGTGGGC CGCCCCGACG TCAACGACAT GGGCACCACC
TACGGCGAAG CGCACCTCGA CTCCGAGGCG TCGTATTTCG GGCCGTCGTT CATGATCAAC
TGGCTGCACA ACAACTACGG CGGCTACAAG GCCGAGTCCA TCCTGATCAA CTGCCACTAC
CCGGTGACCC AGGATTCGTT CGTGCTGCAG TGGGGCGTCA TCGTCGAGAA GCCCAAGGGC
ATGGACGAGA AGATGACCGA CAAGCTGGCG CGGACGTTCA CCGACGGCGT CAGCAAGGGC
TTCCTGCAGG ACGTCGAGAT CTGGAAGCAC AAGACGCGTA TCGACAATCC GCTGCTGGTC
GAAGAGGACG GCGCGGTCTA CCAGCTGCGC CGCTGGTATC AGCAGTTCTA CGTCGACGTC
GCCGACGTGA CCCCGGAGAT GACGGACCGT TTCGAGATCG AGGTCGACAC CACCGCGGCC
AACGAGTACT GGAACACCGA GGTTCAGGAG AATCTCGCGC GCCGCGAGGG CGAGAAAGCC
GAACAGCCGA CCCCATGA
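
The GC content field (65%) is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the helper name `gc_fraction` is hypothetical, and the example uses only the first 60 bases of the sequence, so its result is not expected to match the whole-gene figure exactly.

```python
def gc_fraction(seq):
    """Return the fraction of G and C bases, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)

# First line of the gene sequence, in its 10-base grouping.
first_line = (
    "GTGAGTACCG ACACCGCCCA CAGCGGCATT "
    "CGCGAGATCG ACACCGGAAC CCTGCCCGAC"
)

print(f"{100 * gc_fraction(first_line):.1f}")  # 66.7
```

Passing the full 1158 bp sequence to the same function gives the value to compare against the reported 65%.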
 
Protein sequence
MSTDTAHSGI REIDTGTLPD RYARGWHCLG PVNDYLDGEP HSVEAFGTKL VVFADSKGDV 
KILDGYCRHM GGDLSQGTIK GDEVACPFHD WRWGGDGKCK LVPYAKRTPR LARTRAWTTD
VRSGLLFVWH DHEGNPPPPE VRIPEIPEFA SDEWTDWRWN SILIEGANCR EIIDNVTDMA
HFFYIHFGLP TYFKNVFEGH IASQYLHNVG RPDVNDMGTT YGEAHLDSEA SYFGPSFMIN
WLHNNYGGYK AESILINCHY PVTQDSFVLQ WGVIVEKPKG MDEKMTDKLA RTFTDGVSKG
FLQDVEIWKH KTRIDNPLLV EEDGAVYQLR RWYQQFYVDV ADVTPEMTDR FEIEVDTTAA
NEYWNTEVQE NLARREGEKA EQPTP
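
The protein sequence begins with M although the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiating codon is read as methionine. The sketch below illustrates this on the first and last codons of the gene. It is not the database's own pipeline; the function name `translate_cds` is hypothetical, and the start-codon set is the one listed for NCBI table 11.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses these assignments; it differs from table 1 in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases of the gene sequence.
print(translate_cds("GTGAGTACCG ACACCGCCCA CAGCGGCATT"))  # MSTDTAHSGI
# Last 18 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("GAACAGCCGA CCCCATGA"))  # EQPTP
```

The two outputs match the first ten and last five residues of the protein sequence listed above.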