Gene Mjls_3574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3574 
Symbol 
ID4879285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3768683 
End bp3769708 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content68% 
IMG OID640140878 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001071842 
Protein GI126436151 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAC CGCCGTTGTC GATGAAGCCG ACCGGCTGGT TCTGCGTGGC GTGGTCGGAC 
GAGGTCGGCG TCGGGGACGT GCGCGCGATG CACTACTTCG GCGAGGAGAT GGTGGCCTGG
CGCGCACAGT CGGGCCGCGT CACCGTGATG AACGCCTACT GCGAACACCT CGGCGCGCAC
CTGGGCCACG GCGGTCACGT GGTCGACGAG GTCATCCAGT GCCCGTTCCA CGGCTGGCAG
TGGAACGCCG AGGGTCGCAA CGTCTGCATC CCGTACCAGG ACCGGCCCAA CCGCGGCAGG
CGGATGCGGA CCTACCCGGT GGTCGAGCGC AACGACGCCA TCTGGATCTG GCACGACGTC
GACGGCCGCG AACCCTTCTT CGACGCCCCC GACGTGTTCG CCTCGTTCGC CGACGGCAGC
AGCGCCGCGG GCTACTACCC GCAGCAGCGG CTCTTCCGCG GGTCGCTGGA GATGCACCCG
CAGTACGTCC TCGAGAACGG CGTCGACTTC GCGCATTTCA AGTACGTGCA CCAGACGCCG
ATCGTCCCGG TGTTCACCCG TCACGACTTC TCCGCACCCG TGTCCTACGT CGACTTCACC
ATCACGTTCG AAGGTGACGA GGGTCAGTCC ATCGACGATG TGCGCAGCGG CGTCGAGGCC
ATCAACGGCG GGCTGGGCAT CGCGGTGACC AAGAGCTGGG GGATGGTCGA CAACCGCACG
ATCTCGGCGG TCACCCCCGT CGACGAGCGC ACCTCCGATG TCCGGTTCAT GGTCTACATC
GGACGCACTC CCGGTCGAGA CGACCAGCGG GCCGCGGACA AGGCGCGCGG CTTCGGCGAG
GAGGTCATCC GGCAGTTCGC CCAGGACATC CACATCTGGA GCCACCAGCG CTACTCCGAT
CCGCCCGCGC TGGCGACCGC CGAGTTCGAG GGTTTCACCG CGATCCGCCA GTGGGCCAAG
CAGTTCTACC CGGACGGCAT CGGTGGCAGC GCCGCCGAAG TCCACGCCGC ACTACAGAAG
GGCTGA
 
Protein sequence
MAKPPLSMKP TGWFCVAWSD EVGVGDVRAM HYFGEEMVAW RAQSGRVTVM NAYCEHLGAH 
LGHGGHVVDE VIQCPFHGWQ WNAEGRNVCI PYQDRPNRGR RMRTYPVVER NDAIWIWHDV
DGREPFFDAP DVFASFADGS SAAGYYPQQR LFRGSLEMHP QYVLENGVDF AHFKYVHQTP
IVPVFTRHDF SAPVSYVDFT ITFEGDEGQS IDDVRSGVEA INGGLGIAVT KSWGMVDNRT
ISAVTPVDER TSDVRFMVYI GRTPGRDDQR AADKARGFGE EVIRQFAQDI HIWSHQRYSD
PPALATAEFE GFTAIRQWAK QFYPDGIGGS AAEVHAALQK G