Gene Mkms_3642 details


Gene Information

Locus tag: Mkms_3642
Symbol:
ID: 4611572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3835546
End bp: 3836571
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 67%
IMG OID: 639793318
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_939626
Protein GI: 119869674
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
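
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3835546
END_BP = 3836571


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1026 341
```

Under these assumptions the results agree with the listed Gene Length (1026 bp) and Protein Length (341 aa).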


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.55041
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.365786
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAC CGCCGTTGTC GATGAAGCCG ACCGGCTGGT TCTGCGTGGC GTGGTCGGAC 
GAGGTCGGCG TCGGGGACGT GCGCGCGATG CACTACTTCG GCGAGGAGAT GGTGGCCTGG
CGCACACAGT CGGGCCGCGT CACCGTGATG AACGCCTACT GCGAACACCT CGGCGCGCAC
CTCGGCCACG GCGGTCACGT GGTCGACGAG GTCATCCAGT GCCCGTTCCA CGGCTGGCAG
TGGAACGCCG AGGGTCGCAA CGTCTGCATC CCGTATCAGG ACCGGCCCAA CCGCGGCAGG
CGGATGCGGA CCTACCCGGT GGTCGAGCGC AACGACGCCA TCTGGATCTG GCACGACGTC
GACGGCCGCG AACCCTTCTT CGACGCCCCC GACGTGTTCG CCTCGTTCGC CGACGGCAGC
AGCGCCGCGG GCTACTACCC GCAGCAGCGG CTCTTCCGCG GGTCGCTGGA GATGCACCCG
CAGTACGTCC TCGAGAACGG CGTCGACTTC GCGCATTTCA AGTACGTGCA CCAGACGCCG
ATCGTCCCGG TGTTCACCCG TCACGACTTC TCCGCACCCG TGTCCTACGT CGACTTCACC
ATCACGTTCG AAGGTGACGA GGGTCAGTCC ATCGACGATG TGCGCAGCGG CGTCGAGGCC
ATCAACGGCG GGCTGGGCAT CGCGGTGACC AAGAGCTGGG GGATGGTCGA CAACCGCACG
ATCTCGGCGG TCACCCCCGT CGACGAGCGC ACCTCCGATG TCCGGTTCAT GGTCTACATC
GGACGCACTC CCGGTCGAGA CGACCAGCGG GCCGCGGACA AGGCGCGCGG CTTCGGCGAG
GAGGTCATCC GGCAGTTCGC CCAGGACATC CACATCTGGA GCCACCAGCG CTACTCCGAT
CCGCCCGCGC TGGCGACCGC CGAGTTCGAG GGTTTCACCG CGATCCGCCA GTGGGCCAAG
CAGTTCTACC CGGACGGCAT CGGTGGCAGC GCCGCCGAAG TCCACGCCGC ACTACAGAAG
GGCTGA
 
Protein sequence
MAKPPLSMKP TGWFCVAWSD EVGVGDVRAM HYFGEEMVAW RTQSGRVTVM NAYCEHLGAH 
LGHGGHVVDE VIQCPFHGWQ WNAEGRNVCI PYQDRPNRGR RMRTYPVVER NDAIWIWHDV
DGREPFFDAP DVFASFADGS SAAGYYPQQR LFRGSLEMHP QYVLENGVDF AHFKYVHQTP
IVPVFTRHDF SAPVSYVDFT ITFEGDEGQS IDDVRSGVEA INGGLGIAVT KSWGMVDNRT
ISAVTPVDER TSDVRFMVYI GRTPGRDDQR AADKARGFGE EVIRQFAQDI HIWSHQRYSD
PPALATAEFE GFTAIRQWAK QFYPDGIGGS AAEVHAALQK G
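
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one possible way to strip the block formatting, compute a GC fraction, and run a basic sanity check on a coding sequence. The helper names are hypothetical, and the start-codon check is a simplification: it only accepts ATG, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq):
    """Simplified check: length divisible by 3, ATG start, table 11 stop."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence above, joined only as a short demo.
first_line = "ATGGCCAAAC CGCCGTTGTC GATGAAGCCG ACCGGCTGGT TCTGCGTGGC GTGGTCGGAC"
last_line = "GGCTGA"
demo = clean_sequence(first_line + "\n" + last_line)
print(len(demo), looks_like_complete_cds(demo))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content listed in the gene information.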