Gene Mkms_3820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3820 
Symbol 
ID4611755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4034333 
End bp4035655 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content63% 
IMG OID639793500 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_939803 
Protein GI119869851 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.365881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACATT TCCCCAAACC GGCCGCCGGT AGCTGGACCG AGAACTGGCC GGAGCTCGGC 
ACGGCACCGG TGGACTACAC CGACTCGATC GACCCGGAGC AGTGGAAGCT GGAGCAGCAG
GCCATCTTCC GGAAGCTGTG GCTGCACGTC GGTCGCGTGG AGCGGCTCCC CAAGACCGGC
AGCTACTTCA CCAGGGAGAT GCCCTCCGTC GGACCGGGCA CCTCGATCAT CGTCAACAAG
GACAAGGACG GCACCATCCG GGCGTTCTAC AACCTGTGCC GCCACCGCGG AAACAAGTTG
GTGTGGAACG ACTATCCGGG CGAAGAGGTC TCGGGCAGCT GCCGCCAGTT CACCTGCAAG
TACCACGCCT GGCGTTACGC CCTCAACGGC GACCTGACGT TCATCCAGCA GGAGGATGAG
TTCTTCGACG TCGACAAGGC CGACTACCCG CTCAAGCCGG TGCGCTGCGA GGTGTGGGAA
GGCTTCATCT TCGTCAACTT CGACGACGAC GCCGAACCGC TGGAGGACTA CCTGGGCGAG
TTCGGGCAGG GCCTCAAGGG CTACCCGTTC CACGAGATGA CCGAGGTGTA CAGCTACCGC
TCCGAGATCA AGGCGAACTG GAAGCTGTTC ATCGACGCGT TCGTCGAGTT CTACCACGCG
CCGATCCTGC ACATGAAGCA GGCGACCCCG GAAGAGGCGG CCAAGCTCGC CAAGATCGGT
TTCGAGGCGC TGCATTACGA CATCAAGGAC CAGCACTCGA TGATCTCGTC CTGGGGTGGC
ATGAGCCCGC CCAAGGACCT CAGCATGGTC AAGCCGATCG AGCGGATCCT GCACAGCGGT
CTGTTCGGCC CCTGGGACCG TCCCGACATC AAGGGCATCC TGCCCGACGA GCTGCCGCCG
GCGGTCAACC CGGCTCGCCA GAAGACGTGG GGCCAGGACT CGTTCGAGTT CTTCCCGAAC
TTCACGCTGC TGCTGTGGGT TCCGGGTTGG TACCTGACGT ACAACTACTG GCCCACCGGT
GTGGACACCC ACATCTTCGA GGCCAACCTG TACTTCGTGC CGCCGAAGAA CACCCGCCAG
CGCCTGTCGC AGGAACTCGC GGCCGTGACG TTCAAGGAGT ACGCGCTGCA GGACGCGAAC
ACCCTGGAAG CCACCCAGAC TCAGATCGGC ACCCGCGCCG TCACCGAGTT CCCGTTGTGC
GATCAGGAGA TCCTGCTGCG CCACCTGCAC CACACCGCGC ACAAGTACGT CGACGAGTAC
AAGCTCGAGC AGGCCGCGAA GGCGGCGACG AACGGAAAGG TCAAGGACGA GGCACATGTC
TGA
 
Protein sequence
MAHFPKPAAG SWTENWPELG TAPVDYTDSI DPEQWKLEQQ AIFRKLWLHV GRVERLPKTG 
SYFTREMPSV GPGTSIIVNK DKDGTIRAFY NLCRHRGNKL VWNDYPGEEV SGSCRQFTCK
YHAWRYALNG DLTFIQQEDE FFDVDKADYP LKPVRCEVWE GFIFVNFDDD AEPLEDYLGE
FGQGLKGYPF HEMTEVYSYR SEIKANWKLF IDAFVEFYHA PILHMKQATP EEAAKLAKIG
FEALHYDIKD QHSMISSWGG MSPPKDLSMV KPIERILHSG LFGPWDRPDI KGILPDELPP
AVNPARQKTW GQDSFEFFPN FTLLLWVPGW YLTYNYWPTG VDTHIFEANL YFVPPKNTRQ
RLSQELAAVT FKEYALQDAN TLEATQTQIG TRAVTEFPLC DQEILLRHLH HTAHKYVDEY
KLEQAAKAAT NGKVKDEAHV