Gene Mkms_1626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1626 
Symbol 
ID4614095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1740695 
End bp1741879 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content70% 
IMG OID639791297 
ProductDyp-type peroxidase family protein 
Protein accessionYP_937623 
Protein GI119867671 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAGC GGTTCAGTCG CCGCGCACTG CTCGGGGCGG GGGCCGCCGC CGCTGGGATC 
GCCGGGGTGG GGTTGACGGC GACGGCGGGC GCCCGCGGTC TGGGGCCCGC GGCCGTCGCC
ACCGAGCCCT TTCACGGCGT GCACCAGGCC GGAATCGCCA CGCGCCCACA GGCATACACG
ACGCTGGTCG CACTGGACCT GCGTACGGGG CATGCCGACC GCGCGACGCT GCGGTCGATC
ATGAGGTTGT GGAGCGAGGA CGCCGCGCGA CTGACCCAGG GGCTACCGGC GCTGGCCGAC
ACCGAACCCG AACTCGCCGC CGATCCCGCA CGACTCACGG TCACGGTCGG CTACGGATCG
GAGCTCTTCG ACGCCGTCGG GCTGACCGCC CACCGGCCCG CAACGCTGCG GCCGCTGCCG
AACTTCGCCG TGGACCGGCT CGAATCACGT TGGTGCGGTG GGCATCTGCT GCTGCAGCTG
TGCGGCGACA ACCTGCTGAC GCTCAGCCAT GCTTACCGCG TGCTGACCAA GAACGTGCGG
ACGATGGCCG CAATCCGCTG GATTCAGCGT GGCTACCGCA CCCCGGCGGG TTCGTCGCCG
GCGGGAACGT CGATGCGCAA CGTCATGGGT CAGGTCGACG GGACGGTCAG CCTCGACGGC
GCAGCACTGG ACGACCACGT CTGGTGTTCG GGTTCGGACC AACCGTGGTT CGCCGGCGGC
ACCGTCCTGG TACTGCGACG CATCCGCGCC GAGATGGACA CATGGGATCA GGTGGACCGG
CACTCCAAGG AGTTGTTCGT TGGCCGCAGA CTCGGCAGCG GGGCGCCGCT GACCGGATCC
CGCGAGCGCG ACGAACCCGA TTTCGCCGCC GCCGTCAACG GAATCCCCGT CATCCCGGAG
AACTCGCACA TCGCACTCGC CCGGCATCGC TCCGACGAAG AACGGTTCCT CCGGCGGCCG
TACAACTACG ACGACACACC ACCGGTCGGC CAGGTCTCCG ACAGCGGACT GCTGTTCCTC
GCCTACCAGC GTGACCCGGC CAGCCAGTTC GTCCCGGCTC AGCAGCGGCT GTCCGAGGCC
GATGCGCTCA ACCCGTGGAT CACGCCCGTC GGATCGGCGG TGTTCGCGAT TCCGCCTGGC
GCACCCGAAG GAGGTTACCT GGGCCAGCAG CTACTCGAAA TGTGA
 
Protein sequence
MGERFSRRAL LGAGAAAAGI AGVGLTATAG ARGLGPAAVA TEPFHGVHQA GIATRPQAYT 
TLVALDLRTG HADRATLRSI MRLWSEDAAR LTQGLPALAD TEPELAADPA RLTVTVGYGS
ELFDAVGLTA HRPATLRPLP NFAVDRLESR WCGGHLLLQL CGDNLLTLSH AYRVLTKNVR
TMAAIRWIQR GYRTPAGSSP AGTSMRNVMG QVDGTVSLDG AALDDHVWCS GSDQPWFAGG
TVLVLRRIRA EMDTWDQVDR HSKELFVGRR LGSGAPLTGS RERDEPDFAA AVNGIPVIPE
NSHIALARHR SDEERFLRRP YNYDDTPPVG QVSDSGLLFL AYQRDPASQF VPAQQRLSEA
DALNPWITPV GSAVFAIPPG APEGGYLGQQ LLEM