Gene Mmcs_1602 details


Gene Information

Locus tag: Mmcs_1602
Symbol:
ID: 4110438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1736541
End bp: 1737725
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 70%
IMG OID: 638030723
Product: Dyp-type peroxidase
Protein accession: YP_638769
Protein GI: 108798572
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme; [TIGR01413] Dyp-type peroxidase family
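
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `feature_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is counted in the gene length but not in the protein length.

```python
from typing import Tuple


def feature_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that is not part of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Total codons minus the terminal stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = feature_lengths(1736541, 1737725)
print(gene_len, protein_len)  # 1185 394
```

Under these assumptions the result matches the 1185 bp and 394 aa reported in the record.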


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.544403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGAGC GGTTCAGTCG CCGCGCACTG CTCGGGGCGG GGGCCGCCGC CGCTGGGATC 
GCCGGGGTGG GGTTGACGGC GACGGCGGGC GCCCGCGGTC TGGGGCCCGC GGCCGTCGCC
ACCGAGCCCT TTCACGGCGT GCACCAGGCC GGAATCGCCA CGCGCCCACA GGCATACACG
ACGCTGGTCG CACTGGACCT GCGTACGGGG CATGCCGACC GCGCGACGCT GCGGTCGATC
ATGAGGTTGT GGAGCGAGGA CGCCGCGCGA CTGACCCAGG GGCTACCGGC GCTGGCCGAC
ACCGAACCCG AACTCGCCGC CGATCCCGCA CGACTCACGG TCACGGTCGG CTACGGATCG
GAGCTCTTCG ACGCCGTCGG GCTGACCGCC CACCGGCCCG CAACGCTGCG GCCGCTGCCG
AACTTCGCCG TGGACCGGCT CGAATCACGT TGGTGCGGTG GGCATCTGCT GCTGCAGCTG
TGCGGCGACA ACCTGCTGAC GCTCAGCCAT GCTTACCGCG TGCTGACCAA GAACGTGCGG
ACGATGGCCG CAATCCGCTG GATTCAGCGT GGCTACCGCA CCCCGGCGGG TTCGTCGCCG
GCGGGAACGT CGATGCGCAA CGTCATGGGT CAGGTCGACG GGACGGTCAG CCTCGACGGC
GCAGCACTGG ACGACCACGT CTGGTGTTCG GGTTCGGACC AACCGTGGTT CGCCGGCGGC
ACCGTCCTGG TACTGCGACG CATCCGCGCC GAGATGGACA CATGGGATCA GGTGGACCGG
CACTCCAAGG AGTTGTTCGT TGGCCGCAGA CTCGGCAGCG GGGCGCCGCT GACCGGATCC
CGCGAGCGCG ACGAACCCGA TTTCGCCGCC GCCGTCAACG GAATCCCCGT CATCCCGGAG
AACTCGCACA TCGCACTCGC CCGGCATCGC TCCGACGAAG AACGGTTCCT CCGGCGGCCG
TACAACTACG ACGACACACC ACCGGTCGGC CAGGTCTCCG ACAGCGGACT GCTGTTCCTC
GCCTACCAGC GTGACCCGGC CAGCCAGTTC GTCCCGGCTC AGCAGCGGCT GTCCGAGGCC
GATGCGCTCA ACCCGTGGAT CACGCCCGTC GGATCGGCGG TGTTCGCGAT TCCGCCTGGC
GCACCCGAAG GAGGTTACCT GGGCCAGCAG CTACTCGAAA TGTGA
 
Protein sequence
MGERFSRRAL LGAGAAAAGI AGVGLTATAG ARGLGPAAVA TEPFHGVHQA GIATRPQAYT 
TLVALDLRTG HADRATLRSI MRLWSEDAAR LTQGLPALAD TEPELAADPA RLTVTVGYGS
ELFDAVGLTA HRPATLRPLP NFAVDRLESR WCGGHLLLQL CGDNLLTLSH AYRVLTKNVR
TMAAIRWIQR GYRTPAGSSP AGTSMRNVMG QVDGTVSLDG AALDDHVWCS GSDQPWFAGG
TVLVLRRIRA EMDTWDQVDR HSKELFVGRR LGSGAPLTGS RERDEPDFAA AVNGIPVIPE
NSHIALARHR SDEERFLRRP YNYDDTPPVG QVSDSGLLFL AYQRDPASQF VPAQQRLSEA
DALNPWITPV GSAVFAIPPG APEGGYLGQQ LLEM
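
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGTGAGC GGTTCAGTCG CCGCGCACTG CTCGGGGCGG GGGCCGCCGC CGCTGGGATC"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # 60 ATG
print(round(gc_fraction(seq), 2))
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field in the record, assuming that field was computed over the same span.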