Gene Mmcs_4772 details


Gene Information

Locus tag: Mmcs_4772
Symbol:
ID: 4113601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 5048898
End bp: 5050088
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 67%
IMG OID: 638033923
Product: luciferase-like protein
Protein accession: YP_641932
Protein GI: 108801735
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
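
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 5048898
end_bp = 5050088

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa; leftover bases:", remainder)
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.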


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCA AACCGCTGAA CTTCGGCGTC TTCATCACAC CGTTCCACCC GGTGGGCCAA 
TCCCCCACCG TCGCATTGGA ATACGACCTC GAGCGGGTGG TGCGGCTGGA CCGGCTCGGC
TTCGACGAGG CGTGGTTCGG CGAACACCAT TCGGGCGGTT ACGAACTCAT CGCCTGCCCG
GAGGTCTTCA TCGCCACGGC CGCCGAACGG ACCAGACACA TCCGCCTCGG CACCGGCGTG
GTGTCGCTGC CCTACCACCA TCCGCTCATG GTCGCCGACC GGTGGGTCCT GCTCGACCAC
CTCACCCGTG GCCGCGTCAT GTTCGGCACC GGGCCGGGCG CGCTGCCGTC GGACGCCTAC
ATGATGGGCC TCGACCCGGT CGAGCAGCGC CGCATGATGC AGGAGTCGCT CGAGGCGATC
CTCGCGTTGT TCCGCGCGGA ACCCGAGGAA CGCATCACCC GCGAGACCGA CTGGTTCACC
CTGCGCGACG CCCAACTTCA CATCCGGCCC TACACCTGGC CGTACCCCGA GATCTCCACC
GCCGCGATGA TCTCCCCGTC CGGACCACGA CTGGCCGGTT CACTCGGGAC GTCGCTGCTG
TCACTGTCGA TGTCGGTGCC GGGTGGATTC GCGGCACTGG AGTCGACGTG GCAGATCGTC
GTCGACCAGG CGGCCAAATC GGGCCGCCCC GAACCCGAAC GGGACAACTG GCGGGTGCTG
TCGATCATGC ACCTGGCCGA CACCCGAGAG CAGGCGATCG ACGACTGCAC GTACGGACTG
GCCGATTTCG CCAACTACTT CGGTGCGGCC GGCTTCGTCC CGCTGTCCAA CAGCGTCGAG
GGGGAACGCT CACCGCGGGA GTTCGCGGCC GAGTACGCCG CACAGGGGAA CTGCTGTATC
GGCACGCCCG ACGACGCCAT CGCCTACATC GACGACCTGC TCGTGAAGTC CGGCGGGTTC
GGGACACTGC TGCTGCTCGG CCACGACTGG GCGTCCCCGG AGGCCACCTA CCACTCCTAC
GACCTGTTCG CGCGAAAGGT GATGCCGCAC TTCAAGGGTC AGCTCACCGC CCCTCGCGCC
TCGCACGACT GGGCCAAGGG TATGCGCGAC CAGTTGCTCG GCCGGGCCGG CGACGCGATC
GTCAAGGCGA TCACCGAACA CACCGACGAG TTGAAGGTGG ACCCGCGGTA G
 
Protein sequence
MSRKPLNFGV FITPFHPVGQ SPTVALEYDL ERVVRLDRLG FDEAWFGEHH SGGYELIACP 
EVFIATAAER TRHIRLGTGV VSLPYHHPLM VADRWVLLDH LTRGRVMFGT GPGALPSDAY
MMGLDPVEQR RMMQESLEAI LALFRAEPEE RITRETDWFT LRDAQLHIRP YTWPYPEIST
AAMISPSGPR LAGSLGTSLL SLSMSVPGGF AALESTWQIV VDQAAKSGRP EPERDNWRVL
SIMHLADTRE QAIDDCTYGL ADFANYFGAA GFVPLSNSVE GERSPREFAA EYAAQGNCCI
GTPDDAIAYI DDLLVKSGGF GTLLLLGHDW ASPEATYHSY DLFARKVMPH FKGQLTAPRA
SHDWAKGMRD QLLGRAGDAI VKAITEHTDE LKVDPR
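
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch of how the two relate, the following Python removes that block formatting and translates DNA codon by codon. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGCCGCA AACCGCTGAA CTTCGGCGTC TTCATCACAC CGTTCCACCC GGTGGGCCAA"
last_line = "GTCAAGGCGA TCACCGAACA CACCGACGAG TTGAAGGTGG ACCCGCGGTA G"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the protein sequence and the GC content field to be cross-checked in the same way.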