Gene Mmcs_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0344 
Symbol 
ID4109190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp379844 
End bp381124 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID638029469 
Producthypothetical protein 
Protein accessionYP_637521 
Protein GI108797324 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.053887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGTG ACTGGGTCGT CGGCGACCAT CTCGGCCTGC CGATCCCCGC CACCGCCGCT 
GCACTGCGTG ACGGCGGAGA GACGTTTCTG ACCAACGCGT TTCGCGCATT CGGTGCACTC
ACCGAGGACA ACCGGGTGGT ACGGATCGGC CGGTGCGACG AGATGACCGG CGGCAGCACA
GGCCGCAAGA TGCTGCTCGA CGTCGAGTAC GCCCGCCCCG AACCGGGTCT GCGCACGGAC
CTGTTCGTCA AGTTCTCCCG CGACTTCGAC GACCCCGTCC GGGACCGCGG GAGAACCCAG
ATGGCTTCGG AGGTGGTGTT CGCGGCGCTG TCGCGGACGC CCGGGTTCCC GATCGCGATC
CCCCATCCCC GGTTCGGGGA CTACCACGCC GGCACCGGCA CGGGAATCCT GATCACCGAT
CGGATCCCGT TCGGGTGCAA CGGTGTCGAG CGCCAGTACG AGAAGTGCCT GGACGAGGAC
ATGCCGCATC CCGACGAGCA CTACCGCGCG CTGGTCACCG CGCTGGCGCG GCTGGCGGGC
GCGCAGCGGT CCGGTCGGCT GCCTGAGCAG CTCTCGGCGG CGTTCCCCGT CGACCTGCGG
GCGGCGACCG TCGGGGAACC GGTGACGTTG TCACCAGATC GATTGCAGCG CCGGTTGTCC
CGTCTCGGAG AATTCACCGA GACCCACCCG GGACTGCTGC CACCGCATGT GCGGACCTCC
GGCTTCCTGG CGCGCCTCGG CGAGGAGGCC CATGAGGTGT TACGCCGCGA GCAGGCGATC
TGGCGATCGC TGCGGGACGC CGACGACCAC ATCGCGCTGA GCCACTGGAA CGCCAACGTC
GACAACGCGT GGTTCTGGCG CGACGGCGGC GGCGTGCTGC AGTGCGGGCT GATGGACTGG
GGCTGCGTCA GCCGACTGAA CCTCGCGATG GCGCTGTGGG GCGCGTTGTG CGCCGCCGAA
ACCGACCTGT GGGACAACCA CTTCGACGAG CTGCTCGTGC TGTTCTGCAC CGAGGTGGAA
GGCGCAGGAG GACCACGACC CGATCCGGTG CTGATGCGGC GGCACCTGAT GCTCTACATG
GCGCTGATGG GCATCACCTG GCTGCTCGAC GTGCCTGCGC GCATCGGCAA CCGCCTGCCC
GACGCCGACG TCCACACCAC GAGACACGAT CCACGCATCC GTGGGGACGA GAGCCTGCGC
GCTCCGCTGC AGATGTTCAC CAACATGTTG AACCTCTGGC AGACAAGGGG TTTGAGCGGC
CACCTGGAGG GGCTCGACTA G
 
Protein sequence
MSGDWVVGDH LGLPIPATAA ALRDGGETFL TNAFRAFGAL TEDNRVVRIG RCDEMTGGST 
GRKMLLDVEY ARPEPGLRTD LFVKFSRDFD DPVRDRGRTQ MASEVVFAAL SRTPGFPIAI
PHPRFGDYHA GTGTGILITD RIPFGCNGVE RQYEKCLDED MPHPDEHYRA LVTALARLAG
AQRSGRLPEQ LSAAFPVDLR AATVGEPVTL SPDRLQRRLS RLGEFTETHP GLLPPHVRTS
GFLARLGEEA HEVLRREQAI WRSLRDADDH IALSHWNANV DNAWFWRDGG GVLQCGLMDW
GCVSRLNLAM ALWGALCAAE TDLWDNHFDE LLVLFCTEVE GAGGPRPDPV LMRRHLMLYM
ALMGITWLLD VPARIGNRLP DADVHTTRHD PRIRGDESLR APLQMFTNML NLWQTRGLSG
HLEGLD