Gene Mkms_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0354 
Symbol 
ID4615237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp387184 
End bp388464 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID639790029 
Producthypothetical protein 
Protein accessionYP_936361 
Protein GI119866409 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.428446 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGTG ACTGGGTCGT CGGCGACCAT CTCGGCCTGC CGATCCCCGC CACCGCCGCT 
GCACTGCGTG ACGGCGGAGA GACGTTTCTG ACCAACGCGT TTCGCGCATT CGGTGCACTC
ACCGAGGACA ACCGGGTGGT ACGGATCGGC CGGTGCGACG AGATGACCGG CGGCAGCACA
GGCCGCAAGA TGCTGCTCGA CGTCGAGTAC GCCCGCCCCG AACCGGGTCT GCGCACGGAC
CTGTTCGTCA AGTTCTCCCG CGACTTCGAC GACCCCGTCC GGGACCGCGG GAGAACCCAG
ATGGCTTCGG AGGTGGTGTT CGCGGCGCTG TCGCGGACGC CCGGGTTCCC GATCGCGATC
CCCCATCCCC GGTTCGGGGA CTACCACGCC GGCACCGGCA CGGGAATCCT GATCACCGAT
CGGATCCCGT TCGGGTGCAA CGGTGTCGAG CGCCAGTACG AGAAGTGCCT GGACGAGGAC
ATGCCGCATC CCGACGAGCA CTACCGCGCG CTGGTCACCG CGCTGGCGCG GCTGGCGGGC
GCGCAGCGGT CCGGTCGGCT GCCTGAGCAG CTCTCGGCGG CGTTCCCCGT CGACCTGCGG
GCGGCGACCG TCGGGGAACC GGTGACGTTG TCACCAGATC GATTGCAGCG CCGGTTGTCC
CGTCTCGGAG AATTCACCGA GACCCACCCG GGACTGCTGC CACCGCATGT GCGGACCTCC
GGCTTCCTGG CGCGCCTCGG CGAGGAGGCC CATGAGGTGT TACGCCGCGA GCAGGCGATC
TGGCGATCGC TGCGGGACGC CGACGACCAC ATCGCGCTGA GCCACTGGAA CGCCAACGTC
GACAACGCGT GGTTCTGGCG CGACGGCGGC GGCGTGCTGC AGTGCGGGCT GATGGACTGG
GGCTGCGTCA GCCGACTGAA CCTCGCGATG GCGCTGTGGG GCGCGTTGTG CGCCGCCGAA
ACCGACCTGT GGGACAACCA CTTCGACGAG CTGCTCGTGC TGTTCTGCAC CGAGGTGGAA
GGCGCAGGAG GACCACGACC CGATCCGGTG CTGATGCGGC GGCACCTGAT GCTCTACATG
GCGCTGATGG GCATCACCTG GCTGCTCGAC GTGCCTGCGC GCATCGGCAA CCGCCTGCCC
GACGCCGACG TCCACACCAC GAGACACGAT CCACGCATCC GTGGGGACGA GAGCCTGCGC
GCTCCGCTGC AGATGTTCAC CAACATGTTG AACCTCTGGC AGACAAGGGG TTTGAGCGGC
CACCTGGAGG GGCTCGACTA G
 
Protein sequence
MSGDWVVGDH LGLPIPATAA ALRDGGETFL TNAFRAFGAL TEDNRVVRIG RCDEMTGGST 
GRKMLLDVEY ARPEPGLRTD LFVKFSRDFD DPVRDRGRTQ MASEVVFAAL SRTPGFPIAI
PHPRFGDYHA GTGTGILITD RIPFGCNGVE RQYEKCLDED MPHPDEHYRA LVTALARLAG
AQRSGRLPEQ LSAAFPVDLR AATVGEPVTL SPDRLQRRLS RLGEFTETHP GLLPPHVRTS
GFLARLGEEA HEVLRREQAI WRSLRDADDH IALSHWNANV DNAWFWRDGG GVLQCGLMDW
GCVSRLNLAM ALWGALCAAE TDLWDNHFDE LLVLFCTEVE GAGGPRPDPV LMRRHLMLYM
ALMGITWLLD VPARIGNRLP DADVHTTRHD PRIRGDESLR APLQMFTNML NLWQTRGLSG
HLEGLD