Gene Mkms_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0820 
Symbol 
ID4614840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp881141 
End bp884512 
Gene Length3372 bp 
Protein Length1123 aa 
Translation table11 
GC content62% 
IMG OID639790496 
Producthypothetical protein 
Protein accessionYP_936826 
Protein GI119866874 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.909605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0446039 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTAT CGGACTGGTT GAGCGATACG GACTATCACT GGCGGCAACG CCTGCAGCCG 
GTAAACCTCG TCATCGAAGC AAACTTTTCG ATAGATGAGG TGCGCCACGC ACAGCAGCGA
TATGGAGCAG CGGCGAGACA GCTCTTTCTG CGAGGCGTGC CCTACCGGAA ATTCATTCGG
CGGTACCCAG CACTCACATT GTTGGTACTT GTCGGCCATG CCGCGCTCGA GTACGACCAA
GGGAAGTACT GGGACAGCTT CTGGGATGAG CTCGGGATTC CACGCGACGC CGATTTCGAG
ACGGAGATCA GGAAGAATCT ATTCGACCTG CTCGACAAAT TCTCGCTCGC CCGCTTTCCG
CGAATCGAGG AGGCATCGAC ATTTAGATAC GTGATGACGT TGACCCTCCA TGCCGGAATC
CCGGTGCACT GCCTTGGCGA TCTCCTCAGA GTGATCAACG ATCACATCAG GCAAGGCCGG
GCACCCCATG GTGCCGCACT GATCGAGTGG TTGGAAGAGC CAGGAAAAGA ACACCGTATC
GATCCACTCG ACGTGCCCGT CCGAAACTTC ATCGCGAACG GCGCTCAGTT CGCCGTCGAC
ATCCTCGACC GAATCATCGA ATTGGTGCAG GAAGTCGCAG CGAATACAGG TCTGCTCGAC
GCCGACCTCG ATGCCTCTAC GACAGGCTTG CCCGACGTCC TCCTCGACGA GCTCATCAAG
CAACTGCGCG ACGCCCCGCC GACATCGCAA GGGAGGAGGC TGACTGGGCG CCACAACCGT
CAACCATCAA TCGAATACAA CGTCGACGAT GACGAGATCG TGCTGGTGCT ACCGACTCCG
GAGACCGATG TCGATCTCCC CTGGCGCGTG TCGTTCGACG GTGACGTCCG CCAGGTGCAC
CCCTCACGCC GGTGGGGTGG TGACGCCATG TCGGCCCAGG TAGCGGTTCC CGGGCCAGTC
CGCGAGATCG TGGTAGCCCA TCCCAGCGGA GTGAATTCGG CACTTCCGCT CGTGATGAAG
TCCGACCCAC TGCTGACCTT CGACAAGTCC GGCCGGTGGA TTCCCCGACG AGACGGCTTG
AAGGAATCCG TGTGGGCGGT CTTCCCTGAA GAGTTCCAGC TGGCCGACAC CCGGGCACAC
CAGGCTGTCG ACGCCCAGGA CTCTGGATCC CCAGCCGGAT GGCGTGGGTG GCGCAGCGCC
TTCATCGACC TCACCGAAAT CACGGCACTG CAGCTCCTCA CCTCCGACCG AGTTGCGATC
GGTACACCGC ACTCGGTTCG CAAAGATGCG AGGCCATCAT TCTTGCTCGG ATCCCCCGTC
GTAGGTGTGT CCGCACTCGA TGGGCGGACG GTGTATAACA CGCGTCCATG GGTCCTGCTG
CCTCCGTCGC AGACCGATCC CGCGCCCGAG TGGCTCGTAC GGGTGAGGCA CTTCGGGGCG
TCGGAATGGA TCGTCGAGGA GAGCTGGCGC GCGGAGGAAA TCGAAACCTG TGTCGACCCG
TTCGACGACG ACGAGAACCC GCAGCTGGGC CTCTTTGAGA TTGTCGTGAA CGGCCCACTG
GGCGCTGATG CGCGCTATGT GGTGTTCATG GCGGAAGGCT TGCACATTGA CTTCGATACC
CCGATTCGCG TGCCCGGTCG GGAAGGGCTG ACGCCCTGCA CCGCTGAGGT AACAGCTGAC
CATCTTGCGG TGTCCCCTTC AGAACCGTTG CGCTTCGGCC CACGTCAACT GGAGCAGCAG
ATCAGGCTGC AGTCAGGAGA CCTCGAGTCG AGGATCGCGG TCAGACCGCC CCACGTTGAG
ATCCGCGCGG GCGTGTCCGG CGAACCAGCA GCGTGGAGGA TGACCCCCGA GGTCTGCGAT
CCTGCCGACT TCGCCGAAGA CCGCTTCGCC GCGATACGTG TTCCGGGCAT CGATCACGTC
CAGTTCGCAT ACATCTCATC GCACGGCGAT CTGCTTCAAC GCGATCCGAA CTCGCGAAGG
CGCCACGGCG ACGTCGTCGA GTCGCGAATC CAGCAGTTCG CAGACACAGT GCGAAACAAC
CCGGGCGGAC GGGTTGTGGC GACGCTTTCG ACGCACGCCG GCCCCCTCGA CGTGACCGTA
CTCTTCGCAT ATCCCCGACG ACTGGCCTCG GGCGTCCAGC TCCACGAGGA CACGTTGAAA
TTTTTCGAAA CTCCCGCCCT TGACGATCTG GCGGTGTATG TCTGGAGCAG CACCGCGCCC
TGGCGGGCGC CCGAAGTTCT GCCGGTCTCA GACGGAATGG CCGCTCTCCC TCCTGCTTTG
GTCGACGCCG GGGATCTGCG GTGCCAGTTG TTCATCGACG ACCCTTGGGT GTTGATCGAG
CCGCCACCGA TGCCACCCGC GAGCGCTTTC GTCGTAGAGC AAGTCGGCTG GCGCGAGGAC
GGTACACCGA GCCAGGTGAA GCTCTCGCGG TATCTCGGTA CACAACGCTC AGCGCCCATC
GAAGTAGGCG CGATCCCCGA GGTGTGGGCG GCCATGGCAC GGCTCCACGC AGACGGCAAG
GCAGAACGCT TCGATGGGCT GACGCAAGTT TTGGCCGTCG ATCCCCGTTT CGCTCTGGAA
CGCCTGGGCA ACAGCATGAT TCCCGCTGGC GACAAGATGG CCATGCTCAT TCGCAGTGAG
TTGGTCAACC ACGATTTCTC CGCGGAGGAA ACCCTCAACG ACCTACACGC CCACCCGTGG
TTCGGTTGCA TGGTCGAGCT CGCCGACCTG CCGTCCCTAC ACAACCGCCG TGAGCAAGTG
CGAGACGAGC GCGCGCAGAC ACTTGCCTAC CTCCAGGACC GAGGTGGGGT GCCTCTGATG
GATCTGCTGC GAACCGGCAC GAACGACCAC GCTTTCGGGG CGTGTTTCGA CGGCAACGTG
TTCCGTTGGA CTGAGATACC GGGCAACCGA ATCGAAGAGA AGCTGCGTGA AATCCAGCAG
ATCCCCCTCG CGCAACTGCA TCACGACAAC CTGCGCGCAG GGGTGTACGA AGCGTTCTGC
CGGCGCAGCG AGTGGCTCGC ATCAGGCTGG ACCGCACACT TCGCCATGCA AACAGGATTG
GTGGCCACGC CGATCAGACA CGCCTCACTG CTTGCTCACG AGGCGGTAGT GACCCGCCAC
GACCGCGTCC GAAAGATCGA CGCTTCAGCG AATCCCTGGA TTCTCATGTC GGTGGAGTCG
CTGACCTTGG CGCTACTGGC TCGGCTCGAA GCTCATGGAC GAATCGACGG CCGGTACCTC
GATCGTGGAC TGTTACGGAC GTGGTCCCGC ATGGCGAAGC TCTGCCCGAC CATGGTGGCG
AACGATCTGT TGATCGCCGA AGCCGTTGTG CTGTATGACC GGCGCGGCGA CCTCACTGGA
GAGGACACAT GA
 
Protein sequence
MSLSDWLSDT DYHWRQRLQP VNLVIEANFS IDEVRHAQQR YGAAARQLFL RGVPYRKFIR 
RYPALTLLVL VGHAALEYDQ GKYWDSFWDE LGIPRDADFE TEIRKNLFDL LDKFSLARFP
RIEEASTFRY VMTLTLHAGI PVHCLGDLLR VINDHIRQGR APHGAALIEW LEEPGKEHRI
DPLDVPVRNF IANGAQFAVD ILDRIIELVQ EVAANTGLLD ADLDASTTGL PDVLLDELIK
QLRDAPPTSQ GRRLTGRHNR QPSIEYNVDD DEIVLVLPTP ETDVDLPWRV SFDGDVRQVH
PSRRWGGDAM SAQVAVPGPV REIVVAHPSG VNSALPLVMK SDPLLTFDKS GRWIPRRDGL
KESVWAVFPE EFQLADTRAH QAVDAQDSGS PAGWRGWRSA FIDLTEITAL QLLTSDRVAI
GTPHSVRKDA RPSFLLGSPV VGVSALDGRT VYNTRPWVLL PPSQTDPAPE WLVRVRHFGA
SEWIVEESWR AEEIETCVDP FDDDENPQLG LFEIVVNGPL GADARYVVFM AEGLHIDFDT
PIRVPGREGL TPCTAEVTAD HLAVSPSEPL RFGPRQLEQQ IRLQSGDLES RIAVRPPHVE
IRAGVSGEPA AWRMTPEVCD PADFAEDRFA AIRVPGIDHV QFAYISSHGD LLQRDPNSRR
RHGDVVESRI QQFADTVRNN PGGRVVATLS THAGPLDVTV LFAYPRRLAS GVQLHEDTLK
FFETPALDDL AVYVWSSTAP WRAPEVLPVS DGMAALPPAL VDAGDLRCQL FIDDPWVLIE
PPPMPPASAF VVEQVGWRED GTPSQVKLSR YLGTQRSAPI EVGAIPEVWA AMARLHADGK
AERFDGLTQV LAVDPRFALE RLGNSMIPAG DKMAMLIRSE LVNHDFSAEE TLNDLHAHPW
FGCMVELADL PSLHNRREQV RDERAQTLAY LQDRGGVPLM DLLRTGTNDH AFGACFDGNV
FRWTEIPGNR IEEKLREIQQ IPLAQLHHDN LRAGVYEAFC RRSEWLASGW TAHFAMQTGL
VATPIRHASL LAHEAVVTRH DRVRKIDASA NPWILMSVES LTLALLARLE AHGRIDGRYL
DRGLLRTWSR MAKLCPTMVA NDLLIAEAVV LYDRRGDLTG EDT