Gene Mkms_3566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3566 
Symbol 
ID4611496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3756527 
End bp3758680 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content67% 
IMG OID639793242 
Producthypothetical protein 
Protein accessionYP_939550 
Protein GI119869598 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.748776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.18845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACCAC CTGATAGCAG AGTGTCGGCC CCCGAGGGGA CCGACACCCG CCACGACGCC 
CAAGTCGATA GTCCCGATCT TACCGGCCCG TACGCGGCGG CGTGCAACAT CTATGCCGAC
CTCGGTTGGC GCGGCGTGAT TCCCGTGGAC CCGCGCGACA AGGGCGGCGT ACCTGCCGGA
TTCACCGGGT ACGGCGGCAT CGATGTGACA CCGGAAAACA TGGCCTGGTT CGCCAAGTCG
AAACCCGGTC ACAACATCGG CCTACGCCTG CCCGACGGCG TCATCGGCAT CGACGTCGAC
GCTTACGGCC CGAAAAAGGG CGCCGACACC TTCGCCGAAG CGCAAGGGCG TTGGGGCGCT
TTGCCGCCCA GCTATCGCAG CACGAGCCGC GACGATGGCA TATCGGGCAT TCGGCTCTAC
CGCGTGCCTG CTGGCACCAA GTTGGAAACC ATAATCGAAT TCAAAGATCT TGATATCCGC
GATATCGAGA TCGTCCAGCG CCATCACCGG CACGTCCAGT GCTGGCCGTC AATTCACGAC
AAAACCGGTC AGCGGTACCG GTGGGTCTCC GAGCTCGACG GCAGTGCGAT GGACACCCCA
CCGGCGCCGG ATGATCTGCC CAACCTGCCG GCGGCATGGG TGGCGGCGCT GCGCGCCGAG
AGCGGATCGA ACGGAACCCC ACTCAATGGC GCCGAGGCGC CCGTGGACGT GCAAACGGCG
CTCACCGAAG GTGACGCATC GCCGCGGGTC GCTGAGCTAC TCGCTCGCGC GATCGGCGAT
TGCTACGGGG GCAGTCGGTT CGACCACACC CGCGGCAACG TCCTGACGCT CTTACGCTTC
GGCAAGCAGG GGGACACCGG TGTACGCCCC GCGCTCTCAG CGCTCAAAGC TGTGTACGTC
AACGCCGTCA GCCCCGATCG TGCGGGCGGT CAGAGGGCCG CCGAGGTCGA ATTCGACCGG
CTGGTGTCCG GCAAGAAGGT CGCCACGCTG CTCGCCGAAC CGGACTACAA CGATTGGGTC
TCGGATCTCG CACCGGCGAA CGCTGCGGAC ATAGCGGCGC CAGAACTACC GGCCGACGAC
GGCCGTGCGC CGGCGGCGAC GGGCTGGGAG CCGGTCGACC TCGGTCCGTG GCTGCGCGGG
GAGATCGAAC TACCGACCCC GTCGCTTGGT ATCGCGCGAT CAGACGGGCT TCGGCTACTC
TACCCGGGTC ACGAGCACGC AGTCATCGGG GAGACGGAGG CGGGCAAGTC TTGGCTCGCT
CTGCAGTGTG CGGCCGTCGA GCTGCGCGCC GACAACGCCG TGGTGTACGT CCATTTCGAA
GAGGGCAACC CGAGCAGCAC TATCGAACGT CTGCGGCTGC TAGGCGTCGA TATCGAGACA
ATGACTCGAC GGTTGCGTTT CGTCGCGCCC TCGCGTGCGC TTGCCGATGC TGAGTGGCTG
GCTGCGCTGC TGCGCGATCC TACGCCGACG CTCGTGGTGC TCGACGGCGT CAATGAGGGC
ATGGCGTTGC ACGGGCTCGA CATCTTCGCC GCTGATGGGG CGGCGCAGTT CCGGCGCGTG
CTCGTCGCTC CCGCCATACG GGTCGGCGCC GCGGTGCTCT CCTGCGACCA CCTGCCGAAG
AGTCGAGATG GTCAGGGCCG CGACGCTTAC GGGTCCGTCC ACAAGGGCAA TGCGCTCGAC
GGCGCGCGGT TCGTGCTCGA GAACGTCACG CCGTTCGGGC GCGGTATGCG CGGAGCATCC
AACGTCTACG TGACGAAGGA TCGGCCCGGG CATCTGCGGA GCCACGGTCG GCCGTCGAAG
CTCGCGGGCA AGACGTACCT CGGCACTCTT GTCGCCGATG ACTCCGAACC CTTTCAGCCG
TTCTCGCTGA CGCTGTACGC GCCCCAGGAT GACGAGGAGT CGCCCACACA GCAGGCAGCC
GCCAAACTGA CTGACGCCGT GTACGACGTC ATCGCGGCAC AGCCTGATCG CACCGTGCGG
TCAACGCGTG ATCTGTACGC GGCGATGCGG GCTGCTGGGC ACGCTCAACG CAATAGCGCG
TTTCGCGACG CGCTCGACGA TCTGCTCGCC GCTGGACGCA TCGAAGAGGT CAGCGGCGCC
CGCAGGTTGG GGTATCGCGC CGTCGCGACT GTTTCCCAGG AGTGCACCGC ATGA
 
Protein sequence
MPPPDSRVSA PEGTDTRHDA QVDSPDLTGP YAAACNIYAD LGWRGVIPVD PRDKGGVPAG 
FTGYGGIDVT PENMAWFAKS KPGHNIGLRL PDGVIGIDVD AYGPKKGADT FAEAQGRWGA
LPPSYRSTSR DDGISGIRLY RVPAGTKLET IIEFKDLDIR DIEIVQRHHR HVQCWPSIHD
KTGQRYRWVS ELDGSAMDTP PAPDDLPNLP AAWVAALRAE SGSNGTPLNG AEAPVDVQTA
LTEGDASPRV AELLARAIGD CYGGSRFDHT RGNVLTLLRF GKQGDTGVRP ALSALKAVYV
NAVSPDRAGG QRAAEVEFDR LVSGKKVATL LAEPDYNDWV SDLAPANAAD IAAPELPADD
GRAPAATGWE PVDLGPWLRG EIELPTPSLG IARSDGLRLL YPGHEHAVIG ETEAGKSWLA
LQCAAVELRA DNAVVYVHFE EGNPSSTIER LRLLGVDIET MTRRLRFVAP SRALADAEWL
AALLRDPTPT LVVLDGVNEG MALHGLDIFA ADGAAQFRRV LVAPAIRVGA AVLSCDHLPK
SRDGQGRDAY GSVHKGNALD GARFVLENVT PFGRGMRGAS NVYVTKDRPG HLRSHGRPSK
LAGKTYLGTL VADDSEPFQP FSLTLYAPQD DEESPTQQAA AKLTDAVYDV IAAQPDRTVR
STRDLYAAMR AAGHAQRNSA FRDALDDLLA AGRIEEVSGA RRLGYRAVAT VSQECTA