Gene Mkms_0799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0799 
Symbol 
ID4614819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp851624 
End bp853882 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content71% 
IMG OID639790475 
Producthypothetical protein 
Protein accessionYP_936805 
Protein GI119866853 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.666809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.067283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGTTGA TCTCCACCGC CCTGCTTCCG CTGCGTATCG CGCGCCGCGT CGCCGAGGCC 
GTCAAGAACG CGGTCGTCGC CGCCGCGCCG TCCCCGCCGG ACCTCGTCGT GATCGACGGG
ATGCCCGAGG GGGTGCCGCC GGCGGCGCTG CAGCGCGAGC CCGCCCTGCC GGTGCCGGCC
GGCTGGCCGT TCGGCGAGGA CTTCCCCCGC ACGTGCGGGA CGGGCCGCTT CGCGGGTGGG
GCGCTGTTCT GGACCGATTT CCTCTACGAC GACCACGGCG CCTCGGGTGT CCCCGTCAGC
GTGCCCACCG GGGGCGCACC GCCGCGCGGC ACCTACATCT ATCCGGACGG GCCCGCGGCC
CGCAACGGCG CGGACATCTT CCGGGTGGCG ATCGGGCTGA GCACGACCGA CACGTGGTGG
CGGATCGACT GGAACACTCT GTTGGACAAG ACGATTCCGG TCGGCCTGTT CACCTTCGAC
ACCGACCGCG GCACCTCGGC GACCAACGAG TGGCCGGCGA ACGCGGGTGT GCGTTCCACC
GGTATCGACC TGGCGCTGCT GATCTCCGGG ACCGGATCGT GGTTGATCGA CCTGACCACC
GGCACGCGCA CCGCGGTCGA ACACCGGGTC GACATGGAAT CGCGGTCGTT CCTGGCGACG
GTGCCCCGCG TGCTCGTCGA GCCCACCGGC ACGTGGACGG TGCGTTTGGC CGCCGGGTTG
GCCAACGAGG CGGGTGACGG GTTCGCGGAG GTGCCGTTCG TCCGCGGCGC CGGACCCGGT
CAGCCCAACG TCTACAACCT CGCGTTCCGC ACCCGCGAAC AGGAGAAGGT GCCGCTGAAC
TTCTGGTCGG ATCAGGCCCA GGCCGACGCC CTCGAAGACG GTGACGTATC GCAGTTCTCG
GTGGCGGTGC CGTGGGATCA GCTGGCCGGG CGGCGGACCG ATCCGGAACA GGTGGTGACC
GGGACGTCCA CCCGCTGGTA TGTGTCGTCG GTCGAACTGG GGCAGGGGGT CGGCCCGGGC
AACATCCTCG ACACCGAACC CCAGTTCCTG GGCCGGGTGC AGCCGTATTC GGTGTGCGTG
CCCGCGACGT ACGCTCGGGG CCGGCGGCTC CCGCTGACGC TGATGCTGCA CTCGCTGGCG
CTCGGGCAGA ACCAGTACGC CGCGTTCGAC CCGAAGCTGC TGCACGAGGT GTGCGACGGA
CGCAACTCGA TCGTCGTGAC ACCGCTGGCG CGCGGCCCGG CGTGCTGGTA CTTCGACGAG
GGCGAACTCG ACGTGTGGGA GGTGTTCGCC CGGGTCGTCG AACAGCTCGG CGCCGACCCG
AACCGCACGG TCGTCTCGGG ATACTCGATG GGCGGATACG GCACGTACAA GCTGGGCCTG
AGCTATCCGG AGGTGTTCGC TCAAGCGGTC GTGCTCGCCG GCCCGCCGAC GTGCGGGGTG
CGGCTGGTGC GCGGGGTCGA GGCGCCGGCC GACTTCGACC CGACGTCGCA CTGCGCGCGG
GAGGGCGACA CGTTCCGGCT GCTCGGCAAC GCGCGGTGGC TGCCGTTCGT CATCGCCCAC
GGCGGACTGG ACCAGTTGGT GCCGTTCCCG GGGATCGTGC AGCAGGTCCT TGAGCTCGAC
CGGCTGGGTT ACCGGTACCG GTTCGCGACG TATCCGGTCG AGGACCACAT CGCGTTCCTG
CTCAAGGACG ACTTCGACGA TCCGATCGCA CACATGGGAA CCGGTCTGCG ACAACCGGAT
CCGGGCCACA TCACGTTCTC GTGGTATCCG CAGCTGGTGC GTGAGGACCT CGGCATCGGA
CCGCACCGGG TGTGGTGGGT GTCGGAGCTG CGCGCCGCCG ACGACGTGAA GGTCGAGCGC
GGGAAGCTGG CCACCGTCGA CGCCAGGTCC TACGCCCGGC CCGACCGGAC CCGCACCATC
AAGCGCCGCC GCGGGGTGAT CCCCGACTTC GACCCGACGC CCGGTCTGTT CACCGAGATG
ACGTGGGAGC TCGGGGACGA GGTCGCCCCG ATGCCGTGGC TGACGCTGGA GTTGAACGGC
GTCGCCGGTC TCGCGGTGGA CGTCGTGCGG GCGGGGCTGG CGCCGTTGGC GCGGTCGACG
ATCGCGGTCG CCACCGACCG CCCGGTGCAG ATCCGGCTCG CGGCCCTGCC GCCGGGGATG
ACCGTCGAGT TGGACGGACG ACAGGTCGGT GCGGTCGTCG ACGTCCCCGC CGGCCGCTAC
CGCATCACGC TCGCAGCGGA CAATGGGAGC CGGAGGTGA
 
Protein sequence
MSLISTALLP LRIARRVAEA VKNAVVAAAP SPPDLVVIDG MPEGVPPAAL QREPALPVPA 
GWPFGEDFPR TCGTGRFAGG ALFWTDFLYD DHGASGVPVS VPTGGAPPRG TYIYPDGPAA
RNGADIFRVA IGLSTTDTWW RIDWNTLLDK TIPVGLFTFD TDRGTSATNE WPANAGVRST
GIDLALLISG TGSWLIDLTT GTRTAVEHRV DMESRSFLAT VPRVLVEPTG TWTVRLAAGL
ANEAGDGFAE VPFVRGAGPG QPNVYNLAFR TREQEKVPLN FWSDQAQADA LEDGDVSQFS
VAVPWDQLAG RRTDPEQVVT GTSTRWYVSS VELGQGVGPG NILDTEPQFL GRVQPYSVCV
PATYARGRRL PLTLMLHSLA LGQNQYAAFD PKLLHEVCDG RNSIVVTPLA RGPACWYFDE
GELDVWEVFA RVVEQLGADP NRTVVSGYSM GGYGTYKLGL SYPEVFAQAV VLAGPPTCGV
RLVRGVEAPA DFDPTSHCAR EGDTFRLLGN ARWLPFVIAH GGLDQLVPFP GIVQQVLELD
RLGYRYRFAT YPVEDHIAFL LKDDFDDPIA HMGTGLRQPD PGHITFSWYP QLVREDLGIG
PHRVWWVSEL RAADDVKVER GKLATVDARS YARPDRTRTI KRRRGVIPDF DPTPGLFTEM
TWELGDEVAP MPWLTLELNG VAGLAVDVVR AGLAPLARST IAVATDRPVQ IRLAALPPGM
TVELDGRQVG AVVDVPAGRY RITLAADNGS RR