Gene Mkms_3838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3838 
Symbol 
ID4611773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4054193 
End bp4055461 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content68% 
IMG OID639793518 
Productpeptidase M24 
Protein accessionYP_939821 
Protein GI119869869 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGT CGACTCACAC CGGCGTCACC CAGATCGCCC GGACCGGGTA CACGTGGCTG 
GACATCCCGC AGGAGCCCGA CTTCACCCGG CTGCGCAGTG AGGTCGGTGC ACGTCTGCAC
GCCGCGATGG CCGAACAGGG TGTCGACGCG CTGGTTCTGC TGGGCAACGG AAACGTCATG
TACGCCACCG GTATCAGCTG GCCGCTGGCC GATGCCGGCC TGTCACACGT CGAGCGGCCG
GTGGCGGTCG TGCTGGCCGA CGACGAGCAC CCGCACCTGT TCCTGCCCTT CCGCGAGGGT
GCGGCGATGG AGTCGGACCT GCCCGACGAC CACCTGCACG GGCCGGTCTA TCTGGAGTTC
GACGAAGGCG TCGCCGAATT CGCGAAGATC CTGGCCCGCC TGATCCCGGC CGGCGCGACA
GTCGCGACCG ACGAGTTGAC CGGGGCGATG CGGCGGGCCG GCAGCGCGCT GTTCCCCGAC
GCGCCGATCG ATGCGGCCCC GGTGATCGGC GCGGCCAAGA TCGTGAAGAC CATCGACCAG
ATCGCCTGCA TCCGGCGGGC GTGTCAGATC ACCGAACAGG CCGTCGCCGA GATCCAGAAA
TCGCTCGCCC CGGGTGCGCG TCAGATCGAC CTGTCCGCCG AATTCGTGCG CCGCACCTTC
GAACTCGGCG CCACCACCAA CATGTTCGAC TCGATCTGGC AGGCCATGCC GGCGTCGAAG
GCCGAGGGCA CCTGGACCAC CACCGGCGAT CTGGCCCTGC CCCTGCTGAC GACCGAACGT
GAGATCCAGC AGGGCGACGT CCTGTGGACC GACGTGTCCA TCGCCTACCA GGGCTATTGC
TCCGATCACG GACGCACCTG GATCGTCGGT CAGGATCCGA CGCCGGCCCA GCAGAAGCAG
TTCGACAGGT GGAGCGAGAT CGTCGACGCG GTGCTCGCGG TGACCAAGGC CGGTGCGACC
TGCGGCGACC TCGGGCGCGC GGCCACCGCG GCAGCGGGCG GTCAGAAGCC GTGGCTGCCG
CACTTCTACC TGGGCCACGG AATCGGAACC AGCGCGGCCG AAATGCCGAT GATCGGAACG
GATCTCGGTC AGGAGTGGGA CGACAACTTC GTCTTCCCGG CCGGCATGCT CCTGGTGTTC
GAGCCGGTGG TCTGGGAGGA CGGCACCGGC GGCTACCGGG GCGAGGAGAT CGTGGTCGTC
ACCGAGGGCG GCTGGATGCC GCTGACCGAG TATCCCTACG ACCCGTACGA GGTGACCCGT
GGGAATTGA
 
Protein sequence
MTTSTHTGVT QIARTGYTWL DIPQEPDFTR LRSEVGARLH AAMAEQGVDA LVLLGNGNVM 
YATGISWPLA DAGLSHVERP VAVVLADDEH PHLFLPFREG AAMESDLPDD HLHGPVYLEF
DEGVAEFAKI LARLIPAGAT VATDELTGAM RRAGSALFPD APIDAAPVIG AAKIVKTIDQ
IACIRRACQI TEQAVAEIQK SLAPGARQID LSAEFVRRTF ELGATTNMFD SIWQAMPASK
AEGTWTTTGD LALPLLTTER EIQQGDVLWT DVSIAYQGYC SDHGRTWIVG QDPTPAQQKQ
FDRWSEIVDA VLAVTKAGAT CGDLGRAATA AAGGQKPWLP HFYLGHGIGT SAAEMPMIGT
DLGQEWDDNF VFPAGMLLVF EPVVWEDGTG GYRGEEIVVV TEGGWMPLTE YPYDPYEVTR
GN