Gene Mmcs_3765 details


Gene Information

Locus tag: Mmcs_3765
Symbol:
ID: 4112596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4019711
End bp: 4020979
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 68%
IMG OID: 638032904
Product: peptidase M24
Protein accession: YP_640927
Protein GI: 108800730
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
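
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4019711
end_bp = 4020979

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon and
# does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1269 bp
print(protein_length, "aa")  # 422 aa
```

Both results match the Gene Length and Protein Length fields of the record.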


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACGT CGACTCACAC CGGCGTCACC CAGATCGCCC GGACCGGGTA CACGTGGCTG 
GACATCCCGC AGGAGCCCGA CTTCACCCGG CTGCGCAGTG AGGTCGGTGC ACGTCTGCAC
GCCGCGATGG CCGAACAGGG TGTCGACGCG CTGGTTCTGC TGGGCAACGG AAACGTCATG
TACGCCACCG GTATCAGCTG GCCGCTGGCC GATGCCGGCC TGTCACACGT CGAGCGGCCG
GTGGCGGTCG TGCTGGCCGA CGACGAGCAC CCGCACCTGT TCCTGCCCTT CCGCGAGGGT
GCGGCGATGG AGTCGGACCT GCCCGACGAC CACCTGCACG GGCCGGTCTA TCTGGAGTTC
GACGAAGGCG TCGCCGAATT CGCGAAGATC CTGGCCCGCC TGATCCCGGC CGGCGCGACA
GTCGCGACCG ACGAGTTGAC CGGGGCGATG CGGCGGGCCG GCAGCGCGCT GTTCCCCGAC
GCGCCGATCG ATGCGGCCCC GGTGATCGGC GCGGCCAAGA TCGTGAAGAC CATCGACCAG
ATCGCCTGCA TCCGGCGGGC GTGTCAGATC ACCGAACAGG CCGTCGCCGA GATCCAGAAA
TCGCTCGCCC CGGGTGCGCG TCAGATCGAC CTGTCCGCCG AATTCGTGCG CCGCACCTTC
GAACTCGGCG CCACCACCAA CATGTTCGAC TCGATCTGGC AGGCCATGCC GGCGTCGAAG
GCCGAGGGCA CCTGGACCAC CACCGGCGAT CTGGCCCTGC CCCTGCTGAC GACCGAACGT
GAGATCCAGC AGGGCGACGT CCTGTGGACC GACGTGTCCA TCGCCTACCA GGGCTATTGC
TCCGATCACG GACGCACCTG GATCGTCGGT CAGGATCCGA CGCCGGCCCA GCAGAAGCAG
TTCGACAGGT GGAGCGAGAT CGTCGACGCG GTGCTCGCGG TGACCAAGGC CGGTGCGACC
TGCGGCGACC TCGGGCGCGC GGCCACCGCG GCAGCGGGCG GTCAGAAGCC GTGGCTGCCG
CACTTCTACC TGGGCCACGG AATCGGAACC AGCGCGGCCG AAATGCCGAT GATCGGAACG
GATCTCGGTC AGGAGTGGGA CGACAACTTC GTCTTCCCGG CCGGCATGCT CCTGGTGTTC
GAGCCGGTGG TCTGGGAGGA CGGCACCGGC GGCTACCGGG GCGAGGAGAT CGTGGTCGTC
ACCGAGGGCG GCTGGATGCC GCTGACCGAG TATCCCTACG ACCCGTACGA GGTGACCCGT
GGGAATTGA
 
Protein sequence
MTTSTHTGVT QIARTGYTWL DIPQEPDFTR LRSEVGARLH AAMAEQGVDA LVLLGNGNVM 
YATGISWPLA DAGLSHVERP VAVVLADDEH PHLFLPFREG AAMESDLPDD HLHGPVYLEF
DEGVAEFAKI LARLIPAGAT VATDELTGAM RRAGSALFPD APIDAAPVIG AAKIVKTIDQ
IACIRRACQI TEQAVAEIQK SLAPGARQID LSAEFVRRTF ELGATTNMFD SIWQAMPASK
AEGTWTTTGD LALPLLTTER EIQQGDVLWT DVSIAYQGYC SDHGRTWIVG QDPTPAQQKQ
FDRWSEIVDA VLAVTKAGAT CGDLGRAATA AAGGQKPWLP HFYLGHGIGT SAAEMPMIGT
DLGQEWDDNF VFPAGMLLVF EPVVWEDGTG GYRGEEIVVV TEGGWMPLTE YPYDPYEVTR
GN
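
The sequence blocks above are printed in groups of ten characters, so the spaces and line breaks have to be removed before any processing. The sketch below shows one way to do that and to translate a nucleotide string with the codon assignments of translation table 11, which are the same as those of the standard table. The function names (`clean`, `gc_percent`, `translate`) are hypothetical helpers written for this example, and only the first and last rows of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()

def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a stop codon becomes '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last rows of the gene sequence, copied from the record.
first_row = clean(
    "ATGACGACGT CGACTCACAC CGGCGTCACC CAGATCGCCC GGACCGGGTA CACGTGGCTG"
)
last_row = clean("GGGAATTGA")

print(translate(first_row))  # MTTSTHTGVTQIARTGYTWL
print(translate(last_row))   # GN*
```

The first row translates to the first twenty residues of the protein sequence, and the last row gives the final two residues followed by the stop codon. Applying `gc_percent` to the full cleaned gene sequence is one way to check the GC content field of the record.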