Gene Mmcs_4787 details


Gene Information

Locus tag: Mmcs_4787
Symbol:
ID: 4113616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 5066674
End bp: 5068002
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 67%
IMG OID: 638033938
Product: peptidase M1, membrane alanine aminopeptidase
Protein accession: YP_641947
Protein GI: 108801750
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
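
The Start bp, End bp, Gene Length, and Protein Length values are mutually consistent if the coordinates are read as 1-based and inclusive, and if the stop codon is counted in the gene length but not in the protein length. A minimal Python sketch of that arithmetic follows. The function names are illustrative, and the coordinate convention is an assumption that this record does not state explicitly.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(5066674, 5068002)
    print(length)                     # 1329
    print(protein_length_aa(length))  # 442
```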


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.475246
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGCGGT CGAAGAAGGG GAAGAACACC CCGGCCGTCA TCGACCCCTA TCTGCCCGAG 
ACCGGCAACT TCGGGTACCG GGTGTCGCGG TACGAACTCG ACCTGGAGTA CAAGGTCGCG
ATCAACCGGT TGTCGGGTTC GGCGTCGATC ACGGCCGTCA CGCTGGCCGC GCTGCGCACC
TTCACCCTGG ATCTGGCCGA CACCCTGCGG GTGACGAAGG TGGCGGTCAA CGGCCGCCGA
CCCGCCCGGT TCTCCTGTGC GAACGACAAG CTGCGGGTCG AGTTGTCCTC GGCGTTGCCT
GCGGGCGCGG CGCTGGTCGT GGAGGTCCGG TACGGCGGCA CCCCCGAGCC GATCGAAACC
CTCTGGGGTG ACGTCGGTTT CGAGGAGCTG ACGAACGGGG CGCTGGTCGC GGGGCAGCCC
AACGGCGCGT CCTCGTGGTT CCCGTGCGAC GACCACCCCA GCGCGAAGGC CAGCTACCGC
ATCCAGATCA GCACCGACAG CCCGTACCGC GCGATCGCCA ACGGCGAGTT GGTGTCCCGC
CGCGCCAGGG CGGGTCACAC CGTGTGGACC TACGAGCAGG CCGAACCGAC GTCGACGTAC
CTGATCACGT TGCAGATCGG CATGTACGAC GTGCACAAGC TGGCGAAGTC GCCGGTGCCC
ATCCAGGCCG CCCTTCCGGC CCGGCTGCGG AGCAACTTCG ACCACAGCTT CGGCCGCCAA
CCACAGATGA TGAAACTGTT CGAGAAGCTG TTCGGACCGT ACCCGCTGGC GAGTGGCTAC
ACCGTGGTGG TCACCGACGA CGCCCTCGAG ATACCCCTTG AGGCACAAGG TATTTCGATT
TTCGGCGCCA ACCATTGCGA CGGCACTCGA AGGGCCGAGC GGTTGATCGC GCACGAGTTG
GCCCACCAGT GGTTCGGCAA CTCGGTCACC GTGCGTCGGT GGCGCGACAT CTGGCTGCAC
GAGGGTTTCG CGTGTTACGC GGAGTGGTTG TGGTCGGAGA ACTCCGGCGG CCGCAGCGCC
GACGAATGGG CCCACCACTA TCACGGACGG CTGACCGATC TGCCCCAGGA CCTCCTGCTC
GCCGACCCCG GTCCGAAGGA CATGTTTGAC GACCGGGTGT ACAAACGCGG CGCGCTGACA
CTGCACGTGC TGCGCACCCG CATCGGCGAC GAGAAGTTCT TCGCCCTGCT GCGGGACTGG
ACGGCGCGAC ACCGTCACAG CACCGCGTTC ACCGACGACT TCACCGGTCT GGCAGCCAAT
TACGCCGACG AGTCACTGCG CCCGCTGTGG GATGCCTGGC TCTACGCCGA GGATCTGCCG
AAGCTGTGA
 
Protein sequence
MTRSKKGKNT PAVIDPYLPE TGNFGYRVSR YELDLEYKVA INRLSGSASI TAVTLAALRT 
FTLDLADTLR VTKVAVNGRR PARFSCANDK LRVELSSALP AGAALVVEVR YGGTPEPIET
LWGDVGFEEL TNGALVAGQP NGASSWFPCD DHPSAKASYR IQISTDSPYR AIANGELVSR
RARAGHTVWT YEQAEPTSTY LITLQIGMYD VHKLAKSPVP IQAALPARLR SNFDHSFGRQ
PQMMKLFEKL FGPYPLASGY TVVVTDDALE IPLEAQGISI FGANHCDGTR RAERLIAHEL
AHQWFGNSVT VRRWRDIWLH EGFACYAEWL WSENSGGRSA DEWAHHYHGR LTDLPQDLLL
ADPGPKDMFD DRVYKRGALT LHVLRTRIGD EKFFALLRDW TARHRHSTAF TDDFTGLAAN
YADESLRPLW DAWLYAEDLP KL