Gene Mkms_1872 details

Gene Information

Locus tag: Mkms_1872
Symbol:
ID: 4613799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1985738
End bp: 1986835
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 69%
IMG OID: 639791537
Product: agmatinase
Protein accession: YP_937862
Protein GI: 119867910
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01230] agmatinase
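
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed 1098 bp) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1985738, 1986835)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```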


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.548708
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCATCCA GTGCATTCGG ACCCGACATC ACTTTCCTCG GCGTCCCTCG ATGCGACCTC 
TCCGATGCGT CGACGTACAC CGACGCCGAC ATCGTCATCC TCGGTGCACC CCTCGACGGC
GGTACGACCT ACCGCTCAGG TGCCCGGTTC GGGCCATCGG CGCTGCGCCA GGCGTGCTAC
CTGCCCCAGG ACGGGTCGCG GCCCAGCCTG GCGCTGCGGG TCGACGGACT CAAGGATCTC
CGGGTGTACG ACGCCGGCGA CGTCGCCCTC TACAGCGGCA ACGTCGAACA GGCGGTGCAG
TTGATCGAGG AGGAGGTGTT CACGATCTCC GCCGCGGGCG CGATTCCCAT CATCCTCGGT
GGCGACCACA CCATCGCCTG GCCGGACCAT ACCGGGGTGG CCCGCCAACA CGGTTTCGGC
AAGGTGTCGA TGATCCACTT CGACGCCCAC GCCGACACCG GCGACATCCA CGCGGGTTCC
CTTGTGGGTC ATGGCACCCC GATGCGAAGG TTGATCGAGT CCGGGGCGCT GCGCGGTGAC
CGGTTCCTGC AGTTGGGGTT GCGCGGCTAC TGGCCCGACG AGCCGGTGCT GCAGTGGATG
GCCGCCCAGG GTCTGCGGTC CTACGAGATG ACCGAGATCG TGGCGCGCGG CCTCGAGACC
TGTCTGACCG AGGCGTTCGA GATCGCCACC GACGACTGTG GGGGCGTGTT CCTGTCCGTC
GACATCGACG TCTGCGATCC CGGGCATGCC CCCGGCACCG GAACCCCTGA GCCGGGCGGC
TTCTCGGCGC GCCAACTGCT CGACGCGGTC CGCCGGATCT GCTACGAGCT TCCGGTGCTC
GGCGTCGACG TCGTCGAGGT CGCCCCGCCC TACGACCATG CCGACATCAC CGCACTGCTC
GGCAACCGCG TGGTGCTGGA GGTCATCTCG GCGATCGCGC GGCGCCGGAA GGACGCCGCG
GCCGGCACCA CCTGGGATCC CAGCCAACCT CTGCTCGCGG GCCGGGAGGT CGACGAGCAG
ATGCGCCTGA TCGCCGAGGG TGAGGACGCG CGCCGCCGCG GCGAGGGCGG TGGGCGGCCA
CACCACCATC ACCACTGA
 
Protein sequence
MASSAFGPDI TFLGVPRCDL SDASTYTDAD IVILGAPLDG GTTYRSGARF GPSALRQACY 
LPQDGSRPSL ALRVDGLKDL RVYDAGDVAL YSGNVEQAVQ LIEEEVFTIS AAGAIPIILG
GDHTIAWPDH TGVARQHGFG KVSMIHFDAH ADTGDIHAGS LVGHGTPMRR LIESGALRGD
RFLQLGLRGY WPDEPVLQWM AAQGLRSYEM TEIVARGLET CLTEAFEIAT DDCGGVFLSV
DIDVCDPGHA PGTGTPEPGG FSARQLLDAV RRICYELPVL GVDVVEVAPP YDHADITALL
GNRVVLEVIS AIARRRKDAA AGTTWDPSQP LLAGREVDEQ MRLIAEGEDA RRRGEGGGRP
HHHHH
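
The gene sequence begins with TTG, yet the protein sequence begins with M. This is expected under translation table 11, where TTG is one of the alternative start codons and an initiator codon is read as methionine. The following is a minimal sketch of that translation in plain Python, not the tool the database itself uses. The codon table and start-codon set are written out from the NCBI definition of table 11 as I understand it, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard NCBI codon ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError("CDS does not end with a stop codon")
    body = "".join(CODON_TABLE[c] for c in codons[1:-1])
    if "*" in body:
        raise ValueError("internal stop codon")
    # The initiator codon is read as methionine whatever its usual meaning.
    return "M" + body


# First six codons of the gene sequence above, plus its TGA stop codon.
print(translate_cds("TTGGCATCCA GTGCATTC TGA"))  # MASSAF
```

Passing the full gene sequence (spaces and line breaks included) to the same function should give the 365-residue protein listed above.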