Gene Mkms_2778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2778 
Symbol 
ID4615699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2897875 
End bp2898912 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content71% 
IMG OID639792443 
Productagmatinase 
Protein accessionYP_938762 
Protein GI119868810 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase
[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.470269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCATG CGCACGACCG CGAGCCGCAA CGGGATGTCC CGCCGGGAAT GGCCGAACAA 
CTCGACCTGC CCCATTCGGG GATGGCCACT TTCGGTCATC GGCCGTTCCT GACCGAGACC
GCCCAACTGG ACTCGTGGCG GCCCGATGTC GCGATCGTCG GTGCGCCGTT CGACGTGGGG
ACCACCAACC GCCCGGGCGC GCGGTTCGGG CCGCGCGCCA TCCGGGCGAC GGCGTACGAA
CCCGGCACGT ACCACATGGA TCTGGGGCTG GAGATCTTCG ACTGGCTGGA GGTGGTCGAC
TTCGGCGACG CCTACTGTCC GCACGGGCAG ACCGAGGTGT CGCACGCCAA CATCCGCGAG
CGGGTGGCCG CGGTCGCGTC GCGCGGCATC GTGCCGGTCG TCCTCGGTGG TGACCACTCG
ATCACGTGGC CGGCAGCCAC CGCGGTCGCC GATGTGCACG GCCACGGCAA CGTCGGCATC
GTGCACTTCG ACGCCCACGC CGACACCGCC GACACCATCG AGGGCAACCT GGCCAGTCAC
GGCACGCCGA TGCGGCGGCT CATCGAATCG GGTGCGGTCC CCGGAACCCA CTTCGTACAA
GTCGGTCTGC GCGGCTACTG GCCGCCGCAG GACACCTTCG AGTGGATGCT CGAACAGGGC
ATGACCTGGC ACACCATGCA GGAGATCTGG GAGCGCGGCT TCCAGGAGGT GATGCGCGAC
GCGGTGGCCG AGGCGCTCGC CAGGGCCGAC AAGCTCTACG TCTCCGTGGA CATCGACGTC
CTCGATCCCG CCCACGCCCC CGGCACCGGG ACCCCGGAAC CGGGCGGCAT CACCAGCGCG
GACCTACTCC GGATGGTCCG GCGGCTCTGT TACGAGCACG ATGTGGCGGG TGTCGACGTC
GTCGAGGTCG CACCGGCCTA CGACCACGCC GAACTGACCG TCAACGCCGC GCACCGGGTG
GTGTTCGAAG CGCTGGCCGG GATGGCGGCC CGCAGGCGCG ACGCCGCGGG CGCCCAGCCC
GGTCCGCCCG CCCGGTGA
 
Protein sequence
MGHAHDREPQ RDVPPGMAEQ LDLPHSGMAT FGHRPFLTET AQLDSWRPDV AIVGAPFDVG 
TTNRPGARFG PRAIRATAYE PGTYHMDLGL EIFDWLEVVD FGDAYCPHGQ TEVSHANIRE
RVAAVASRGI VPVVLGGDHS ITWPAATAVA DVHGHGNVGI VHFDAHADTA DTIEGNLASH
GTPMRRLIES GAVPGTHFVQ VGLRGYWPPQ DTFEWMLEQG MTWHTMQEIW ERGFQEVMRD
AVAEALARAD KLYVSVDIDV LDPAHAPGTG TPEPGGITSA DLLRMVRRLC YEHDVAGVDV
VEVAPAYDHA ELTVNAAHRV VFEALAGMAA RRRDAAGAQP GPPAR