Gene Mmcs_2734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_2734 
Symbol 
ID4111566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp2879987 
End bp2881024 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content71% 
IMG OID638031858 
Productagmatinase 
Protein accessionYP_639897 
Protein GI108799700 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase
[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCCATG CGCACGACCG CGAGCCGCAA CGGGATGTCC CGCCGGGAAT GGCCGAACAA 
CTCGACCTGC CCCATTCGGG GATGGCCACT TTCGGTCATC GGCCGTTCCT GACCGAGACC
GCCCAACTGG ACTCGTGGCG GCCCGATGTC GCGATCGTCG GTGCGCCGTT CGACGTGGGG
ACCACCAACC GCCCGGGCGC GCGGTTCGGG CCGCGCGCCA TCCGGGCGAC GGCGTACGAA
CCCGGCACGT ACCACATGGA TCTGGGGCTG GAGATCTTCG ACTGGCTGGA GGTGGTCGAC
TTCGGCGACG CCTACTGTCC GCACGGGCAG ACCGAGGTGT CGCACGCCAA CATCCGCGAG
CGGGTGGCCG CGGTCGCGTC GCGCGGCATC GTGCCGGTCG TCCTCGGTGG TGACCACTCG
ATCACGTGGC CGGCAGCCAC CGCGGTCGCC GATGTGCACG GCCACGGCAA CGTCGGCATC
GTGCACTTCG ACGCCCACGC CGACACCGCC GACACCATCG AGGGCAACCT GGCCAGTCAC
GGCACGCCGA TGCGGCGGCT CATCGAATCG GGTGCGGTCC CCGGAACCCA CTTCGTACAA
GTCGGTCTGC GCGGCTACTG GCCGCCGCAG GACACCTTCG AGTGGATGCT CGAACAGGGC
ATGACCTGGC ACACCATGCA GGAGATCTGG GAGCGCGGCT TCCAGGAGGT GATGCGCGAC
GCGGTGGCCG AGGCGCTCGC CAGGGCCGAC AAGCTCTACG TCTCCGTGGA CATCGACGTC
CTCGATCCCG CCCACGCCCC CGGCACCGGG ACCCCGGAAC CGGGCGGCAT CACCAGCGCG
GACCTACTCC GGATGGTCCG GCGGCTCTGT TACGAGCACG ATGTGGCGGG TGTCGACGTC
GTCGAGGTCG CACCGGCCTA CGACCACGCC GAACTGACCG TCAACGCCGC GCACCGGGTG
GTGTTCGAAG CGCTGGCCGG GATGGCGGCC CGCAGGCGCG ACGCCGCGGG CGCCCAGCCC
GGTCCGCCCG CCCGGTGA
 
Protein sequence
MGHAHDREPQ RDVPPGMAEQ LDLPHSGMAT FGHRPFLTET AQLDSWRPDV AIVGAPFDVG 
TTNRPGARFG PRAIRATAYE PGTYHMDLGL EIFDWLEVVD FGDAYCPHGQ TEVSHANIRE
RVAAVASRGI VPVVLGGDHS ITWPAATAVA DVHGHGNVGI VHFDAHADTA DTIEGNLASH
GTPMRRLIES GAVPGTHFVQ VGLRGYWPPQ DTFEWMLEQG MTWHTMQEIW ERGFQEVMRD
AVAEALARAD KLYVSVDIDV LDPAHAPGTG TPEPGGITSA DLLRMVRRLC YEHDVAGVDV
VEVAPAYDHA ELTVNAAHRV VFEALAGMAA RRRDAAGAQP GPPAR