Gene Mmcs_5283 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5283 
Symbol 
ID4114110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5570711 
End bp5571691 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content70% 
IMG OID638034439 
Productagmatinase 
Protein accessionYP_642440 
Protein GI108802243 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase
[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAG AGGGAGGACA CGTGCAGTAC GTGCAGTCGG AGACGGGGGT GCTGGGTCAG 
GTCGACGCAC AGGCGGTACC GCGGTACGCC GGAATCGCCA CGTTCGCGCG GTTACCGCAG
CGCCACGAGG TCGGCGACTA CGACATCGCC GTCGTCGGCG TGCCCTTCGA CAGCGGTGTG
ACCTACCGGC CCGGTGCTCG ATTCGGCCCG TCCGCGATCC GGCAGGCGTC CCGGCTGCTC
AAGCCGTACC ACCCCGCGCT CGACGTGTCG CCGTTCGCCG CGGCGCAGGT CGTCGACGCG
GGCGATATCG CGGCCAACCC GTTCGACATC GCCACCGCCG TCGACGAGAT CCGCGCCGGG
GTGCTCGGGC TTCTCACCCG CCCCGAACAG CGTGTCGTGT TGCTGGGCGG GGACCACACC
ATCGCGCTGC CGGCCCTGCA GGCCGTCAAC GAGGTGCACG GTCCGGTGGC GCTGGTGCAC
TTCGACGCCC ACCTCGACAC CTGGGACACC TACTTCGGCG CCCCCTGTAC CCACGGCACC
CCGTTCCGCC GGGCGTCCGA ACAGGGGCTG TTGGTGAAGG ATCGCTCCGC GCACGTCGGC
ATCCGCGGTT CGCTCTACGA CCGCGCCGAC CTGCTCGAGG ACGCCGAACT CGGATTCACC
GTGGTGCACT GCCGCGACAT CGACCGCATC GGCGTCGACG GCGTGATCGA ACGGGTCCTC
GACCGGGTCG GCGACCATCC GGTCTACGTG TCCATCGACA TCGACGTGCT GGACCCGGCG
TTCGCCCCGG GCACGGGAAC CCCGGAGATC GGCGGCATGA CCAGCCGGGA ACTGGTCGCG
GTGCTGCGGG CCATGCGCGG GTGCAACATC GTCGCCGCCG ACATCGTGGA GGTGGCGCCG
GCCTACGACC AGGCCGAGGT CACCGCCGTC GCCGGGGCCA ACCTCGCCTA CGAGCTGATC
ACGCTGATGG CCGACCGATG A
 
Protein sequence
MTTEGGHVQY VQSETGVLGQ VDAQAVPRYA GIATFARLPQ RHEVGDYDIA VVGVPFDSGV 
TYRPGARFGP SAIRQASRLL KPYHPALDVS PFAAAQVVDA GDIAANPFDI ATAVDEIRAG
VLGLLTRPEQ RVVLLGGDHT IALPALQAVN EVHGPVALVH FDAHLDTWDT YFGAPCTHGT
PFRRASEQGL LVKDRSAHVG IRGSLYDRAD LLEDAELGFT VVHCRDIDRI GVDGVIERVL
DRVGDHPVYV SIDIDVLDPA FAPGTGTPEI GGMTSRELVA VLRAMRGCNI VAADIVEVAP
AYDQAEVTAV AGANLAYELI TLMADR