Gene Mjls_5662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_5662 
Symbol 
ID4881359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp5914980 
End bp5915960 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content70% 
IMG OID640142980 
Productputative agmatinase 
Protein accessionYP_001073916 
Protein GI126438225 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase
[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAG AGGGAGGACA CGTGCAGTAC GTGCAGTCGG AGACGGGGGT GCTGGGTCAG 
GTCGACGCAC AGGCGGTACC GCGGTACGCC GGAATCGCCA CGTTCGCGCG GTTACCGCAG
CGCCACGAGG TCGGCGACTA CGACATCGCC GTCGTCGGCG TGCCCTTCGA CAGCGGTGTG
ACCTACCGGC CCGGTGCTCG ATTCGGCCCG TCCGCGATCC GGCAGGCGTC CCGGCTGCTC
AAGCCGTACC ACCCCGCGCT CGACGTGTCG CCGTTCGCCG CGGCGCAGGT CGTCGACGCG
GGCGATATCG CGGCCAACCC GTTCGACATC GCCACCGCCG TCGACGAGAT CCGCGCCGGG
GTGCTCGGGC TTCTCACCCG CCCCGAACAG CGTGTCGTGT TGCTGGGCGG GGACCACACC
ATCGCGCTGC CGGCCCTGCA GGCCGTCAAC GAGGTGCACG GTCCGGTGGC GCTGGTGCAC
TTCGACGCCC ACCTCGACAC CTGGGACACC TACTTCGGCG CCCCCTGTAC CCACGGCACC
CCGTTCCGCC GGGCGTCCGA ACAGGGGCTG TTGGTGAAGG ATCGCTCCGC GCACGTCGGC
ATCCGCGGTT CGCTCTACGA CCGCGCCGAC CTGCTCGAGG ACGCCGAACT CGGATTCACC
GTGGTGCACT GCCGCGACAT CGACCGCATC GGCGTCGACG GCGTGATCGA ACGGGTCCTC
GACCGGGTCG GCGACCATCC GGTCTACGTG TCCATCGACA TCGACGTGCT GGACCCGGCG
TTCGCCCCGG GCACGGGAAC CCCGGAGATC GGCGGCATGA CCAGCCGGGA ACTGGTCGCG
GTGCTGCGGG CCATGCGCGG GTGCAACATC GTCGCCGCCG ACATCGTGGA GGTGGCGCCG
GCCTACGACC AGGCCGAGGT CACCGCCGTC GCCGGGGCCA ACCTCGCCTA CGAGCTGATC
ACGCTGATGG CCGACCGATG A
 
Protein sequence
MTTEGGHVQY VQSETGVLGQ VDAQAVPRYA GIATFARLPQ RHEVGDYDIA VVGVPFDSGV 
TYRPGARFGP SAIRQASRLL KPYHPALDVS PFAAAQVVDA GDIAANPFDI ATAVDEIRAG
VLGLLTRPEQ RVVLLGGDHT IALPALQAVN EVHGPVALVH FDAHLDTWDT YFGAPCTHGT
PFRRASEQGL LVKDRSAHVG IRGSLYDRAD LLEDAELGFT VVHCRDIDRI GVDGVIERVL
DRVGDHPVYV SIDIDVLDPA FAPGTGTPEI GGMTSRELVA VLRAMRGCNI VAADIVEVAP
AYDQAEVTAV AGANLAYELI TLMADR