Gene SeD_A3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3420 
SymbolspeB 
ID6872134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3286612 
End bp3287532 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content56% 
IMG OID642786417 
Productagmatinase 
Protein accessionYP_002217055 
Protein GI198246071 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.0247827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCT TAGGTCATCA GTACGATAAC TCACTGGTTT CTAATGCGTT TGGTTTTTTA 
CGTCTGCCAA TGAACTTCCA GCCGTATGAC AGCGATGCCG ACTGGGTGAT CACTGGCGTA
CCGTTTGATA TGGCAACGTC CGGTCGCGCT GGCGGCCGTC ATGGCCCGGC GGCGATCCGT
CAGGTGTCGA CCAACCTCGC CTGGGAACAT CACCGTTTCC CGTGGAGTTT TGACATGCGC
GAGCGCCTGA ACGTCGTGGA CTGCGGCGAT TTGGTGTATG CGTTTGGCGA TGCCCGTGAG
ATGAGTGAAA AATTACAGGC GCACGCTGAA AAACTGCTGT CTGCAGGCAA GCGTATGCTC
TCTTTCGGCG GCGACCACTT CGTCACGCTG CCGCTGCTGC GCGCCCACGC GAAACATTTT
GGCAAAATGG CGCTGGTACA TTTTGATGCG CATACCGATA CCTACGCTAA CGGCTGCGAA
TTCGATCACG GCACGATGTT CTACACCGCG CCGAAAGAAG GCCTGATCGA TCCGCATCAT
TCGGTACAGA TCGGTATTCG TACTGAGTTT GACAAAGACA ATGGCTTTAC CGTGCTGGAT
GCCTGCCAGG TCAACGATCG CGGCGTGGAT GATATTCTCG CTCAGGTGAA ACAGATCGTC
GGCGATATGC CGGTCTATCT GACCTTTGAT ATCGACTGTC TGGACCCGGC GTTTGCGCCT
GGCACCGGTA CGCCGGTGAT CGGCGGTTTG ACCTCCGATC GCGCCATTAA ACTGGTACGC
GGTCTGAAAG ATCTGAACAT TGTCGGTATG GATGTAGTGG AAGTCGCGCC GGCTTACGAT
CAGTCGGAGA TCACCGCTCT GGCGGCCGCG ACGCTGGCAT TAGAAATGCT CTATATCCAG
GCGGCGAAGA AAGGCGAGTA A
 
Protein sequence
MSTLGHQYDN SLVSNAFGFL RLPMNFQPYD SDADWVITGV PFDMATSGRA GGRHGPAAIR 
QVSTNLAWEH HRFPWSFDMR ERLNVVDCGD LVYAFGDARE MSEKLQAHAE KLLSAGKRML
SFGGDHFVTL PLLRAHAKHF GKMALVHFDA HTDTYANGCE FDHGTMFYTA PKEGLIDPHH
SVQIGIRTEF DKDNGFTVLD ACQVNDRGVD DILAQVKQIV GDMPVYLTFD IDCLDPAFAP
GTGTPVIGGL TSDRAIKLVR GLKDLNIVGM DVVEVAPAYD QSEITALAAA TLALEMLYIQ
AAKKGE