Gene SNSL254_A3324 details

Gene Information

Locus tag: SNSL254_A3324
Symbol: speB
ID: 6483592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3228877
End bp: 3229797
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 56%
IMG OID: 642738616
Product: agmatinase
Protein accession: YP_002042337
Protein GI: 194443030
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01230] agmatinase
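
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 921 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 3228877
end_bp = 3229797

# Inclusive span: both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# The last codon is assumed to be a stop codon, which encodes no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of the record.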


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0546446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.86641
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCT TAGGTCATCA GTACGATAAC TCACTGGTTT CTAATGCGTT TGGTTTTTTA 
CGTCTGCCAA TGAACTTCCA GCCGTATGAC AGCGATGCCG ACTGGGTGAT CACTGGCGTA
CCGTTTGATA TGGCAACGTC CGGTCGCGCT GGCGGTCGTC ATGGCCCGGC GGCGATCCGT
CAGGTGTCGA CCAACCTCGC CTGGGAACAT CACCGTTTCC CGTGGAATTT TGACATGCGC
GAGCGCCTGA ACGTCGTGGA CTGCGGCGAT TTGGTGTATG CGTTTGGCGA TGCCCGTGAG
ATGAGTGAAA AATTACAGGC GCACGCTGAA AAACTGCTGT CTGCAGGCAA GCGTATGCTC
TCTTTCGGCG GCGACCACTT CGTCACGCTG CCGCTGCTGC GCGCCCACGC GAAACATTTT
GGCAAAATGG CGCTGGTACA TTTTGACGCG CATACCGATA CCTACGCTAA CGGCTGCGAA
TTCGATCACG GCACGATGTT CTACACCGCG CCGAAAGAAG GCCTGATCGA TCCGCATCAT
TCGGTACAGA TCGGTATTCG CACTGAGTTT GACAAAGACA ATGGCTTTAC CGTGCTGGAT
GCCTGCCAGG TCAACGATCG CGGCGTGGAT GATATTCTCG CTCAGGTGAA ACAGATCGTC
GGCGATATGC CGGTCTATCT GACCTTTGAT ATCGACTGTC TGGATCCGGC GTTTGCGCCT
GGCACCGGTA CGCCGGTGAT CGGCGGTTTG ACCTCCGATC GCGCCATTAA ACTGGTACGC
GGTCTGAAAG ATCTGAACAT TGTCGGTATG GATGTAGTGG AAGTCGCGCC GGCTTACGAT
CAGTCGGAGA TCACCGCTCT GGCGGCCGCG ACGCTGGCAT TAGAAATGCT CTATATCCAG
GCGGCGAAGA AGGGCGAGTA A
 
Protein sequence
MSTLGHQYDN SLVSNAFGFL RLPMNFQPYD SDADWVITGV PFDMATSGRA GGRHGPAAIR 
QVSTNLAWEH HRFPWNFDMR ERLNVVDCGD LVYAFGDARE MSEKLQAHAE KLLSAGKRML
SFGGDHFVTL PLLRAHAKHF GKMALVHFDA HTDTYANGCE FDHGTMFYTA PKEGLIDPHH
SVQIGIRTEF DKDNGFTVLD ACQVNDRGVD DILAQVKQIV GDMPVYLTFD IDCLDPAFAP
GTGTPVIGGL TSDRAIKLVR GLKDLNIVGM DVVEVAPAYD QSEITALAAA TLALEMLYIQ
AAKKGE
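
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce the record: it assumes the forward reading frame starting at the first base, uses the codon assignments of translation table 11 (which match the standard code for internal codons), and does not handle alternative start codons or ambiguous bases. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino acid assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence in the record.
first_bases = "ATGAGCACCT TAGGTCATCA GTACGATAAC"
last_bases = "GCGGCGAAGA AGGGCGAGTA A"

print(translate(first_bases))  # MSTLGHQYDN
print(translate(last_bases))   # AAKKGE
```

The two fragments translate to the first ten and last six residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.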