Gene EcSMS35_3079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3079 
SymbolspeB 
ID6146468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3168888 
End bp3169808 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content55% 
IMG OID641617947 
Productagmatinase 
Protein accessionYP_001745098 
Protein GI170682959 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00297049 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCACCT TAGGTCATCA ATACGATAAC TCACTGGTTT CCAACGCCTT TGGTTTTTTA 
CGCCTGCCGA TGAACTTCCA GCCATATGAC AGCGATGCTG ACTGGGTGAT TACTGGCGTG
CCGTTCGATA TGGCCACTTC TGGTCGTGCG GGTGGACGCC ACGGTCCGGC AGCTATCCGT
CAGGTTTCGA CGAATCTGGC CTGGGAACAC AATCGCTTCC CGTGGAATTT CGACATGCGT
GAGCGCCTGA ACGTCGTGGA CTGCGGCGAT CTGGTATATG CCTTCGGAGA TGCCCGTGAG
ATGAGCGAGA AGCTGCAGGC GCACGCCGAG AAGCTGCTGG CTGCCGGTAA GCGTATGCTC
TCCTTCGGTG GTGACCACTT TGTTACGCTG CCGCTGCTGC GTGCTCATGC GAAGCATTTC
GGTAAAATGG CGCTGGTACA CTTTGACGCC CACACCGATA CCTATGCGAA CGGTTGTGAA
TTTGACCACG GCACCATGTT CTATACCGCG CCGAAAGAAG GTCTGATCGA CCCGAATCAT
TCCGTGCAGA TTGGTATTCG TACCGAGTTT GATAAAGACA ACGGCTTTAC CGTGCTGGAC
GCCTGCCAGG TGAACGATCG CAGCGTGGAT GACGTTATCG CCCAGGTGAA ACAGATTGTG
GGTGATATGC CGGTTTACCT GACCTTTGAT ATCGACTGCC TGGATCCTGC TTTTGCACCA
GGCACCGGTA CGCCAGTGAT TGGCGGCCTG ACCTCCGATC GCGCTATTAA ACTGGTACGC
GGCCTGAAAG ATCTCAACAT CGTTGGGATG GACGTAGTGG AAGTGGCTCC GGCATACGAT
CAGTCGGAAA TCACCGCTCT GGCTGCGGCG ACGCTGGCGC TGGAAATGCT GTATATTCAG
GCGGCGAAAA AGGGCGAGTA A
 
Protein sequence
MSTLGHQYDN SLVSNAFGFL RLPMNFQPYD SDADWVITGV PFDMATSGRA GGRHGPAAIR 
QVSTNLAWEH NRFPWNFDMR ERLNVVDCGD LVYAFGDARE MSEKLQAHAE KLLAAGKRML
SFGGDHFVTL PLLRAHAKHF GKMALVHFDA HTDTYANGCE FDHGTMFYTA PKEGLIDPNH
SVQIGIRTEF DKDNGFTVLD ACQVNDRSVD DVIAQVKQIV GDMPVYLTFD IDCLDPAFAP
GTGTPVIGGL TSDRAIKLVR GLKDLNIVGM DVVEVAPAYD QSEITALAAA TLALEMLYIQ
AAKKGE