Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3079 |
Symbol | speB |
ID | 6146468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3168888 |
End bp | 3169808 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617947 |
Product | agmatinase |
Protein accession | YP_001745098 |
Protein GI | 170682959 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01230] agmatinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00297049 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCACCT TAGGTCATCA ATACGATAAC TCACTGGTTT CCAACGCCTT TGGTTTTTTA CGCCTGCCGA TGAACTTCCA GCCATATGAC AGCGATGCTG ACTGGGTGAT TACTGGCGTG CCGTTCGATA TGGCCACTTC TGGTCGTGCG GGTGGACGCC ACGGTCCGGC AGCTATCCGT CAGGTTTCGA CGAATCTGGC CTGGGAACAC AATCGCTTCC CGTGGAATTT CGACATGCGT GAGCGCCTGA ACGTCGTGGA CTGCGGCGAT CTGGTATATG CCTTCGGAGA TGCCCGTGAG ATGAGCGAGA AGCTGCAGGC GCACGCCGAG AAGCTGCTGG CTGCCGGTAA GCGTATGCTC TCCTTCGGTG GTGACCACTT TGTTACGCTG CCGCTGCTGC GTGCTCATGC GAAGCATTTC GGTAAAATGG CGCTGGTACA CTTTGACGCC CACACCGATA CCTATGCGAA CGGTTGTGAA TTTGACCACG GCACCATGTT CTATACCGCG CCGAAAGAAG GTCTGATCGA CCCGAATCAT TCCGTGCAGA TTGGTATTCG TACCGAGTTT GATAAAGACA ACGGCTTTAC CGTGCTGGAC GCCTGCCAGG TGAACGATCG CAGCGTGGAT GACGTTATCG CCCAGGTGAA ACAGATTGTG GGTGATATGC CGGTTTACCT GACCTTTGAT ATCGACTGCC TGGATCCTGC TTTTGCACCA GGCACCGGTA CGCCAGTGAT TGGCGGCCTG ACCTCCGATC GCGCTATTAA ACTGGTACGC GGCCTGAAAG ATCTCAACAT CGTTGGGATG GACGTAGTGG AAGTGGCTCC GGCATACGAT CAGTCGGAAA TCACCGCTCT GGCTGCGGCG ACGCTGGCGC TGGAAATGCT GTATATTCAG GCGGCGAAAA AGGGCGAGTA A
|
Protein sequence | MSTLGHQYDN SLVSNAFGFL RLPMNFQPYD SDADWVITGV PFDMATSGRA GGRHGPAAIR QVSTNLAWEH NRFPWNFDMR ERLNVVDCGD LVYAFGDARE MSEKLQAHAE KLLAAGKRML SFGGDHFVTL PLLRAHAKHF GKMALVHFDA HTDTYANGCE FDHGTMFYTA PKEGLIDPNH SVQIGIRTEF DKDNGFTVLD ACQVNDRSVD DVIAQVKQIV GDMPVYLTFD IDCLDPAFAP GTGTPVIGGL TSDRAIKLVR GLKDLNIVGM DVVEVAPAYD QSEITALAAA TLALEMLYIQ AAKKGE
|
| |