Gene Information |
Locus tag | ECH74115_4169 |
Symbol | ssnA |
ID | 6969203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3861709 |
End bp | 3863037 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387915 |
Product | putative chlorohydrolase/aminohydrolase |
Protein accession | YP_002272354 |
Protein GI | 209398297 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR03314] putative selenium metabolism protein SsnA |
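The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3861709, 3863037)
print(length_bp)                  # 1329
print(protein_length(length_bp))  # 442
```

Both results agree with the Gene Length and Protein Length fields reported in the record.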
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATTC TGAAGAATGT CACCGCAGTG CAGTTACACC CGGCGAAAGT GCAGGAAGGC GTTGATATCG CTATCGAAAA TGATGTGATT GTCGCTATCG GCGATGCCCT GACGCAACGC TACCCCGATG CCAGCTACAA AGAGATGCAT GGTCGGATTG TGATGCCGGG AATTGTCTGC TCGCACAACC ATTTTTACTC GGGGCTTTCC CGCGGAATTA TGGCAAACAT CGCCCCCAGC CCAGATTTCA TCTCAACGCT GAAAAATCTC TGGTGGCGGC TCGATCGCGC CCTTGATGAA GAGTCGCTCT ATTACAGCGG GCTGATTTGT TCCCTGGAAG CGATTAAGAG CGGATGTACA TCGGTTATCG ATCACCATGC CTCTCCGGCG TATATCGGCG GGTCGCTCTC CACATTGCGC GACGCATTTT TAAAAGTTGG CCTGCGCGCG ATGACCTGTT TTGAAACTAC TGACCGTAAC AACGGCATCA AAGAGTTGCA GGAAGGTGTA GAAGAAAACA TCCGCTTCGC CCGTCAGATT GATGAGGCGA AGAAAGCAGC AACCGAGCCG TATCTGGTGG AAGCACATAT CGGTGCTCAC GCGCCGTTTA CCGTGCCAGA TGCCGGTCTG GAGATGCTAC GTGAAGCCGT GAACGCCACA GGCCGTGGTT TGCATATTCA CGCTGCGGAA GACCTTTACG ACGTTTCCTA CAGTCACCAC TGGTACGGCA AAGACCTGCT GGCACGACTG GCGCAATTCG ATCTCATCGA TAGCAAAACG CTGGTCGCTC ATGGGCTGTA CTTGTCGAAA GATGACATCG CCCTACTCAA TCAGCGCGAT GCGTTCCTGG TGCATAACGC CCGTTCAAAC ATGAACAACC ATGTCGGCTA CAACCATCAC CTTAGCGACA TCCGCAATCT GGCGTTGGGA ACGGACGGCA TTGGTTCGGA CATGTTTGAA GAGATGAAAT TTGCCTTCTT TAAACATCGC GATGCGGGTG GTCCGCTGTG GCCTGACAGT TTTGCCAAAG CCCTGACTAA CGGTAACGAA CTGATGAGCC GCAACTTTGG CGCGAAATTT GGGCTTCTGG AAGCCGGTTA CAAAGCTGAT TTAACCATTT GCGATTACAA CTCGCCGACG CCGCTGCTGG CAGACAATAT CGCCGGGCAT ATCGCTTTCG GTATGGGCTC AGGCAGCGTT CACAGCGTGA TGGTCAATGG TGTGATGGTC TATGAAGACC GTCAGTTTAA CTTCGATTGC GATTCCATTT ATGCACAAGC CAGAAAAGCC GCTGCCAGTA TGTGGCGTCG GATGGATGCG CTGGCATAA
|
Protein sequence | MLILKNVTAV QLHPAKVQEG VDIAIENDVI VAIGDALTQR YPDASYKEMH GRIVMPGIVC SHNHFYSGLS RGIMANIAPS PDFISTLKNL WWRLDRALDE ESLYYSGLIC SLEAIKSGCT SVIDHHASPA YIGGSLSTLR DAFLKVGLRA MTCFETTDRN NGIKELQEGV EENIRFARQI DEAKKAATEP YLVEAHIGAH APFTVPDAGL EMLREAVNAT GRGLHIHAAE DLYDVSYSHH WYGKDLLARL AQFDLIDSKT LVAHGLYLSK DDIALLNQRD AFLVHNARSN MNNHVGYNHH LSDIRNLALG TDGIGSDMFE EMKFAFFKHR DAGGPLWPDS FAKALTNGNE LMSRNFGAKF GLLEAGYKAD LTICDYNSPT PLLADNIAGH IAFGMGSGSV HSVMVNGVMV YEDRQFNFDC DSIYAQARKA AASMWRRMDA LA
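The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a rough sketch, not the database's own method: it assumes the sequence is given in 10-base blocks separated by whitespace, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). All names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
first_blocks = "ATGTTGATTC TGAAGAATGT CACCGCAGTG"
print(translate(first_blocks))  # MLILKNVTAV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.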