Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3012 |
Symbol | ssnA |
ID | 6145572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3097962 |
End bp | 3099290 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617881 |
Product | putative chlorohydrolase/aminohydrolase |
Protein accession | YP_001745032 |
Protein GI | 170680729 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR03314] putative selenium metabolism protein SsnA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATTC TGAAGAATGT CACCGCAGTG CAGTTACACC CGGCGAAAGT GCAGGAAGGC GTTGATATCG CCATCGAAAA TGATGTGATT GTCGCTATCG GCGATGCCCT GACGCAACGC TACCCCGATG CCAGCTACAA AGAGATGCAT GGTCGGATTG TGATGCCAGG AATTGTCTGC TCGCACAACC ATTTTTACTC GGGGCTGTCC CGCGGAATTA TGGCAAACAT CGCCCCCTGC CCGGATTTCA TCTCAACGCT GAAAAATCTC TGGTGGCGAC TCGATCGCAC CCTTGATGAA GAGTCGCTCT ATTACAGCGG ACTGATTTGT TCCCTGGAAG CGATTAAGGG CGGATGTACA TCGGTTATCG ATCACCATGC CTCTCCGGCG TATATCGACG GGTCGCTCTC CACATTGCGC AACGCATTTT TAAAAGTTGG CCTGCGCGCG ATGACCTGTT TTGAAACTAC TGACCGTAAC AACGGTATCA AAGAGTTGCA GGAAGGTGTA GAAGAAAACA TCCGCTTCGC CCGTCAGATT GATGAGGCGA AGAAAGCAGC AACCGAACCG TATCTGGTGG AAGCACATAT CGGCGCTCAC GCGCCGTTTA CCGTGCCGGA TGCCGGTCTG GAGATGCTGC GTGAAGCCGT GAAAGCCACA GGTCGTGGTT TGCACATTCA CGCTGCGGAA GACCTTTATG ACGTTTCCTA CAGTCACCAC TGGTACGGCA AAGACCTGCT GGCACGACTG GCGCAATTCG ATCTCATCGA CAGCAAAACG CTGGTCGCTC ATGGGCTGTA CTTGTCGAAA GATGACATCG CCCTACTCAA TCAGCGCGAT GCGTTCCTGG TGCATAACGC CCGTTCAAAC ATGAACAACC ATGTCGGCTA CAACCATCAC CTTAGCGACA TCCGCAATCT GGCGTTGGGA ACTGACGGCA TTGGTTCGGA CATGTTTGAA GAGATGAAAT TTGCCTTCTT TAAACATCGC GATGCGGGTG GTCCGCTGTG GCCTGACAGT TTTGCCAAAG CACTGGCTAA CGGCAACGAA CTGATGAGCC GCAACTTTGG CGCGAAATTT GGCCTTCTGG AAGCCGGTTA CAAAGCCGAT TTAACCATTT GCGATTACAA CTCGCCGACA CCGCTGCTGG CAGACAATAT CGCCGGGCAT ATCGCTTTCG GTATGGGCTC AGGCAGCGTT CATAGCGTGA TGGTCAATGG CGTGATGGTC TATGAAGACC GTCAGTTTAA CTTCGATTGC GATTCCATTT ATGCGCAAGC CAGAAAAGCC GCTGCCAGTA TGTGGCGTCG GATGGATGCG CTGGCATAA
|
Protein sequence | MLILKNVTAV QLHPAKVQEG VDIAIENDVI VAIGDALTQR YPDASYKEMH GRIVMPGIVC SHNHFYSGLS RGIMANIAPC PDFISTLKNL WWRLDRTLDE ESLYYSGLIC SLEAIKGGCT SVIDHHASPA YIDGSLSTLR NAFLKVGLRA MTCFETTDRN NGIKELQEGV EENIRFARQI DEAKKAATEP YLVEAHIGAH APFTVPDAGL EMLREAVKAT GRGLHIHAAE DLYDVSYSHH WYGKDLLARL AQFDLIDSKT LVAHGLYLSK DDIALLNQRD AFLVHNARSN MNNHVGYNHH LSDIRNLALG TDGIGSDMFE EMKFAFFKHR DAGGPLWPDS FAKALANGNE LMSRNFGAKF GLLEAGYKAD LTICDYNSPT PLLADNIAGH IAFGMGSGSV HSVMVNGVMV YEDRQFNFDC DSIYAQARKA AASMWRRMDA LA
|
| |