Gene Information |
Locus tag | EcE24377A_3205 |
Symbol | ssnA |
ID | 5587388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3222623 |
End bp | 3223951 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640926845 |
Product | putative chlorohydrolase/aminohydrolase |
Protein accession | YP_001464217 |
Protein GI | 157157569 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR03314] putative selenium metabolism protein SsnA |
|
|
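The coordinate and length fields above are consistent with one another, and a short sketch can confirm this. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are illustrative and are not part of the source record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3222623
end_bp = 3223951


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1329 442
```

Both values match the Gene Length (1329 bp) and Protein Length (442 aa) fields reported in the record.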
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGATTC TGAAGAATGT CACCGCAGTA CAGTTACACC CGGCGAAAGT GCAGGAAGGC GTTGATATCG CCATCGAAAA TGATGTGATT GTCGCTATCG GCGATGCCCT GACGCAACGC TACCCCGATG CCAGCTACAA AGAGATGCAT GGTCGGATTG TGATGCCGGG AATTGTCTGC TCGCACAACC ATTTTTACTC GGGGCTTTCC CGCGGAATTA TGGCAAACAT CGCCCCCTGC CCGGATTTCA TCTCAACGCT GAAAAATCTC TGGTGGCGGC TCGATCGCGC CCTTGATGAA GAGTCGCTCT ATTACAGCGG GCTGATTTGT TCCCTGGAAG CGATTAAGAG CGGATGTACA TCGGTTATCG ATCACCATGC CTCTCCGGCG TATATCGGCG GGTCGCTCTC CACATTGCGC GACGCATTTT TAAAAGTTGG CCTGCGCGCG ATGACCTGTT TTGAAACTAC TGACCGTAAC AACGGCATCA AAGAGTTGCA GGAAGGTGTA GAAGAAAACA TCCGCTTCGC CCGTCAGATT GATGAGGCGA AGAAAGCAGC AACCGAGCCG TATCTGGTGG AAGCACATAT CGGTGCTCAC GCGCCGTTTA CCGTGCCAGA TGCCGGTCTG GAGATGCTAC GTGAAGCCGT GAAAGCCACA GGCCGTGGTT TGCATATTCA CGCTGCGGAA GACCTTTACG ACGTTTCCTA CAGTCACCAC TGGTACGGCA AAGACCTGCT GGCACGACTA GCGCAATTCG ATCTCATCGA TAGCAAAACG CTGGTCGCTC ATGGGCTTTA CTTGTCGAAA GATGACATCA CCCTACTCAA TCAGCGCGAT GCGTTCCTGG TGCATAACGC CCGTTCAAAC ATGAACAACC ATGTCGGCTA CAACCATCAC CTTAGCGACA TCCGCAATCT GGCGTTGGGA ACGGACGGCA TTGGTTCGGA CATGTTTGAA GAGATGAAAT TTGCCTTCTT TAAACATCGC GATGCGGGTG GTCCGCTGTG GCCTGACAGT TTTGCCAAAG CCCTGACTAA TGGCAACGAA CTGATGAGCC GCAACTTTGG CGCGAAATTT GGGCTTCTGG AAGCCGGTTA CAAAGCTGAT TTAACCATTT GCGATTACAA CTCGCCGACG CCGCTGCTGG CAGACAATAT CGCCGGGCAT ATCGCTTTCG GTATGGGCTC AGGCAGCGTT CACAGCGTAA TGGTCAATGG TGTGATGGTC TATGAAGACC GTCAGTTTAA CTTCGATTGC GATTCCATTT ATGCACAAGC CAGAAAAGCC GCTGCCAGTA TGTGGCGTCG GATGGATGCG CTGGCATAA
|
Protein sequence | MLILKNVTAV QLHPAKVQEG VDIAIENDVI VAIGDALTQR YPDASYKEMH GRIVMPGIVC SHNHFYSGLS RGIMANIAPC PDFISTLKNL WWRLDRALDE ESLYYSGLIC SLEAIKSGCT SVIDHHASPA YIGGSLSTLR DAFLKVGLRA MTCFETTDRN NGIKELQEGV EENIRFARQI DEAKKAATEP YLVEAHIGAH APFTVPDAGL EMLREAVKAT GRGLHIHAAE DLYDVSYSHH WYGKDLLARL AQFDLIDSKT LVAHGLYLSK DDITLLNQRD AFLVHNARSN MNNHVGYNHH LSDIRNLALG TDGIGSDMFE EMKFAFFKHR DAGGPLWPDS FAKALTNGNE LMSRNFGAKF GLLEAGYKAD LTICDYNSPT PLLADNIAGH IAFGMGSGSV HSVMVNGVMV YEDRQFNFDC DSIYAQARKA AASMWRRMDA LA
|
| |
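The sequences above are printed in blocks of ten characters, so the spaces must be removed before any processing. The sketch below shows one way to do that, translate the gene sequence, and compute GC content. It uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. All function names are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGTTGATTC TGAAGAATGT CACCGCAGTA"
print(translate(first_blocks))  # MLILKNVTAV
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.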