Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0829 |
Symbol | |
ID | 6065278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 892254 |
End bp | 893582 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641600234 |
Product | putative chlorohydrolase/aminohydrolase |
Protein accession | YP_001723828 |
Protein GI | 170018874 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR03314] putative selenium metabolism protein SsnA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATTC TGAAGAATGT CACTGCGGTA CAGCTACACC CGGCAAAAGT GCAGGAAGGC GTTGATATCG CCATCGAAAA CGATGTGATT GTCGCTATCG GCGATGCCCT GACGCAACGC TACCCCGACG CCAGCTTCAA AGAGATGCAT GGCCGGATTG TGATGCCGGG AATTGTCTGC TCGCACAACC ATTTTTACTC GGGGCTTTCC CGCGGAATTA TGGCAAACAT CGCCCCCTGC CCGGATTTCA TCTCAACGCT GAAAAATCTC TGGTGGCGGC TCGATCGCGC CCTTGATGAA GAGTCGCTCT ATTACAGCGG ACTGATTTGT TCCCTGGAAG CGATTAAGAG CGGATGTACA TCGGTTATCG ATCACCATGC CTCTCCGGCG TATATCGGCG GGTCGCTCTC CACATTGCGC GACGCATTTT TAAAAGTTGG CCTGCGCGCG ATGACCTGTT TTGAAACTAC TGACCGTAAC AACGGCATCA AAGAGTTGCA GGAAGGTGTA GAAGAAAACA TCCGTTTCGC CCGTTTGATT GATGAGGCGA AGAAAGCGAC AAGCGAGCCG TATCTGGTGG AAGCACATAT CGGTGCTCAC GCGCCGTTTA CCGTGCCGGA TGCCGGTCTG GAGATGCTGC GTGAAGCCGT GAAAGCCACA GGCCGTGGTT TGCATATTCA CGCTGCGGAA GACCTTTACG ACGTTTCCTA CAGTCACCAC TGGTACGGCA AAGACCTGCT GGCACGACTG GCGCAATTCG ATCTCATCGA CAGCAAAACG CTGGTCGCTC ATGGGCTGTA CTTGTCGAAA GATGACATCG CCCTACTCAA TCAGCGCGAT GCGTTCCTGG TGCATAACGC CCGTTCAAAC ATGAACAACC ATGTCGGCTA CAACCATCAC CTTAGCGACA TCCGCAATCT GGCGTTGGGA ACGGACGGCA TTGGTTCGGA CATGTTTGAA GAGATGAAAT TTGCCTTCTT TAAACATCGC GATGCGGGTG GTCCGCTGTG GCCTGACAGT TTTGCCAAAG CCCTGACTAA CGGTAACGAA CTGATGAGCC GCAACTTTGG CGCGAAATTT GGTCTTCTGG AAGCCGGTTA CAAAGCCGAT TTAACCATTT GCGATTACAA CTCGCCGACG CCGCTGCTGG CAGACAATAT CGCCGGGCAT ATCGCTTTCG GTATGGGCTC AGGCAGCGTT CACAGCGTGA TGGTCAATGG CGTGATGGTC TATGAAGACC GTCAGTTTAA CTTTGATTGC GATTCCATTT ATGCACAAGC CAGAAAAGCC GCTGCCAGTA TGTGGCGTCG GATGGATGCG CTGGCATAA
|
Protein sequence | MLILKNVTAV QLHPAKVQEG VDIAIENDVI VAIGDALTQR YPDASFKEMH GRIVMPGIVC SHNHFYSGLS RGIMANIAPC PDFISTLKNL WWRLDRALDE ESLYYSGLIC SLEAIKSGCT SVIDHHASPA YIGGSLSTLR DAFLKVGLRA MTCFETTDRN NGIKELQEGV EENIRFARLI DEAKKATSEP YLVEAHIGAH APFTVPDAGL EMLREAVKAT GRGLHIHAAE DLYDVSYSHH WYGKDLLARL AQFDLIDSKT LVAHGLYLSK DDIALLNQRD AFLVHNARSN MNNHVGYNHH LSDIRNLALG TDGIGSDMFE EMKFAFFKHR DAGGPLWPDS FAKALTNGNE LMSRNFGAKF GLLEAGYKAD LTICDYNSPT PLLADNIAGH IAFGMGSGSV HSVMVNGVMV YEDRQFNFDC DSIYAQARKA AASMWRRMDA LA
|
| |