Gene EcolC_0829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0829 
Symbol 
ID6065278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp892254 
End bp893582 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content53% 
IMG OID641600234 
Productputative chlorohydrolase/aminohydrolase 
Protein accessionYP_001723828 
Protein GI170018874 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR03314] putative selenium metabolism protein SsnA 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGATTC TGAAGAATGT CACTGCGGTA CAGCTACACC CGGCAAAAGT GCAGGAAGGC 
GTTGATATCG CCATCGAAAA CGATGTGATT GTCGCTATCG GCGATGCCCT GACGCAACGC
TACCCCGACG CCAGCTTCAA AGAGATGCAT GGCCGGATTG TGATGCCGGG AATTGTCTGC
TCGCACAACC ATTTTTACTC GGGGCTTTCC CGCGGAATTA TGGCAAACAT CGCCCCCTGC
CCGGATTTCA TCTCAACGCT GAAAAATCTC TGGTGGCGGC TCGATCGCGC CCTTGATGAA
GAGTCGCTCT ATTACAGCGG ACTGATTTGT TCCCTGGAAG CGATTAAGAG CGGATGTACA
TCGGTTATCG ATCACCATGC CTCTCCGGCG TATATCGGCG GGTCGCTCTC CACATTGCGC
GACGCATTTT TAAAAGTTGG CCTGCGCGCG ATGACCTGTT TTGAAACTAC TGACCGTAAC
AACGGCATCA AAGAGTTGCA GGAAGGTGTA GAAGAAAACA TCCGTTTCGC CCGTTTGATT
GATGAGGCGA AGAAAGCGAC AAGCGAGCCG TATCTGGTGG AAGCACATAT CGGTGCTCAC
GCGCCGTTTA CCGTGCCGGA TGCCGGTCTG GAGATGCTGC GTGAAGCCGT GAAAGCCACA
GGCCGTGGTT TGCATATTCA CGCTGCGGAA GACCTTTACG ACGTTTCCTA CAGTCACCAC
TGGTACGGCA AAGACCTGCT GGCACGACTG GCGCAATTCG ATCTCATCGA CAGCAAAACG
CTGGTCGCTC ATGGGCTGTA CTTGTCGAAA GATGACATCG CCCTACTCAA TCAGCGCGAT
GCGTTCCTGG TGCATAACGC CCGTTCAAAC ATGAACAACC ATGTCGGCTA CAACCATCAC
CTTAGCGACA TCCGCAATCT GGCGTTGGGA ACGGACGGCA TTGGTTCGGA CATGTTTGAA
GAGATGAAAT TTGCCTTCTT TAAACATCGC GATGCGGGTG GTCCGCTGTG GCCTGACAGT
TTTGCCAAAG CCCTGACTAA CGGTAACGAA CTGATGAGCC GCAACTTTGG CGCGAAATTT
GGTCTTCTGG AAGCCGGTTA CAAAGCCGAT TTAACCATTT GCGATTACAA CTCGCCGACG
CCGCTGCTGG CAGACAATAT CGCCGGGCAT ATCGCTTTCG GTATGGGCTC AGGCAGCGTT
CACAGCGTGA TGGTCAATGG CGTGATGGTC TATGAAGACC GTCAGTTTAA CTTTGATTGC
GATTCCATTT ATGCACAAGC CAGAAAAGCC GCTGCCAGTA TGTGGCGTCG GATGGATGCG
CTGGCATAA
 
Protein sequence
MLILKNVTAV QLHPAKVQEG VDIAIENDVI VAIGDALTQR YPDASFKEMH GRIVMPGIVC 
SHNHFYSGLS RGIMANIAPC PDFISTLKNL WWRLDRALDE ESLYYSGLIC SLEAIKSGCT
SVIDHHASPA YIGGSLSTLR DAFLKVGLRA MTCFETTDRN NGIKELQEGV EENIRFARLI
DEAKKATSEP YLVEAHIGAH APFTVPDAGL EMLREAVKAT GRGLHIHAAE DLYDVSYSHH
WYGKDLLARL AQFDLIDSKT LVAHGLYLSK DDIALLNQRD AFLVHNARSN MNNHVGYNHH
LSDIRNLALG TDGIGSDMFE EMKFAFFKHR DAGGPLWPDS FAKALTNGNE LMSRNFGAKF
GLLEAGYKAD LTICDYNSPT PLLADNIAGH IAFGMGSGSV HSVMVNGVMV YEDRQFNFDC
DSIYAQARKA AASMWRRMDA LA