Gene EcE24377A_3205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3205 
SymbolssnA 
ID5587388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3222623 
End bp3223951 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content52% 
IMG OID640926845 
Productputative chlorohydrolase/aminohydrolase 
Protein accessionYP_001464217 
Protein GI157157569 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR03314] putative selenium metabolism protein SsnA 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGATTC TGAAGAATGT CACCGCAGTA CAGTTACACC CGGCGAAAGT GCAGGAAGGC 
GTTGATATCG CCATCGAAAA TGATGTGATT GTCGCTATCG GCGATGCCCT GACGCAACGC
TACCCCGATG CCAGCTACAA AGAGATGCAT GGTCGGATTG TGATGCCGGG AATTGTCTGC
TCGCACAACC ATTTTTACTC GGGGCTTTCC CGCGGAATTA TGGCAAACAT CGCCCCCTGC
CCGGATTTCA TCTCAACGCT GAAAAATCTC TGGTGGCGGC TCGATCGCGC CCTTGATGAA
GAGTCGCTCT ATTACAGCGG GCTGATTTGT TCCCTGGAAG CGATTAAGAG CGGATGTACA
TCGGTTATCG ATCACCATGC CTCTCCGGCG TATATCGGCG GGTCGCTCTC CACATTGCGC
GACGCATTTT TAAAAGTTGG CCTGCGCGCG ATGACCTGTT TTGAAACTAC TGACCGTAAC
AACGGCATCA AAGAGTTGCA GGAAGGTGTA GAAGAAAACA TCCGCTTCGC CCGTCAGATT
GATGAGGCGA AGAAAGCAGC AACCGAGCCG TATCTGGTGG AAGCACATAT CGGTGCTCAC
GCGCCGTTTA CCGTGCCAGA TGCCGGTCTG GAGATGCTAC GTGAAGCCGT GAAAGCCACA
GGCCGTGGTT TGCATATTCA CGCTGCGGAA GACCTTTACG ACGTTTCCTA CAGTCACCAC
TGGTACGGCA AAGACCTGCT GGCACGACTA GCGCAATTCG ATCTCATCGA TAGCAAAACG
CTGGTCGCTC ATGGGCTTTA CTTGTCGAAA GATGACATCA CCCTACTCAA TCAGCGCGAT
GCGTTCCTGG TGCATAACGC CCGTTCAAAC ATGAACAACC ATGTCGGCTA CAACCATCAC
CTTAGCGACA TCCGCAATCT GGCGTTGGGA ACGGACGGCA TTGGTTCGGA CATGTTTGAA
GAGATGAAAT TTGCCTTCTT TAAACATCGC GATGCGGGTG GTCCGCTGTG GCCTGACAGT
TTTGCCAAAG CCCTGACTAA TGGCAACGAA CTGATGAGCC GCAACTTTGG CGCGAAATTT
GGGCTTCTGG AAGCCGGTTA CAAAGCTGAT TTAACCATTT GCGATTACAA CTCGCCGACG
CCGCTGCTGG CAGACAATAT CGCCGGGCAT ATCGCTTTCG GTATGGGCTC AGGCAGCGTT
CACAGCGTAA TGGTCAATGG TGTGATGGTC TATGAAGACC GTCAGTTTAA CTTCGATTGC
GATTCCATTT ATGCACAAGC CAGAAAAGCC GCTGCCAGTA TGTGGCGTCG GATGGATGCG
CTGGCATAA
 
Protein sequence
MLILKNVTAV QLHPAKVQEG VDIAIENDVI VAIGDALTQR YPDASYKEMH GRIVMPGIVC 
SHNHFYSGLS RGIMANIAPC PDFISTLKNL WWRLDRALDE ESLYYSGLIC SLEAIKSGCT
SVIDHHASPA YIGGSLSTLR DAFLKVGLRA MTCFETTDRN NGIKELQEGV EENIRFARQI
DEAKKAATEP YLVEAHIGAH APFTVPDAGL EMLREAVKAT GRGLHIHAAE DLYDVSYSHH
WYGKDLLARL AQFDLIDSKT LVAHGLYLSK DDITLLNQRD AFLVHNARSN MNNHVGYNHH
LSDIRNLALG TDGIGSDMFE EMKFAFFKHR DAGGPLWPDS FAKALTNGNE LMSRNFGAKF
GLLEAGYKAD LTICDYNSPT PLLADNIAGH IAFGMGSGSV HSVMVNGVMV YEDRQFNFDC
DSIYAQARKA AASMWRRMDA LA