Gene ECH74115_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1110 
Symbol 
ID6968684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1139318 
End bp1140427 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content50% 
IMG OID643385117 
ProductMOSC domain protein 
Protein accessionYP_002269616 
Protein GI209397096 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0633] Ferredoxin
[COG3217] Uncharacterized Fe-S protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000289378 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACAT TAACCCGGCT TTTTATTCAT CCTGTTAAAT CGATGCGCGG CATTGGTCTT 
ACACATGCTC TGGCAGATGT CAGTGGTCTG GCCTTCGATC GCATCTTTAT GATCACGGAA
CCTGACGGTA CGTTTATCAC CGCTCGCCAG TTTCCCCAGA TGGTACGATT TACTCCTTCA
CCCGTGCATG ATGGCTTGCA TCTCACCGCA CCAGATAGCA GTAGCGCATA TGTTCGTTTT
GCTGATTTCG CCACACAAGA CGCACCAACC GAAGTTTGGG GCACACATTT TACCGCGCGA
ATTGCGCCAG AAGCGATCAA CAAATGGTTA AGTGGATTTT TCTCCCGCGA AGTGCAATTA
CGCTGGGTGG GGCCACAAAT GACCCGGCGC GTGAAACGCC ACAACACTGT ACCCCTGTCA
TTTGCTGATG GCTATCCTTA CCTTCTTGCT AACGAAGCCT CGTTACGTGA TCTCCAACAA
CGTTGTCCGG CCAGTGTAAA AATGGAGCAA TTCCGCCCCA ATCTGGTGGT TTCCGGCGCG
TCAGCCTGGG AAGAAGATAG TTGGAAAGTG ATTCGCATTG GTGATGTGGT GTTTGATGTG
GTTAAACCTT GTAGCCGCTG TATTTTCACC ACCGTCAGCC CAGAAAAAGG GCAAAAACAT
CCGGCAGGCG AACCATTAAA AACATTGCAA TCTTTCCGCA CTGCCCAGGA TAACGGCGAT
GTCGATTTTG GTCAGAATTT AATTGCCCGT AATAGCGGCG TGATTCGCGT TGGCGATGAG
GTGGAAATTC TGACAACGGC TCCGGCAAAA ATTTACGGCG CAGCTGCCGC TGATGATACT
GCCAACATCA CGCAACAACC GGACGCCAAT GTAGATATTG ACTGGCAGGG ACAGGCATTT
CGTGGAAATA ACCAACAGGT GTTGCTGGAG CAATTAGAAA ATCAGGGAAT TCGTATCCCT
TATTCTTGCC GCGCGGGCAT TTGTGGAAGT TGCCGTGTTC AGCTTTTAGA AGGCGAAGTC
ACGCCGCTGA AAAAATCAGC AATGGGCGAT GATGGCACCA TTCTTTGCTG TAGCTGTGTA
CCGAAGACTG CACTTAAGTT GGCGCGTTAG
 
Protein sequence
MATLTRLFIH PVKSMRGIGL THALADVSGL AFDRIFMITE PDGTFITARQ FPQMVRFTPS 
PVHDGLHLTA PDSSSAYVRF ADFATQDAPT EVWGTHFTAR IAPEAINKWL SGFFSREVQL
RWVGPQMTRR VKRHNTVPLS FADGYPYLLA NEASLRDLQQ RCPASVKMEQ FRPNLVVSGA
SAWEEDSWKV IRIGDVVFDV VKPCSRCIFT TVSPEKGQKH PAGEPLKTLQ SFRTAQDNGD
VDFGQNLIAR NSGVIRVGDE VEILTTAPAK IYGAAAADDT ANITQQPDAN VDIDWQGQAF
RGNNQQVLLE QLENQGIRIP YSCRAGICGS CRVQLLEGEV TPLKKSAMGD DGTILCCSCV
PKTALKLAR