Gene ECH74115_2609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2609 
SymboltorY 
ID6971204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2465256 
End bp2466356 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID643386474 
Producttrimethylamine N-oxide reductase III, c-type cytochrome subunit TorY 
Protein accessionYP_002270956 
Protein GI209400901 
COG category[C] Energy production and conversion 
COG ID[COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.637408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.00703853 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGAGGGA AAAAACGCAT TGGGTTATTG TTTTTGCTGA TAGCGGTTGT GGTTGGTGGT 
GGCGGGTTCT TGATGGCGCA AAAAGCCTTA CATAAAACGT CGGATACAGC ATTTTGCCTT
TCCTGCCACT CGATGAGTAA ACCTTTTGAG GAATATCAGG GAACTGTCCA CTTTTCGAAC
CAGAAAGGGA TACGTGCGGA ATGTGCCGAT TGCCATATTC CAAAGTCAGG GATGGATTAT
TTATTCGCTA AATTAAAAGC ATCTAAAGAT ATTTATCATG AATTTGTTAG CGGCAAAATA
GACAGTGACG ATAAGTTCGA AGCTCATCGC CAGGAAATGG CCGAAACAGT ATGGAAAGAA
TTAAAAGCAA CTGACTCTGC AACGTGCCGT AGTTGCCATT CTTTTGATGC CATGGATATT
GCCTCGCAAA GTGAATCTGC GCAGAAAATG CATAACAAAG CACAAAAGGG CGGCGAAACC
TGTATCGATT GTCATAAAGG CATTGCCCAT TTTCCGCCAG AAATAAAAAT GGATGACAAC
GCGGCGCATG AGCTGGAAAG TCAGGCCGCT ACTTCAGTGA CTAATGGCGC ACATATTTAT
CCTTTCAAAA CTTCTCGCAT AGGCGAGCTG GCTACCGTGA ATCCTGGTAC CGATCTCACC
GTCGTTGATG CCAGTGGCAA ACAGCCGATC GTTCTGTTGC AGGGTTATCA AATGCAGGGC
AGTGAAAACA CGCTCTACCT GGCGGCAGGT CAACGGCTGG CGCTAGCCAC ATTAAGTGAA
GAAGGTATCA AGGCGCTCAC GGTAAACGGG GAATGGCAGG CTGACGAATA CGGCAATCAA
TGGCGTCAGG CGTCTTTACA GGGTGCGCTT ACCGATCCCG CATTAGCGGA CCGTAAACTG
CTATGGCAAT ACGCTGAAAA ACTTGACGAT ACCTATTGCG CTGGTTGTCA TGCCCCTATT
GCCGCCGACC ATTACACCGT CAATGCGTGG CCGTCCATTG CCAAAGGAAT GGGGGCACGA
ACCAGCATGA GCGAAAACGA ACTGGACATT TTAACGCGGT ATTTCCAGTA CAACGCCAAA
GATATTACCG AGAAACAGTG A
 
Protein sequence
MRGKKRIGLL FLLIAVVVGG GGFLMAQKAL HKTSDTAFCL SCHSMSKPFE EYQGTVHFSN 
QKGIRAECAD CHIPKSGMDY LFAKLKASKD IYHEFVSGKI DSDDKFEAHR QEMAETVWKE
LKATDSATCR SCHSFDAMDI ASQSESAQKM HNKAQKGGET CIDCHKGIAH FPPEIKMDDN
AAHELESQAA TSVTNGAHIY PFKTSRIGEL ATVNPGTDLT VVDASGKQPI VLLQGYQMQG
SENTLYLAAG QRLALATLSE EGIKALTVNG EWQADEYGNQ WRQASLQGAL TDPALADRKL
LWQYAEKLDD TYCAGCHAPI AADHYTVNAW PSIAKGMGAR TSMSENELDI LTRYFQYNAK
DITEKQ