Gene ECH74115_2416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2416 
Symbol 
ID6970586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2287404 
End bp2288693 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content54% 
IMG OID643386287 
Producthypothetical protein 
Protein accessionYP_002270769 
Protein GI209396032 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCTGG TAGCGTTGCC 
GCACTGGTCA TGGCGCGAGC CGGGCTGGAT ATCCTGGTGA TAGAACGCGG CGACAGTGCC
GGATGTAAAA ACATGACCGG CGGGCGTCTT TATGCCCACA CACTTGAAGC AATCATTCCA
GGCTTTGCAG CATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA
ACCGAAGAGA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC
GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA AGCCGAGCAG
GCGGGCGCAC AGTTTATCCC GGGAGTTCGC GTCGATGCGT TGGTTCGTGA AGGAAACAAG
GTCACTGGAG TCCAGGCCGG GGATGATATT CTCGAAGCCA ATGTGGTGAT TTTAGCCGAT
GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT
TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT
AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG
ATGGGCGGGG GATTTCTCTA TACCAACAAG GATTCCATAT CCTTGGGGCT GGTTTGTGGA
TTGGCTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA
CACCCCGCCA TTCGCCCGCT GATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG
GTGCCAGAAG GCGGTCTGGC GATGGTACCG CAACTGGTTA ACGATGGCGT GATGATCGTT
GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACGG TCCGCGGCAT GGATTTAGCC
ATTGCATCGG CTCAGGCTGC CGCCACAACA GTGATCGCCG CCAAAGAACG CGCGGATTTC
TCCGCCAGCA GTCTGGCGCA ATACAAACGT GAGCTGGAAC AAAGCTGCGT CATGCGTGAT
ATGCAGCATT TTCGCAAGAT CCCGGCGCTG ATGGAAAACC CGCGCCTGTT TAGCCAATAC
CCACGAATGG TAGCCGACAT CATGAACGAG ATGTTCACCA TTGACGGTAA GCCTAACCAG
CCGGTACGCA AAATGATCAT GGGACACGCG AAGAAAATTG GGCTGATCAA CTTGCTGAAA
GATGGCATTA AGGGAGCAAC CGCGCTATGA
 
Protein sequence
MSDDKFDAIV VGAGVAGSVA ALVMARAGLD ILVIERGDSA GCKNMTGGRL YAHTLEAIIP 
GFAASAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEQAEQ
AGAQFIPGVR VDALVREGNK VTGVQAGDDI LEANVVILAD GVNSMLGRSL GMVPASDPHH
YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG
LADIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QLVNDGVMIV
GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKERADF SASSLAQYKR ELEQSCVMRD
MQHFRKIPAL MENPRLFSQY PRMVADIMNE MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK
DGIKGATAL