Gene ECH74115_5785 details


Gene Information

Locus tag: ECH74115_5785
Symbol:
ID: 6967857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5419299
End bp: 5420801
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 53%
IMG OID: 643389415
Product: hypothetical protein
Protein accession: YP_002273808
Protein GI: 209399989
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
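
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5419299
end_bp = 5420801
reported_gene_length = 1503      # bp
reported_protein_length = 500    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("leftover bases after splitting into codons:", remainder)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.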


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00669177
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTATTTTT ACTGCCGGGA 
ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC TGTCACGCTG
CAAAAACTGG CGGAGTCATT GTCGGAAATC GGCGTGCCGG TGTTTATGGC TGATGTGAAA
GGCGATCTGA CCGGTATCGC GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CGCAAGGCTT
AAAAATATCG ACGTCAATGA CTGGCAACCG CATGCTAATC CGGTGGTGGT GTGGGATATC
TTTGGCGAGA AAGGCCATTC GGTACGGGCG ACGGTTTCAG ACCTGGGGCC ACTGTTGCTG
GTGCGGCTGT TGAATCTCAA CGATGTGCAA TCTGGCGTGC TGAATATCAT CTTCCGCATT
GCTGACGATC AGGGGCTGTT ACTGCTCGAC TTTAAAGATC TGCGGGCGAT TACCCAGTAC
ATCGGCGATA ACGCCAAATC TTTCCAGAAT CAGTACGGAA ATATCAGTAG CGCATCGGTT
GGTGCCATCC AGCGCGGGCT GTTGTCGCTG GAACAGCAAG GCGCAGCACA CTTCTTTGGT
GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATG CCAACGGTAA AGGCGTTATC
AATATCCTCA GCGCCGAGAA ACTTTATCAG ATGCCGAAAC TGTACGCCGC CAGCCTGCTG
TGGATGCTCT CAGAGTTGTA TGAACAATTG CCGGAAGCGG GCGATCTGGA GAAACCAAAA
CTGGTGTTTT TCTTCGACGA AGCACATCTG CTGTTTAACG ATGCACCGCA GGTACTGCTG
GATAAGATTG AGCAGGTGAT AAGGCTTATT CGCTCAAAAG GCGTGGGCGT CTGGTTCGTT
TCGCAAAACC CGTCTGATAT TCCGGATAAT GTGCTCGGCC AGCTCGGTAA TCGCGTTCAG
CACGCTTTGC GGGCTTTTAC GCCCAAAGAT CAGAAAGCAG TAAAAGCTGC GGCGCAAACC
ATGCGGGCCA ATCCGGCGTT TGATACCGAA AAGGCGATTC AGGAACTGGG CACCGGCGAG
GCGTTGATCT CTTTTCTGGA TGCGAAAGGA AGCCCTTCTG TGGTGGAGCG TGCGATGGTG
ATCGCGCCTT GTTCGCGGAT GGGGCCGGTG ACGGAAGATG AGCGTAATGG CTTGATTAAT
CACTCTCCGG TGTATGGCAA ATATGAGGAT GAGGTGGACC GGGAATCCGC CTATGAGATG
TTGCAAAAAG GCTTTCAGGC CAGTACCGAG CAGCAAAATA ATCCCCCCGC GAAAGGGAAA
GAGGTAGCGG TGGATGACGG CATTCTTGGT GGATTGAAGG ATATTTTGTT CGGCACTACC
GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCCAAAAG CGCCGCTCGC
CAAGTGACGA ATCAGATTGT ACGCGGGATG TTGGGGAGTT TGCTGGGGGG GAGAAGAAGG
TAA
 
Protein sequence
MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK 
GDLTGIAQAG TASEKLLARL KNIDVNDWQP HANPVVVWDI FGEKGHSVRA TVSDLGPLLL
VRLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV
GAIQRGLLSL EQQGAAHFFG EPMLDIKDWM RTDANGKGVI NILSAEKLYQ MPKLYAASLL
WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV
SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE
ALISFLDAKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED EVDRESAYEM
LQKGFQASTE QQNNPPAKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR
QVTNQIVRGM LGSLLGGRRR
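
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the pipeline that produced this record: the `translate` helper and the codon table construction are written for illustration, and the sketch relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). Only the first row of the gene sequence is used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored so sequence rows can be pasted in as displayed.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTATTTTT ACTGCCGGGA"
print(translate(first_row))
```

Translating this row gives the first 20 residues of the protein sequence shown above (MSEPLLIART PDTELFLLPG, without the spacing).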