Gene ECH74115_2904 details


Gene Information

Locus tag: ECH74115_2904
Symbol:
ID: 6971326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2693401
End bp: 2695338
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 56%
IMG OID: 643386748
Product: hypothetical protein
Protein accession: YP_002271219
Protein GI: 209398353
COG category:
COG ID:
TIGRFAM ID:
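
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that the start and end positions are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2693401, 2695338)
print(length_bp)                  # 1938
print(protein_length(length_bp))  # 645
```

Both results match the Gene Length and Protein Length values listed in the record.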


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.148641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0000000129832
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACATTTA AACACTACGA TGTGGTCAGG GCGGCATCGC CGTCAGACCT TGCTGAACGA 
CTGACACAAA AACTGAAGGA GGGCTGGCAG CCATTTGGCA GTCCGGTGGC CATCACGCCT
TACACCCTGA TGCAGGCCAT TGCGGCGGAA GGTGATGTCA CCACACCAGT GGCGGTGACC
GGTAATGAGG GTAAGGCGGT GGCTGTCAGT GCCACCAGAG CCCCGGAGTA TTACTTTGTT
GTGGTTCTGG CAGGGCAGTC AAACGGCATG TCGTATGGTG AAGGTCTTCC GCTGCCGGAG
ACATATGACC GTCCGGAGCC GCGTATTAAG CAACTGGCGC GTCGCAGTAC GGTGACACCG
GGTGGTGCAG CATGCAGATA TAACGACATC ATTCCGGCGG ACCATTGTCT GCATGATGTG
CAGGACATGA GCCGCCTTAA CCATCCGAAA GCGGACCTGT CAAAGGGGCA GTACGGAACC
GTGGGGCAGG GGCTGCATAT CGCCAAAAAA TTGCTGCCGT TTATACCGGC GAATGCGGGC
ATTCTGCTGG TTCCGTGCTG TCGTGGTGGT TCAGCGTTCA CCACCGGAGC TGATGGCACA
TACAGTGACG CGAGTGGTGC TTCGGAGAAT TCAACCCGCT GGGGTGTGGA CAAGCCGCTG
TATAAGGACC TTATCGGTCG AACAAAAGCA GCACTGAAGA AGAACCCGAA AAATGTGCTG
TTTGCCGTGG TGTGGATGCA GGGGGAATTT GATTTTGGCG GTACGCCGGC AAATCACGCA
GCACAGTTTG GTGCGCTGGT TGATAAATTC CGTGCAGACC TGGCGGATAT GGCAGGTCAG
TGCGTCGGTG GCTCTGCTGA CGGTGTTCCC TGGATATGCG GGGACACGAC GTATTTCTGG
AAGCAGAAGA ACGAAGCCAC CTACCAGACG GTGTACGGCA GCTACAAAAA CAAAACGGAA
AAAAATATCC ATTTCGTACC GTTCATGACG GATGAGAACG GGGTGAATGT GCCGACGAAC
AAACCGGAAG AAGACCCGGA CATTCCGGGT ATCGGATATT ACGGTTCGAA ATGGCGTGAC
AGCTCAGCCA CCTGGACGTC ACAGGACAGG GCGAGCCATT TCAGCGCCTG GGCACGCCGT
GGGATTATTT CCGACCGTCT GGCAACGGCG ATTTTGCGCC ATGCGGGAAG AGTGGCGCTA
AACGCGGGGG CATCATCGAC AGTATCAGAG GTGCGCCCGT CATCGCCTTC CGGTGCAGAA
GCCACAGGCA TCACAACACT GCTCTCTTAC CTTGCCAGCG AGTCAGAGGG AAGCCTGAAA
GTACAGGGAT GGTCAGCCAG TGGCGGCAGG GCAGAAGTGG TCAGCGATGC GGAGGGAACC
GGAGGTAAGG CAGTGAAGCT GACCAAGGAA GCCGGTAAAA GCAGCTGGGT GCTGGAGTAC
GCCGCGGGCA ACGGTGCGGC TCTGTTACAG AAAGGGGGGC AGATTCGCTG CCGCTTTAAG
GTTTCGGGAG CGCTGGCTGC GAACCAGTAT GTTATGGCGT TTTACTGGCC GGTATCTTCA
CTGCCACAGG GCGTTGCCCT GACCGGAGAC GGGGGGAATA ACCTGCTGGC AGCGTTCTAC
ATCCAGACAG ATGCAAAAGA CCTGAATGTG ATGTACCACA ATGCGAAAGT GGCGACAAAC
AACCTGAAAC TGGGAAGCTT TGGCGCATTT GATAACGAAT GGCATGCGCT GGCTTTCCGC
TTTGCCGGGA ATAACAGCCT TCAGGTGACG CCGGTTATTG ATGGTCAGGA TGGTACACCG
TTCACGCTGA CGCAGTCACC GGTCAGTGCC TTTGCGGCGG ATAAACTGCA TGTGACAGAC
ATTACCAGGA ATGCGACTTA CCCGGTGCTG ATTGACAGCA TTGCGGTGGA AGTGAACAGC
ACAGACACTG CGGCATGA
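
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be normalized before any computation such as GC content. The following is a minimal Python sketch under that assumption. The helper names `normalize` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def normalize(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = ("ATGACATTTA AACACTACGA TGTGGTCAGG "
              "GCGGCATCGC CGTCAGACCT TGCTGAACGA")
seq = normalize(first_line)
print(len(seq))                      # 60 bases
print(f"{gc_fraction(seq):.0%}")     # GC content of this line only
```

Applying the same two steps to the full 1938 bp sequence gives a value that can be compared with the GC content field of the record.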
 
Protein sequence
MTFKHYDVVR AASPSDLAER LTQKLKEGWQ PFGSPVAITP YTLMQAIAAE GDVTTPVAVT 
GNEGKAVAVS ATRAPEYYFV VVLAGQSNGM SYGEGLPLPE TYDRPEPRIK QLARRSTVTP
GGAACRYNDI IPADHCLHDV QDMSRLNHPK ADLSKGQYGT VGQGLHIAKK LLPFIPANAG
ILLVPCCRGG SAFTTGADGT YSDASGASEN STRWGVDKPL YKDLIGRTKA ALKKNPKNVL
FAVVWMQGEF DFGGTPANHA AQFGALVDKF RADLADMAGQ CVGGSADGVP WICGDTTYFW
KQKNEATYQT VYGSYKNKTE KNIHFVPFMT DENGVNVPTN KPEEDPDIPG IGYYGSKWRD
SSATWTSQDR ASHFSAWARR GIISDRLATA ILRHAGRVAL NAGASSTVSE VRPSSPSGAE
ATGITTLLSY LASESEGSLK VQGWSASGGR AEVVSDAEGT GGKAVKLTKE AGKSSWVLEY
AAGNGAALLQ KGGQIRCRFK VSGALAANQY VMAFYWPVSS LPQGVALTGD GGNNLLAAFY
IQTDAKDLNV MYHNAKVATN NLKLGSFGAF DNEWHALAFR FAGNNSLQVT PVIDGQDGTP
FTLTQSPVSA FAADKLHVTD ITRNATYPVL IDSIAVEVNS TDTAA
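
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch of that step, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The function name `translate` is hypothetical, and the stop codon is rendered as `*`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked: str) -> str:
    """Translate a DNA sequence (whitespace allowed) in reading frame 1."""
    seq = "".join(blocked.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGACATTTA AACACTACGA TGTGGTCAGG"))  # MTFKHYDVVR
print(translate("ACAGACACTG CGGCATGA"))               # TDTAA*
```

The two outputs match the first ten and last five residues of the protein sequence, followed by the stop codon.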