Gene ECH74115_3234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3234 
Symbol 
ID6971037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2973250 
End bp2974254 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content48% 
IMG OID643387050 
Productphage replication protein O 
Protein accessionYP_002271514 
Protein GI209400638 
COG category 
COG ID 
TIGRFAM ID[TIGR01610] phage replication protein O, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.69672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00032454 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGAAAG CCGGTCTGCT GGAACAGAAC CGACTTTCAG GTGCAAATCG TAACGCACTC 
ATTGCGGGAG GAATTATGGC AAACACTGCT GAGATATTCA ATTTTCCAGT GCCGGATGTG
GCACAAAAGG AGCCGCGCGT GGCAGATCTC GATGATGGTT ATACGCGCAT TGCAAATGAG
TTGCTGGAAG CTGTGATGCT GGCCGGATTA ACACAGCACC AGCTTCTGGT CTTCCTGGCT
GTCATGCGCA AAACATATGG CTTTAATAAA AAACTGGATT GGGTGAGCAA CGAGCAACTT
TCCGAATTGA CCGGGATATT GCCGCACAAG TGTTCTGCTG CAAAAAGCGT TCTGGTAAAG
CGTGGGATTC TTATTCAGAG CGGGCGGAAT ACCGGCATTA ATAATGTGGT CAGTGAATGG
TCAACATTAC CCGAATCAGG TAAGAAAAAT AAAGTTTACC TGAAAGAGGT AAATTTACCT
GAATCAGGTA AAAAAAGTTT ACCCAAATCA GGTAAAGACG TTTACCCGAA TCAGGTAAAC
ACAAAAGACA AAATAACAAA AGACAATATA AAACCTTATT CGTCCGAGAA TTCTGGCGAA
TCCTCTGACC AGCCAGAAAA CGACCTTCCT GTGGTGAAAC CGGATGCTGC GATTCAGAGC
GGCAGCAAGT GGGGGACAGC AGAAGACCTG ACCGCCGCAG AGTGGATGTT TGACATGGTG
AAGACCATCG CGCCATCAGC CAGAAAACCG AATTTTGCAG GGTGGGCTAA CGATATCCGC
CTGATGCGTG AACGTGACGG ACGTAACCAC CGCGACATGT GCGTGCTGTT CCGCTGGGCA
TGCCAGGACA ACTTCTGGTC CGGTAACGTG CTAAGTCCGG CCAAACTCCG TGACAAGTGG
ACCCAACTCG AAATCAACCG TAACAAGCAA CAGGCTGGAG TGACAGCCGG AAAATCAAAA
CTGGACCTGA CAAACACTGA CTGGATTTAT GGGGTGGATT TATGA
 
Protein sequence
MAKAGLLEQN RLSGANRNAL IAGGIMANTA EIFNFPVPDV AQKEPRVADL DDGYTRIANE 
LLEAVMLAGL TQHQLLVFLA VMRKTYGFNK KLDWVSNEQL SELTGILPHK CSAAKSVLVK
RGILIQSGRN TGINNVVSEW STLPESGKKN KVYLKEVNLP ESGKKSLPKS GKDVYPNQVN
TKDKITKDNI KPYSSENSGE SSDQPENDLP VVKPDAAIQS GSKWGTAEDL TAAEWMFDMV
KTIAPSARKP NFAGWANDIR LMRERDGRNH RDMCVLFRWA CQDNFWSGNV LSPAKLRDKW
TQLEINRNKQ QAGVTAGKSK LDLTNTDWIY GVDL