Gene ECH74115_A0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_A0001 
Symbol 
ID6966546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011351 
Strand
Start bp93 
End bp1103 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content32% 
IMG OID643384032 
Productplasmid replication protein 
Protein accessionYP_002268511 
Protein GI209395651 
COG category[L] Replication, recombination and repair 
COG ID[COG5527] Protein involved in initiation of plasmid replication 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.773057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCCAG TAGAGGTGAA TACGCGAGAG GGGGAGTTAA TATTTCAATC AAATGCTATT 
ACTGATGGTG TTTATAAAAT AACACTTGAT GAATTTCGTT TGTTAAATTT AGCTATATCT
AAGATCTCAA GGCACGATAA ACCTGAGAAA CGTATAAGGA TTACAACAAA AGAGTTTGTT
GAAGTCTTTA ACATCAAGGA TAAAAACGTT AAAAATAGGT TATCAGCAAT TGCTGATGGG
TTGCTAGGAA AAACTATTGA CACATATTCT TTTGACCAGG ACTCTGGGAA GAAAACAAAA
AGAAGGCGCC TGTGGTTGTC AGAGGTAGAG TATGATATTG AAGGTGAAGA AAATGTATTT
TTAGAAATCG TTTTTTCTAC CGAAATATCG GATCTTTTAT TTCAATTAAA AGATAATTTT
ACGTTATTTG CATTAAGAGC AATATCAGCA TTTACCTCTC CTCATTCTTT TAGAATATAT
GCTTGGTTAT GTAAATATAG AAACATGTAT AATTATCGCA AGGGAGAGTT GATTACAACT
GATACAATCA GTGTTTATGA TTTTAAAGTG ATGTTGGGAT TGGAAAAATC CTATGAAGAG
TATAAGTTTC TAAAACAAAA AGTAATTGAA AGATCAATTA ATGAAATAAA TTCATCAACT
GATATTTCAG TTGTTTTGAA TGAATATAAG GCATCCAGAA AAGTTATTGC CATATCGTTT
AGTTTTGTTC GAGAGGATCA TCCAACTTTA CAATCTATAA AGCCAAAGAG GCGGCGCTTA
CCGGCTAGAC CAAGAGTTAA GTCAGGTTCT AATGCTGAAG TTGAGTGGGC TAAGAAATGT
ATTGAAATAA TATTAGAATA TGAAAACGCG CTAAAATTAT ATGATAAGGA AGCTAGGTTG
CCTCTTCCTG ATTTAGAAAA ATTAGTAGGG TATTATGAAT TATGTGGGGA GGATAAGAAA
ATATCAGAAA GGAATTGTGA ACTAAAAGAA AGGAAGTCCA AACGTAAATG A
 
Protein sequence
MLPVEVNTRE GELIFQSNAI TDGVYKITLD EFRLLNLAIS KISRHDKPEK RIRITTKEFV 
EVFNIKDKNV KNRLSAIADG LLGKTIDTYS FDQDSGKKTK RRRLWLSEVE YDIEGEENVF
LEIVFSTEIS DLLFQLKDNF TLFALRAISA FTSPHSFRIY AWLCKYRNMY NYRKGELITT
DTISVYDFKV MLGLEKSYEE YKFLKQKVIE RSINEINSST DISVVLNEYK ASRKVIAISF
SFVREDHPTL QSIKPKRRRL PARPRVKSGS NAEVEWAKKC IEIILEYENA LKLYDKEARL
PLPDLEKLVG YYELCGEDKK ISERNCELKE RKSKRK