Gene ECH74115_1577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1577 
Symbol 
ID6970162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1539650 
End bp1541050 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content54% 
IMG OID643385541 
Productbacteriophage P4 DNA primease 
Protein accessionYP_002270035 
Protein GI209399659 
COG category[L] Replication, recombination and repair 
COG ID[COG3598] RecA-family ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0190049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000349117 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAGCG CACCGAACTT AAAAAAACAG CCTTACGACA AGATGACCGA AGTCATTATT 
TTTGCGGGTA GTGATGCCTG GGCACATGCG AAACAGTGGC AGGAACAGGA CGGGCGACTG
GCTGGCGATA ATGTGCCTCC CGTTGTGCTG GCTGATGATC AACTGGATGA ACTGGCAGAA
CTGAGAATCA TCGACGAGGG GCGCTATTGT GTCCGGCTGT ACAAGGCAGG CCACATCAGG
CCATCAAATA TTAATGCCAT TGCGCACAAG CTGGCGGCGG CGGGTGTAAC TGATGCGAAT
TATTACCCCG AAGGGATGCA CAGCCATATG CGGGAGAACT GGCGCGAATA CCTGGAACGG
GTGCGCGGGA AAGAGCCGGC GGAAGAAAAA AACCACCAGC GAAAAACCAC GCTACCGATG
AGCGTTGGAT CTACCGGATA CGACACGCAA CTGGATTACG TGGTTAAGGG GATTATTCCG
GCGGTATCGC TATGCAGCAT ATACGGGGCT AGCGGGTCCT ATAAATCATT CCTTGCCGGA
TCGTGGGCGT GCCATGTTGC CACTGGTCGC CAGTGGGGAG GCCGCAGGGT TGCACATGGT
GCGGTTCTCT ATGTGGTTGG TGAAGGCGGT ATAGGTGTTC CGCGTCGTGT AAAAGCCTGG
GAGGTTGTGC ACGATGAGCA GGTGAAAAAT CTGTATCTGG TAAACCGCCC CATCTTTCCG
GCTGCCCCGC TTGATGTTGA TGAAATGGTT ATCGCTGCCC GTCAGGTGGA GCGGGAAACG
GGTAAACCTG TACGCATGAT TATTCTGGAT ACGCTGGCGC GTTGCTTTGG TGGGAATGAT
GAAAATGATT CCCGTGATAT GGGGGCGTTT ATCCGTGGTT GTGACGAACT GAAACGACGC
ACAGGGGCCA CGGTGCTGGT GGTTCACCAT TCCGGCAAGG ATGAGACGAA AGGCGCGCGC
GGTTCCAGTG CATTTCGTGC TTCGCTGGAT GCTGAATACC GGATACGCAG GGAGGACGCA
GGAAGCGAAG CGCTGGTTAT CTCATGCACC AAAATGAAGG ACGCGGAGGA ACTCAAAGAA
GCCGCATATG ACTTACGCGT GGTGGAGCTT TTTACCGACG CTGACGGTGA ATTAATCACG
TCGCTGGTGG TGGTGGATGA TCCGCGCCCT CCTGTTGAAC TGGAGCGCAT CGAGGAGGCA
GGGAACAAGA CGGAAAACCA TACCGCGCTA TGGGGGTGCA TCCGTTCACG CACACAGAAC
GGCGACAAGT GCACGATCCC GCTGTTACGT GATGACATGA AAAAGCTGGG GTATGAAATG
AAAAACTTCC GGCGCTGGCT GTACAAGCTG GAAAAAGATG GGGTTATTCG TATCGATGGG
GATGATGTAG CGCCGCTATA A
 
Protein sequence
MKSAPNLKKQ PYDKMTEVII FAGSDAWAHA KQWQEQDGRL AGDNVPPVVL ADDQLDELAE 
LRIIDEGRYC VRLYKAGHIR PSNINAIAHK LAAAGVTDAN YYPEGMHSHM RENWREYLER
VRGKEPAEEK NHQRKTTLPM SVGSTGYDTQ LDYVVKGIIP AVSLCSIYGA SGSYKSFLAG
SWACHVATGR QWGGRRVAHG AVLYVVGEGG IGVPRRVKAW EVVHDEQVKN LYLVNRPIFP
AAPLDVDEMV IAARQVERET GKPVRMIILD TLARCFGGND ENDSRDMGAF IRGCDELKRR
TGATVLVVHH SGKDETKGAR GSSAFRASLD AEYRIRREDA GSEALVISCT KMKDAEELKE
AAYDLRVVEL FTDADGELIT SLVVVDDPRP PVELERIEEA GNKTENHTAL WGCIRSRTQN
GDKCTIPLLR DDMKKLGYEM KNFRRWLYKL EKDGVIRIDG DDVAPL