Gene ECH74115_5550 details


Gene Information

Locus tag: ECH74115_5550
Symbol:
ID: 6968195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5188838
End bp: 5189935
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 42%
IMG OID: 643389191
Product: phage integrase family protein
Protein accession: YP_002273588
Protein GI: 209397184
COG category:
COG ID:
TIGRFAM ID:
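
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state: the coordinates are 1-based and inclusive, and the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5188838
end_bp = 5189935

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1098 365
```

Both results agree with the Gene Length and Protein Length fields listed in the record.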


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTACT ATAACATAGA GAAACGACTA AAATCCGATG GCACACCACG CTATCGCTGT 
AATGTGATTA TCAAAGAAAA AGGTGTTATC ACTTACAGGG AAAGCAAAAC ATTCCCTAAA
CATGCTCATG CCAAAACATG GGGCACACAG AAAGTGATGG AATTAGATCT ATATGGCATT
CCATCATCAA ATGCAGTTGA CGGACTTACA GTCCGTGACT TACTACACAA ATATTTAAAT
GACCCAAATG CCGGAGGTAA AGCAGGCCGT ACTAAAAGAT ATGTGCTGGA ACTGCTTATG
GATAGTGACA TATCCGCGAT CAAACTATCT GAACTGACAG AAAATGACGT AATTGAACAT
TGCAGGCTAA GAAACAACGC TGGTGCAGGC CCAGCAACAG TCAGCCACGA TGTTAGTTAT
CTTGGCAGTG TTCTGGATGC GGCAAAACCT GTATACGGAA TCAATTACAC ATCAAACCCG
GCGAAAAGCG CTCGTCCATA TCTACTTAAA CTCGGTTTGA TTGGTAAATC AAACCGTCGT
AATCGTAGAC CAGCATCTGA TGAACTTGAC ATGCTCATTG AAGGCCTTCA ACAACGATCT
ACTCATAAAT GCTCAAAAAT TCCGTTCGTT GATATCCTCA AATTTTCTGT GTGGTCCTGT
ATGCGAATCG GAGAAGTATG CCGGTTACGA TGGGAAGATC TCGACCAAGA ACAAAAATCT
ATACTAGTAA GAGATAGGAA AGATCCACGT AAAAAGGAAG GTAACCATAT GAAAGTTGCC
TTGCTTGGGG AAGCCTGGGA TATCGTCCAG CGACAACCAA AAAAATCAGA ATTCATTTTT
CCATATAACA GCACTTCTGT TACCGCAGGA TTTCAGAGGG TAAGAAGCAA ATTAGGTATT
AAAGATCTGC GATATCATGA TTTGCGTAGA GAAGGGGCAA GTCGCTTATT TGAGGCTGGT
TTTAGTATTG AGGAAGTCGC TCAAGTTACA GGGCATCGTT CATTAAACGT GCTATGGCAG
GTATATACCG AACTGTATCC GAAATCTTTA CATAATCGTT TTGAAGAACT CCAAAAGAGC
AGAAACAAGA CCTCTTGA
 
Protein sequence
MAYYNIEKRL KSDGTPRYRC NVIIKEKGVI TYRESKTFPK HAHAKTWGTQ KVMELDLYGI 
PSSNAVDGLT VRDLLHKYLN DPNAGGKAGR TKRYVLELLM DSDISAIKLS ELTENDVIEH
CRLRNNAGAG PATVSHDVSY LGSVLDAAKP VYGINYTSNP AKSARPYLLK LGLIGKSNRR
NRRPASDELD MLIEGLQQRS THKCSKIPFV DILKFSVWSC MRIGEVCRLR WEDLDQEQKS
ILVRDRKDPR KKEGNHMKVA LLGEAWDIVQ RQPKKSEFIF PYNSTSVTAG FQRVRSKLGI
KDLRYHDLRR EGASRLFEAG FSIEEVAQVT GHRSLNVLWQ VYTELYPKSL HNRFEELQKS
RNKTS
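
As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates the first and last blocks of the gene sequence codon by codon. It assumes the sequence is read in frame from its first base, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers and are not part of the record.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
first_block = "ATGGCGTACT ATAACATAGA GAAACGACTA"
last_block = "AGAAACAAGA CCTCTTGA"

print(translate(first_block))  # prints: MAYYNIEKRL
print(translate(last_block))   # prints: RNKTS
```

The two outputs match the first ten and last five residues of the protein sequence listed above. The last block can be translated on its own because every full line of the gene sequence holds 60 bases, so each line begins in frame.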