Gene ECH74115_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1298 
Symbol 
ID6968240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1312903 
End bp1314105 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content49% 
IMG OID643385286 
Productintegrase 
Protein accessionYP_002269781 
Protein GI209398563 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.64566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.14409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTAT TGACGGATAC GAAAGCAAGA CATATCAAAC CTGATGACAA ACCATTGCCC 
CATGGGGGAA TTACCGGACT GACCCTTCAT CCTTCTTCAG TAAAGGGGCG GGGGAAATGG
GTTTTTCGTT ATGTAAGTCC GGTGACACAA AAAAGACGTA ATGCTGGATT GGGAACTTAC
CCAGAGGTCA GTATTGCTGA AGCTGCACGT ACTGCCCGGA TAATGCGAGA GCAACTTGCT
GCAGGTGATG ATCCTCTGGA GATTAAAAAG GCTGAATCTG AGAAAGTCGC TATCCCAACA
TTTGCCGATG CAGCCAGGCG TGTACATGCA GAACTGTCTC CTGGATGGGA AAATCCAAAG
CATGTAAGGC AGTGGTTATC GACGCTTGAG AATTACGCGT TTCCTCAACT GGGAGCAAAA
ACGCTGGATT CGATTACGGC TGCGGACGTG GCAGAAACAC TGCGTCCAGT CTGGTTAACC
TTGTCAGAAA CGGCAAGCCG GGTTAAACAG CGCATTCATG TTGTTATGCA GTGGGGATGG
GCGCACGGTT TTTGTGTAGC AAATCCTGTT GATGTGGTTG ACCATTTGCT TCCTCAGCAG
ACAAGAGGAC GTGATGAACA CCAACCCGCA ATGCCCTGGA GGCAGTTACC GCTTTTTGTG
GCGACCAGTG TGTATACCGA TGAACCTTAT AATGTTACCC GCGCACTGTT ATTAATGGTG
ATACTTACAG CAACTCGCTC GGGCGAAGCC AGGGGAATGC GCTGGGCTGA AATTGATTTT
CATAAGCGGG TATGGACTAT ACCTGCAGAA AGAATGAAAG CCAGGATACA GCATCGTGTT
CCTTTATCCC GGCAGGCTAT TTACATTCTG GAAAATATAC GTGGCCTGCA TGATGAACTG
GTGTTCCCTT CACCCAGAAA GCAGCAGATC CTTTCCGATA TGGTGTTGAC AAGTTTTCTG
CGTAAAAAGA AAGCCGTCAG TGACATTCCG GGGCGAGTTG CCACGGCACA TGGTTTTCGC
TCAACATTCA GGGACTGGTG TAGCGAACAG GGGTATTCGC GGGATCTGGC GGAAAGGGCG
CTCGCTCATA CGCTGAAAAA TAAGGTTGAG GCGGCATATC ATCGTACTGA TCTACTGGAG
CAGCGTGTAC CGATGATGCA GGCATGGGCG GATTATGTGA TGTCTCAAAT TGTGAATAAA
TAA
 
Protein sequence
MAVLTDTKAR HIKPDDKPLP HGGITGLTLH PSSVKGRGKW VFRYVSPVTQ KRRNAGLGTY 
PEVSIAEAAR TARIMREQLA AGDDPLEIKK AESEKVAIPT FADAARRVHA ELSPGWENPK
HVRQWLSTLE NYAFPQLGAK TLDSITAADV AETLRPVWLT LSETASRVKQ RIHVVMQWGW
AHGFCVANPV DVVDHLLPQQ TRGRDEHQPA MPWRQLPLFV ATSVYTDEPY NVTRALLLMV
ILTATRSGEA RGMRWAEIDF HKRVWTIPAE RMKARIQHRV PLSRQAIYIL ENIRGLHDEL
VFPSPRKQQI LSDMVLTSFL RKKKAVSDIP GRVATAHGFR STFRDWCSEQ GYSRDLAERA
LAHTLKNKVE AAYHRTDLLE QRVPMMQAWA DYVMSQIVNK