Gene ECH74115_1861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1861 
Symbol 
ID6968173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1759857 
End bp1761980 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content57% 
IMG OID643385797 
Productphage terminase large subunit 
Protein accessionYP_002270286 
Protein GI209398699 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.000286067 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATCAGG TGAACGAGAG CCATAGCCGC GCATCCGATA TCTGGCGCGA AGTGGCCTCG 
CTGTTTCGCC CACCCAGCCG GTTACCAGTA GCGGAAGCCA TCAGGCGTTA TATGCGGGTT
CCACGGGGAG CCAATACTTC CGGTCCGTGG GAGTCATCGC TGACGCCCTA TATGATTGAC
CCCATTAATA CATTATCAGC CCGTGAATAT GACGCGGTGG TGTTTGTGGG ACCTGCGCGA
ACCGGGAAAA CCGAAGGGCT GATTGATGGC TGGATTGTGT ACGGCATCAT CTGTGATCCG
GCGGATATGC TGGTGGTGCA GATGACTGAG ACGAAGGCGC GTGAGCATTC CAGAACGCGT
CTTTCCAGGA CGTTTCGCCA CAGTCCGGAG GTCAGCAAGC GCCTCAGTCC TTCCCGTAAT
GACAACAACG TCCACGATAA AATGTTTCTT GACGGCTCCT TCCTGAAGAT TGGCTGGCCG
TCGATCACCG TCTTTTCCTC TTCGGATTAC CGTCGTGTGG CGCTGACGGA TTATGACCGT
TTCCCTGAAA ACGTGGACGG GGAAGGGGAT GCCTTCACGC TGGCCTCAAA GCGTACCACC
ACCTTTATGT CCTCGGGGAT GACCCTGGTC GAGAGTTCAC CGGGGCGGGA TATCACCGAT
ACCAAATGGC GTTGTGGTGG CGCACATGAG GCACCGCCAA CAACGGGGAT CCTGTCACTG
TATAACCGGG GAGACCGCCG CCGGTGGTAC TGGCCGTGTC CGCACTGCGG GGAATATTTT
CAGCCGGTGA TGGATAACAT GACCGGATAC CGGAATAACC CGGATTTTGT GGCTGCCGGT
CAGGCTGCCC GTCTGATGTG TCCGCATTGT CGCGGGCTGA TCGCCCCTGA GCAGAAACGC
GAACTGAATA ACCAGGGGAT CTGGCTTCGT GAAGGTGAAC GGGCGGCGGC GGACGGCAGT
ATCACCGGAA CGCCACGAAA CTCCCGGATT GCGTCATTTT GGATGGAGGG GCCAGCTGCG
GCGTTTCAGA CCTGGGAACA ACTGATTTTT AAACTGCTGG CGGCAGAAGA AGAGTATGAG
CGAACCGGCA GTGAAGAGAC CCTGAAAGCG GTGGTGAACA CCGATATCGG ACGACCCTAT
CTGCCCCGTT CAGCCACGGA ACAGCGTAAA AGTGAACTGC TTGAACAGCG TGCCGAGCCG
TTTCCCCGGC GATCTGTGCC GGATGGTGTG CGTTTTATTG AGGCAACGGT TGACGTACAG
GGCGGTAAAA ATCGCCGTTT TGTTGTGCAG ATCACCGGAT ACGGAGAGCA GGGGGAACGC
TGGATTGTTG ATCGCTACAA CATCCGGCAT TCACTGCGCT GCAGTCCCAA CGGTGAAAGT
CTGCCGGTTG ATCCGGCGGC ATATCCGGAG GACTGGGATT TGTTGCTGAC GGATGTGTTC
CATAAAACAT GGCCGCTGGC TTCTGATCCG GATGTGCGCA TGCGTCTGAT GGCCATGGCG
GTGGATACGG GAGGGGAAGC CGGGGTGACA GATAACGCCT ATCGTTTCTG GCGTCGTTGC
CGGAGTGACG GACTGGGCAA CAGGGTGTTT CTGTTCAAGG GGGATGGACT TCGCCGTGAC
AGGCTGATTA ACCGAACCTT CCCGGATAAT ACCGGCAGAA GTGCCCGCCG TGCCAGAGCC
AGTGGCGATG TCGCGCTGTG GCTGGTTCAG ACGGATGCGT TTAAGGATCG TGTAAATAAT
GCCCTGTGGC GTGACACACC AGGGCCGAAC TATATCCACT TTCCCGACTG GCTGGGGCGG
TGGTTTTACG ATGAGCTGAC CTATGAAGAG CGCGGCAGTG ACGGAAAATG GCGAAAACCG
GGCAGGGGCG CTAACGAAGC GTTTGACCTG CTGGTTTATG CGGATGCGCT TGCCGTTCTG
CATGGTTACG AAAAGATCCG CTGGCCCTCC GCACCGGACT GGGCACAGCG GGAAACGTGG
CTCGTCTTCC CGCAGGAGCG TTCTAGTGAA ACGGTATCCC CGGAACTGAC GGCCGGGGCA
GAAAAACGCC GTCGCCGGAA GAAAAAACTG TGGACGGAGC GTGCGGAAGA TAATCCATGG
ATAACATCAG GAGGCTGGTT GTGA
 
Protein sequence
MNQVNESHSR ASDIWREVAS LFRPPSRLPV AEAIRRYMRV PRGANTSGPW ESSLTPYMID 
PINTLSAREY DAVVFVGPAR TGKTEGLIDG WIVYGIICDP ADMLVVQMTE TKAREHSRTR
LSRTFRHSPE VSKRLSPSRN DNNVHDKMFL DGSFLKIGWP SITVFSSSDY RRVALTDYDR
FPENVDGEGD AFTLASKRTT TFMSSGMTLV ESSPGRDITD TKWRCGGAHE APPTTGILSL
YNRGDRRRWY WPCPHCGEYF QPVMDNMTGY RNNPDFVAAG QAARLMCPHC RGLIAPEQKR
ELNNQGIWLR EGERAAADGS ITGTPRNSRI ASFWMEGPAA AFQTWEQLIF KLLAAEEEYE
RTGSEETLKA VVNTDIGRPY LPRSATEQRK SELLEQRAEP FPRRSVPDGV RFIEATVDVQ
GGKNRRFVVQ ITGYGEQGER WIVDRYNIRH SLRCSPNGES LPVDPAAYPE DWDLLLTDVF
HKTWPLASDP DVRMRLMAMA VDTGGEAGVT DNAYRFWRRC RSDGLGNRVF LFKGDGLRRD
RLINRTFPDN TGRSARRARA SGDVALWLVQ TDAFKDRVNN ALWRDTPGPN YIHFPDWLGR
WFYDELTYEE RGSDGKWRKP GRGANEAFDL LVYADALAVL HGYEKIRWPS APDWAQRETW
LVFPQERSSE TVSPELTAGA EKRRRRKKKL WTERAEDNPW ITSGGWL