Gene ECH74115_3519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3519 
Symbol 
ID6969320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3262435 
End bp3264141 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content50% 
IMG OID643387320 
Productlarge subunit terminase 
Protein accessionYP_002271783 
Protein GI209397398 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.918493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.00317443 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACATTCC GGAAGAATGA ACCGCGATGT GATGAGCCGT CAGAAATGAC CGAGGCTGAA 
CAACGTCTGT TCATCATGAC TAAACTGAGC AATCCCTGGT GGCGGCTCAA TCATCTCTAC
AAAATACAGA ACGAAAAAGG TGAACTGGTC ACCTTCAGAA TGCGACCGGC GCAGCGCCAG
TTGTTCCGGA GCATGCACAA TAAAAATATT ATCCTGAAAG CGCGCCAGCT GGGATTTTCC
ACAGCCATTG ATATTTATCT TCTCGACCAG GCATTATTCA TTCCGCATCT CAAATGCGGG
ATCGTCGCTC AGGATAAACA GGCTGCCAGT GAAATTTTCC GCACAAAAAT TGCTGTACCG
TTTGATCATC TCCCTGACTG GCTGAGAGCC TCATTCACCA TCGTTGAACG TCGTAGCGGT
GCCAGCGGTG GCTATATCCT GTTTGGTCAC GGCTCGAGTA TCCAGGTGGC AACCTCATTC
CGTTCAGGTA CGGTGCAGCG CCTGCATATC TCAGAGCACG GCAAAATTTG CGCGAAATAT
CCGGCTAAGG CGAAAGAACT GCGAACCGGT ACGCTTAATG CCGTCTCTGA TGAATGCATT
ATTTTTGATG AGTCCACTGC TGAAGGCGTG GGTGGTGATT TTTACGAGAT GAGTAACCGA
GCACAGGAGA TCACTGCATC AGGCTTATTG CTGACGGCAC AGGATTATAA ATTCCATTTT
TACGCCTGGT GGCAGGATCC TAAATACAGC GCCAGAGTGC CGGAAAGCGG GCTGAAGCTG
TCACGGGAAA AAATGACGTA TTTTTCTGCG GTTGAGAAGG CAATGAACAT CACGCTTACT
GATGAACAGA AGCAGTGGTA CATCAATAAG GAAACTGAAC AGCGTGAGGA AATGAAGCAG
GAGTTTCCCT CAACGCCACA GGAGGCGTTT CTGACGTCCG GACGACGTGT GTTCAGTGCC
GAAAGTACGT TGCAGGCAGA ATCATTCTGT TCGCCACCGA TGATTGTTTA TGACATTGAA
CCTGTTACAG GAGCGAAGAC TAAAGCTCAG TCTCTGCGTG AAGGAAATAA AAACGAGTTG
CAGCGGACGC TGATGAATTA TCTGCTGGTA TGGGAACTGC CGGATCCGGA TGAAGAGTAT
GTTTGTGGGG CAGATACTGC CGAAGGGCTG GAGCACGGAG ACCGCTCATC GCTGGATGTT
GTCAAACGCA GTAATGGCGA GCAGGTGGCT CACTGGTTCG GGCATCTCGA TGCTGAACTT
TTTGCTCATC TCATTTCGCA GGTCTGTCGT ATGTATAACA ACGCGTTTGT GGGGCCGGAG
CGTAATAATC ACGGACATGC AGTTATCCTG AAACTCCGGG AACTCTATCC GACACGTTAT
ATCTACAACG AACAGCATCT TGACCAGGCA TATGACGACG ATACGCCCCG CCTTGGCTGG
CTGACAACCC GTCAGAGCAA ACCTGTTCTG ACCGAAGGAA TGAAAACGCT TCTGAATAAT
GGAATATCAG GGATCCGCTG GTCAGGCACA TTATCGGAAA TGAACACCTA CGTTTATGAC
GCGAAAGGCT CCATGAATGC ACAGGAAGGC TGCTTTGATG ATCAGCTCAT GAGCTACATG
ATTGCCCAGG AGATGCGCGC CAGAATGCCG GTGAGGGTAA AACAGAAAAC GGATAAACGC
AGAACCACAC ACTGGATGGC ACACTGA
 
Protein sequence
MTFRKNEPRC DEPSEMTEAE QRLFIMTKLS NPWWRLNHLY KIQNEKGELV TFRMRPAQRQ 
LFRSMHNKNI ILKARQLGFS TAIDIYLLDQ ALFIPHLKCG IVAQDKQAAS EIFRTKIAVP
FDHLPDWLRA SFTIVERRSG ASGGYILFGH GSSIQVATSF RSGTVQRLHI SEHGKICAKY
PAKAKELRTG TLNAVSDECI IFDESTAEGV GGDFYEMSNR AQEITASGLL LTAQDYKFHF
YAWWQDPKYS ARVPESGLKL SREKMTYFSA VEKAMNITLT DEQKQWYINK ETEQREEMKQ
EFPSTPQEAF LTSGRRVFSA ESTLQAESFC SPPMIVYDIE PVTGAKTKAQ SLREGNKNEL
QRTLMNYLLV WELPDPDEEY VCGADTAEGL EHGDRSSLDV VKRSNGEQVA HWFGHLDAEL
FAHLISQVCR MYNNAFVGPE RNNHGHAVIL KLRELYPTRY IYNEQHLDQA YDDDTPRLGW
LTTRQSKPVL TEGMKTLLNN GISGIRWSGT LSEMNTYVYD AKGSMNAQEG CFDDQLMSYM
IAQEMRARMP VRVKQKTDKR RTTHWMAH