Gene ECH74115_3026 details

Gene Information

Locus tag: ECH74115_3026
Symbol:
ID: 6968919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2809529
End bp: 2811976
Gene Length: 2448 bp
Protein Length: 815 aa
Translation table: 11
GC content: 56%
IMG OID: 643386860
Product: phage tail tape measure protein, TP901 family
Protein accession: YP_002271328
Protein GI: 209397347
COG category: [S] Function unknown
COG ID: [COG5283] Phage-related tail protein
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
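
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon of the CDS is a stop codon that is not counted in the protein length. The sketch below checks that arithmetic. The function names are illustrative, and the inclusive-coordinate convention is an assumption rather than something the record states.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2809529, 2811976)
print(length)                     # 2448
print(protein_length_aa(length))  # 815
```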


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0149182
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAACA ATGTAAAATT ACAGGTATTG CTCAGGGCTG TTGACCAGGC ATCCCGCCCG 
TTTAAATCCA TCCGCACAGC GAGTAAGTCG CTGTCGGGGG ATATCCGGGA AACACAAAAA
TCACTGCGCG AGCTGAACGG GCAGGCATCC CGTATTGAGG GATTCCGCAA GACCAGCGCA
CAGCTCGCCG TGACTGGTCA TGCACTTGAA AAGGCACGGC AGGAGGCCGA AGCCCTTGCC
ACACAGTTTA AAAACACCGA ACGTCCGACC CGTGCTCAGG CGAAAGTGCT GGAATCCGCA
AAGCGTGCGG CGGAGGACTT ACAGGCGAAA TATAACCGCC TGACGGATTC CGTTAAACGC
CAGCAGCGGG AACTGGCCGC TGTGGGAATT AATACCCGCA ATCTTGCACA TGATGAGCTG
GGGCTGAAAA ACCGTATCAG TGAAACCACC GCACAGCTTA ACCGGCAGCG TGACGCGCTG
GCGCGTGTCA GTGCGCAACA GGCAAAACTT AACGCAGTAA AACAGCGCTA TCAGGCAGGA
AAGGAGCTGG CCGGAAATAT GGCCTCAGTG GGCGCTGCCG GTGTGGGGAT TGCGGCGGCG
GGAACGATGG CCGGTGTTAA GCTGCTGATG CCCGGTTATG AATTTGCGCA GAAAAACTCA
GAATTACAGG CCGTGCTCGG TGTGGCAAAA GACTCCGCCG AAATGGCCGC GCTACGCAAG
CAGGCGCGCC AGCTCGGCGA CAATACCGCC GCCTCGGCGG ATGATGCGGC CGGTGCACAG
ATTATCATCG CGAAAGCGGG TGGGGATGTT GATGCCATTC AGGCGGCAAC GCCGGTCACG
CTGAATATGG CGCTGGCGAA CCGTCGCACG ATGGAAGAAA ACGCCGCCCT GCTGATGGGG
ATGAAATCTG CCTTTCAGCT TTCAAACGAT AAGGTTGCTC ATATCGGGGA TGTTCTCTCC
ATGACGATGA ACAAAACCGC CGCCGATTTT GACGGTATGA GCGATGCGCT GACCTATGCC
GCACCTGTGG CAAAAAATGC CGGTGTCAGC ATTGAAGAAA CCGCCGCAAT GGTCGGGGCG
CTGCATGATG CAAAAATCAC AGGCTCAATG GCGGGGACGG GAAGCCGTGC CGTGTTAAGT
CGCCTGCAGG CACCGACGGG AAAAGCATGG GATGCACTCA AAGAGCTTGG AGTGAAAACC
TCAGACAGCA AGGGGAATAC CCGACCAATA TTTACCATTC TGAAAGAAAT GCAGGCCAGT
TTTGAGAAAA ACCGGCTCGG TACTGCCCAG CAGGCTGAAT ACATGAAAAC CATTTTCGGG
GAGGAGGCCA GTTCAGCCGC CGCCGTGCTG ATGACTGCCG CCTCAACCGG AAAGCTGGAC
AAACTGACCG CTGCGTTTAA AGCCTCAGAC GGGAAGACCG CAGAGCTGGT AAATATCATG
CAGGACAACC TCGGCGGTGA CTTTAAGGAG TTTCAGTCCG CTTATGAGGC GGTGGGGACT
GACCTGTTTG ACCAACAGGA AGGCGCACTG CGTAATCTCA CGCAGACGGC CACAAAGTAT
GTGTTAAAAC TCGACGGCTG GATCCAGAAA AACAAATCAC TGGCGTCAAC CATCGGCCTT
ATTGTCGGTG GCGCACTGGC GCTTATTGGC ATCATCGGTG CAATTGGTCT TGTAGCCTGG
CCGGTTATCA CCGGCATTAA TGCCATCATC GCGGCAGCAG GCGCAATGGG GGCAATCTTC
ACGACGGTTG GCAGTGCTGT TATGACGGCC ATCGGGGCGA TTAGCTGGCC GGTTGTGGCC
GTGGTGGCTG CAATTGTCGC CGGGGCGTTG CTTATCCGTA AATACTGGGA GCCTGTCAGC
GCATTCTTTG GCGGTGTGGT TGAAGGGCTG AAAGCTGCAT TTGCGCCGGT GGGGGAACTG
TTCACGCCAC TTAAGCCGCT GTTTGACTGG CTGGGTGAAA AGTTACAGGC CGCGTGGCAG
TGGTTTAAAA AACTGATTGC CCCGGTCAAA GCCACCCAGG ACACCCTGAA CCGTTGCCGT
GACACGGGCG TCATGTTCGG GCAGGCACTG GCTGACGCGT TGATGCTGCC GCTTAATGCG
TTCAACAAAC TGCGCAGTGG TATTGACTGG GTACTGGAAA AACTCGGTGT TATCAACAAA
GAGTCAGACA CACTTGACCA GACCGCCGCC AGAACTCATG CCGCCACGTA TGGCACCGGT
GGTTATATTC CGGCGACCAG CTCTTATGCA GGCTATCAGG CTTATCAGCC GGTCACGGCA
CCGGCTGGCC GCTCTTATGT GGACCAGAGT AAAAACGAAT ATCACATCAG CCTGACGGGT
GGTACTGCGC CGGGGACACA GCTTGACCGC CAGTTACAGG ATGCGCTCGA AAAATACGAG
CGGGATAAAC GTGCGCGCGC CCGTGCCAGC ATGATGCATG ACGGTTAA
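
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and only the first and last lines of the listing are used as a sample, so the printed percentage is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listing, used as a small sample.
first_line = "ATGAGTAACA ATGTAAAATT ACAGGTATTG CTCAGGGCTG TTGACCAGGC ATCCCGCCCG"
last_line = "CGGGATAAAC GTGCGCGCGC CCGTGCCAGC ATGATGCATG ACGGTTAA"

print(clean_sequence(first_line)[:3])   # ATG (start codon)
print(clean_sequence(last_line)[-3:])   # TAA (stop codon)
print(round(gc_percent(clean_sequence(first_line + last_line)), 1))
```

Passing the full cleaned listing to `gc_percent` is how the GC content field could be checked against the sequence.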
 
Protein sequence
MSNNVKLQVL LRAVDQASRP FKSIRTASKS LSGDIRETQK SLRELNGQAS RIEGFRKTSA 
QLAVTGHALE KARQEAEALA TQFKNTERPT RAQAKVLESA KRAAEDLQAK YNRLTDSVKR
QQRELAAVGI NTRNLAHDEL GLKNRISETT AQLNRQRDAL ARVSAQQAKL NAVKQRYQAG
KELAGNMASV GAAGVGIAAA GTMAGVKLLM PGYEFAQKNS ELQAVLGVAK DSAEMAALRK
QARQLGDNTA ASADDAAGAQ IIIAKAGGDV DAIQAATPVT LNMALANRRT MEENAALLMG
MKSAFQLSND KVAHIGDVLS MTMNKTAADF DGMSDALTYA APVAKNAGVS IEETAAMVGA
LHDAKITGSM AGTGSRAVLS RLQAPTGKAW DALKELGVKT SDSKGNTRPI FTILKEMQAS
FEKNRLGTAQ QAEYMKTIFG EEASSAAAVL MTAASTGKLD KLTAAFKASD GKTAELVNIM
QDNLGGDFKE FQSAYEAVGT DLFDQQEGAL RNLTQTATKY VLKLDGWIQK NKSLASTIGL
IVGGALALIG IIGAIGLVAW PVITGINAII AAAGAMGAIF TTVGSAVMTA IGAISWPVVA
VVAAIVAGAL LIRKYWEPVS AFFGGVVEGL KAAFAPVGEL FTPLKPLFDW LGEKLQAAWQ
WFKKLIAPVK ATQDTLNRCR DTGVMFGQAL ADALMLPLNA FNKLRSGIDW VLEKLGVINK
ESDTLDQTAA RTHAATYGTG GYIPATSSYA GYQAYQPVTA PAGRSYVDQS KNEYHISLTG
GTAPGTQLDR QLQDALEKYE RDKRARARAS MMHDG
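
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below builds the codon table from the amino-acid string that NCBI lists for table 11 (which uses the same amino-acid assignments as the standard code) and translates the first line of the gene listing. The `translate` helper is illustrative, and it does not handle the special case where an alternative start codon such as GTG or TTG is read as methionine; this gene starts with ATG, so that case does not arise here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGTAACA ATGTAAAATT ACAGGTATTG CTCAGGGCTG TTGACCAGGC ATCCCGCCCG"
print(translate(first_line))  # MSNNVKLQVLLRAVDQASRP
```

The output matches the first twenty residues of the protein sequence listing.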