Gene Information |
Locus tag | ECH74115_1785 |
Symbol | |
ID | 6969667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1703973 |
End bp | 1705574 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643385732 |
Product | phage portal protein, lambda family |
Protein accession | YP_002270222 |
Protein GI | 209399040 |
COG category | [R] General function prediction only |
COG ID | [COG5511] Bacteriophage capsid protein |
TIGRFAM ID | [TIGR01539] phage portal protein, lambda family |
|
|
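The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Neither convention is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1703973
end_bp = 1705574
gene_length_bp = 1602
protein_length_aa = 533

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the unspliced CDS length includes the stop codon,
# so the protein is one residue shorter than the codon count.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the table: 1602 bp is 534 codons, that is, 533 residues plus a stop codon.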
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.0988466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGT CCACCATTCC CACCCTTCTG GGGCCGGACG GCATGACATC GCTGCGTGAA TATGCCGGTT ATCACGGCGG TGGCAGCGGA TTTGGTGGGC AGTTGCGGGC GTGGAACCCA CCGGGTGAAA GTGTGGATGC AGCCCTGCTG CCCAACTTTA CCCGTGGCAA TGCCCGCGCA GACGATCTGG TACGCAATAA CGGCTATGCC GCCAACGCCA TCCAGTTGCA TCAGGATCAT ATCGTCGGGT CTTTTTTCCG GCTCAGTCAT CGCCCAAGCT GGCGCTATCT GGGCATCGGG GAGGAAGAAG CCCGTGCCTT TTCCCGCGAG GTTGAAGCGG CATGGAAAGA GTTTGCCGAA GATGACTGTT GCTGCATTGA CGTTGAGCGA AAACGCACGT TTACCATGAT GATTCGGGAA GGTGTGGCCA TGCACGCCTT TAACGGTGAA CTGTTCGTTC AGGCCACCTG GGATACCCGT CCCTCGCGAC TGTTCCGGAC ACAGTTCCGG ATGGTCAGCC CGAAGCGCAT CAGCAACCCG AACAATACCA GCGACAGCCG GAACTGCCGT GCCGGTGTGC AGATTAATGA CAGCGGTGCG GCGCTGGGAT ATTACGTCAG CGAGGACGGG TATCCTGGCT GGATGCCGCA GAAATGGACA TGGATACCCC GCGAGTTACC CGGCGGTCGT GCTTCGTTCA TTCACGTCTT TGAACCCGTG GAGGACGGGC AGACCCGCGG TGCAAATGTG TTTTACAGCG TGATGGAGCA GATGAAGATG CTCGACACGC TGCAGAACAC GCAGCTGCAG AGCGCCATTG TGAAGGCGAT GTATGCCGCC ACCATTGAGA GTGAGCTGGA TACGCAGTCA GCGATGGATT TTATTCTGGG CGCGAACAGT CAGGAGCAGC GGGAAAGGCT GACCGGCTGG ATTGGTGAAA TTGCCGCGTA TTACGCCGCA GCACCGGTCC GTCTGGGAGG CGCAAAAGTG CCGCACCTGA TGCCGGGGGA CTCACTGAAC CTGCAGACGG CTCAGGACAC GGATAACGGC TACTCCGTGT TTGAGCAGTC ACTGCTGCGG TATATCGCTG CCGGGCTGGG TGTCTCGTAT GAGCAGCTTT CCCGGAATTA CGCCCAGATG AGCTACTCCA CGGCACGGGC CAGTGCGAAC GAGTCGTGGG CGTACTTTAT GGGGCGGCGA AAATTCGTCG CATCCCGTCA GGCGAGCCAG ATGTTTCTGT GCTGGCTGGA AGAGGCCATC GTTCGCCGCG TGGTGACGTT ACCTTCAAAA GCGCGCTTCA GCTTTCAGGA AGCCCGCAGT GCCTGGGGGA ACTGCGACTG GATAGGCTCC GGTCGTATGG CCATCGATGG TCTGAAAGAA GTTCAGGAAG CGGTGATGCT GATAGAAGCC GGACTGAGCA CCTACGAGAA AGAGTGCGCG AAACGCGGTG ACGACTATCA GGAAATTTTT GCCCAGCAGG TCCGTGAAAC GATGGAGCGC CGCGCAGCTG GTCTTAAACC GCCCGCCTGG GCGGCTGCGG CATTTGAATC CGGGCTGCGA CAATCAACAG AGGAGGAGAA GAGTGACAGC AGAGCTGCGT AA
|
Protein sequence | MKMSTIPTLL GPDGMTSLRE YAGYHGGGSG FGGQLRAWNP PGESVDAALL PNFTRGNARA DDLVRNNGYA ANAIQLHQDH IVGSFFRLSH RPSWRYLGIG EEEARAFSRE VEAAWKEFAE DDCCCIDVER KRTFTMMIRE GVAMHAFNGE LFVQATWDTR PSRLFRTQFR MVSPKRISNP NNTSDSRNCR AGVQINDSGA ALGYYVSEDG YPGWMPQKWT WIPRELPGGR ASFIHVFEPV EDGQTRGANV FYSVMEQMKM LDTLQNTQLQ SAIVKAMYAA TIESELDTQS AMDFILGANS QEQRERLTGW IGEIAAYYAA APVRLGGAKV PHLMPGDSLN LQTAQDTDNG YSVFEQSLLR YIAAGLGVSY EQLSRNYAQM SYSTARASAN ESWAYFMGRR KFVASRQASQ MFLCWLEEAI VRRVVTLPSK ARFSFQEARS AWGNCDWIGS GRMAIDGLKE VQEAVMLIEA GLSTYEKECA KRGDDYQEIF AQQVRETMER RAAGLKPPAW AAAAFESGLR QSTEEEKSDS RAA
|
| |
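The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field could be checked against the rest of the record, the helpers below strip the spacing, compute a GC fraction, and translate codon by codon. The function names are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in the start codons it permits). Only the first three and last two blocks of the gene sequence are used here, not the full sequence.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last two blocks of the gene sequence field.
gene_start = "ATGAAAATGT CCACCATTCC CACCCTTCTG"
gene_end = "AGAGCTGCGT AA"

print(translate(gene_start))
print(translate(gene_end))
```

Passing the complete gene sequence field to `gc_fraction` and `translate` would allow a comparison with the GC content and protein sequence fields of the record. The two fragments shown translate to the first ten and last three residues of the listed protein, followed by the stop codon.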