Gene ECH74115_3203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3203 
Symbol 
ID6972182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2954131 
End bp2955633 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content55% 
IMG OID643387022 
Productphage portal protein, lambda family 
Protein accessionYP_002271489 
Protein GI209398345 
COG category[R] General function prediction only 
COG ID[COG5511] Bacteriophage capsid protein 
TIGRFAM ID[TIGR01539] phage portal protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.00254824 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAATTA TTGATGATGT GATCGGCGTG TTTTCCCCCG GGTGGAAAGC AGCCAGACTG 
CGTTCAAGGG CGTTAATCAT GGCCTATGAG GCGGTGAAAC CGACCCGGAC ACATAAAGCC
CGGCGGGAAA ATCGCTCTGC TGATCAGCTC AGTAAATACG GTGCGGTTTC CCTGCGGGAG
CAGGCCCGTT TTCTGGATAT CAATCATGAC CTGGTGATTG GTGTGTTTGA CAAGCTGGAA
GAGCGGGTGA TTGGTGCCAG GGGAATTATT GTGGAGCCTC AGCCATTACG AAAAAACGGG
GAAATGGCGG CTGAGCTGGC TGCGGATATC CGCCGTTTGT GGGCTGAATG GTCCGTGAGT
CCGGATGTGA CAGGGCAGTA TACCCGTCCT GTGCTTGAAC GTTTACTGCT GCGGACCTGG
CTGCGGGATG GTGAAGTGTT TGCGCAGATG GTCAGTGGTG CGGGAAACGG TCTGGAACGG
ACGGCGGGAG TGCCATTCTG GCTTGAGGCG ATGGAGCCGG ATTTTGTTCC CATGCGCACT
GATGAATCCG CCGGACTGAA TCAGGGGGTT TTTCTTGATG AGTGGGGAAG ACCGAAAAAA
TATCTGGTTT ATAAAAATTA TCCGGTCAGA GGCCGGCAGA GTGATACGAA AGAAATCGCT
GCCGGAAAAA TGATCCACCT GAAGTTCACT CGTCGTCTGC ATCAGACGCG AGGCTCATCC
ATGTTATCGG GGGTGCTGAT GCGGATCAGT GCCCTTAAGG AGTATGAGGA TGCGGAACTG
ACAGCGGCGC GTATTGCTGC GGCGCTGGGA CTGTATATCC GTAAAGGTGA CGGACAGGAC
TATGAAGATC CGGGGAGCAA AGAGACCGAG CGGGAAGTCC ATATCACCCC GGGTATTATT
TATGACGATT TGCGCAAGGG CGAGGATATC GGCATGGTCA AATCTGACCG TCCCAATCCC
AACCTTGAAA CTTTCCGCAA CGGCCAGTTG CGTGCAGTGG CAGCAGGCAG TCGTCTGAGT
TTTTCCAGTG CGGCGCGTAA CTATAACGGC ACCTACAGCG CCCAGCGGCA GGAGTTGGTC
GAGTCCACGG ATGGTTACCT GATCCTGCAG GACTGTTTTA TTGGCGCGGT AACCCGCCCG
GTGTACCGGA CATGGCTGAA TATGGTGGTT GCGGCAGGTC TGCTGAAAAT TCCGGCGGAT
GTGGAGATGA AAACGCTATA TAACGCGACG TATTCCGGTC CGGTGATGCC GTGGATCGAC
CCGGTTAAGG AAGCTGAAGC CTGGAGAATT CAGATCCGGG GTGGTGCAGC GACAGAATCT
GACTGGGTGC GTGCTGGTGG GCGCAATCCG GATGAGGTCA AACGTCGCCG CAAGGCTGAA
ATTGATGAAA ACAGCAGACT GGGGCTGGTC TTTGATACTG ACCCCGTCAA CGACAAAGGA
GGCAACAGTG CCGGAACTGA ACGACAGTAT CAGCGCGACA CCGAAAGCCA GCATGAAGAA
TAA
 
Protein sequence
MAIIDDVIGV FSPGWKAARL RSRALIMAYE AVKPTRTHKA RRENRSADQL SKYGAVSLRE 
QARFLDINHD LVIGVFDKLE ERVIGARGII VEPQPLRKNG EMAAELAADI RRLWAEWSVS
PDVTGQYTRP VLERLLLRTW LRDGEVFAQM VSGAGNGLER TAGVPFWLEA MEPDFVPMRT
DESAGLNQGV FLDEWGRPKK YLVYKNYPVR GRQSDTKEIA AGKMIHLKFT RRLHQTRGSS
MLSGVLMRIS ALKEYEDAEL TAARIAAALG LYIRKGDGQD YEDPGSKETE REVHITPGII
YDDLRKGEDI GMVKSDRPNP NLETFRNGQL RAVAAGSRLS FSSAARNYNG TYSAQRQELV
ESTDGYLILQ DCFIGAVTRP VYRTWLNMVV AAGLLKIPAD VEMKTLYNAT YSGPVMPWID
PVKEAEAWRI QIRGGAATES DWVRAGGRNP DEVKRRRKAE IDENSRLGLV FDTDPVNDKG
GNSAGTERQY QRDTESQHEE