Gene ECH74115_3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3503 
Symbol 
ID6971922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3248705 
End bp3249919 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content50% 
IMG OID643387305 
Producthypothetical protein 
Protein accessionYP_002271768 
Protein GI209400413 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATTG GCACCGGGCA GCGTGGTGAC GGACGCCACG CATTTGTGAC CCGTGAGGAA 
CTGGTTGGTC TTAAACTCGC CCGGCGTCGA ACATCGGGTG GTGCCTCATA TGCACTGAAT
CCGGGTATTG AGATTGACAG TACTTTAATG ACTGTTGATT TTCCCACAAA ACCGCTGAAT
TTTAAGGCGA CAGGAGGATT TGGCTCGGTT CTTCTTGAAT GGGATATGCC TAATTATCGC
GGACATTCAC TGACTGAAAT CTGGCGGGGT ACGGAGGATG ACCTTGCTGA TGCAGTGCTG
GTTGCCACGA CGCCGGGGCA GGTTTACGGC GATCCGGTTG ACCCTGGCTG GTCGGGATTT
TACTGGATAC GTTTTGTTAA CGCGGCAGGA GTGAAAGGTC CATGGAATGC TGAAAAAGGC
ACTCAGGCAC AAACACAGAT CGGCGTGAAG GCCATCATTG ACCAGATCCG CGATGAGGCT
GCAAAGTCGC CGGTTGTGTC CGAGCTGCGT AAAGAAATAA AAAACGCGCA GGGGCAGGCT
GTAAAGGATG CTGCAATTAA GACAACCGAA GTTGTGGGGA CTCTCAGGGA AGAAACGACA
AGAACGATTG GTGGTATTGA AACCCGCATT AGCACACTGG ATTCGTCAAC CAGTGAATCG
CTTAATGAGG TCGACAAGCG CATCACTAAA CTGGATAAAG AAGGCGGTGA GGCGTTTCTG
GCAATGTGGT CAAAAAAAGC GGGAGTTGAT GGTATCACTG CGGGGATCGG GATTGTCGCC
GGAAAAGACA GTGAAGGCAG GCCTGTAAGT CAGGTTGCAA TTTCTGCGTC GCAGTTGTTT
GTCTTTGACC CGAACAACCC GGATAACACC GCCTATCCGT TTGCGGTATC AGGTGGCAAG
GTTGTGATCC CGAAAGCGAT GATTTATGAC GCGGTGATTG AAACACTGGT GTCGCGGAAG
GTTGTGGCGG ATGAGGTAAA AGCCGGGGTA AGTATCACTT CGCCAGTTAT CCGGAGTGCC
GTTATTCAGA ACGGAAACTT TCAGGTTGAT TCTCAGGGTA ACCTGAATAT TGGAGGCCTT
TTCAGTGTTA CGTCACAAGG GCAACTGACA ATTCGTTACT CTAATCAGAA TGTAGGACTG
GTGATCCGCA ATGATAAAAT TGAGGTTTAT GATCAGAATG GACGACTGGC TGTTCGCATA
GGCAGATTAC GCTGA
 
Protein sequence
MEIGTGQRGD GRHAFVTREE LVGLKLARRR TSGGASYALN PGIEIDSTLM TVDFPTKPLN 
FKATGGFGSV LLEWDMPNYR GHSLTEIWRG TEDDLADAVL VATTPGQVYG DPVDPGWSGF
YWIRFVNAAG VKGPWNAEKG TQAQTQIGVK AIIDQIRDEA AKSPVVSELR KEIKNAQGQA
VKDAAIKTTE VVGTLREETT RTIGGIETRI STLDSSTSES LNEVDKRITK LDKEGGEAFL
AMWSKKAGVD GITAGIGIVA GKDSEGRPVS QVAISASQLF VFDPNNPDNT AYPFAVSGGK
VVIPKAMIYD AVIETLVSRK VVADEVKAGV SITSPVIRSA VIQNGNFQVD SQGNLNIGGL
FSVTSQGQLT IRYSNQNVGL VIRNDKIEVY DQNGRLAVRI GRLR