Gene ECH74115_1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1788 
Symbol 
ID6971438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1707272 
End bp1708297 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content56% 
IMG OID643385735 
Productphage major capsid protein E 
Protein accessionYP_002270225 
Protein GI209400938 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.123536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGT ACACAACCGC CCAACTGCTG GCGGCAAATG AGCAGAAATT TAAGTTTGAT 
CCGCTGTTTC TGCGTCTCTT TTTCCGTGAG AGCTATCCCT TCACCACGGA GAAAGTCTAT
CTCTCACAAA TTCCGGGACT GGTAAACATG GCGCTGTACG TTTCGCCGAT TGTTTCTGGT
GAGGTTATCC GTTCCCGTGG CGGCTCCACC TCTGAATTTA CGCCGGGATA TGTCAAGCCG
AAGCATGAAG TGAATCCGCA GATGACCCTG CGTCGCCTGC CGGATGAAGA TCCGCAGAAT
CTGGCGGACC CGGCTTACCG CCGCCGTCGC ATCATCATGC AGAACATGCG TGACGAAGAG
CTGGCCATTG CTCAGGTCGA AGAGATGCAG GCAGTTTCTG CCGTGCTCAA GGGCAAATAC
ACCATGACCG GTGAAGCCTT CGATCCGGTT GAGGTGGATA TGGGCCGCAG TGCGGCGAAC
AACATCACGC AGTCCGGCGG CACGGAGTGG AGCAAGCGTG ACAAGTCCAC GTATGACCCG
ACCGACGATA TCGAAGCCTA CGCGCTGAAC GCCAGCGGCG TGGTGAATAT CATCGTGTTT
GACCCGAAAG GCTGGGCGCT GTTCCGTTCC TTCAAAGCCG TCAGGGAGAA GCTGGATACC
CGTCGCGGCT CTCATTCCGA ACTGGAGACA GCGGTAAAAG ACCTGGGCAA AGCGGTGTCT
TATAAGGGAA TGTATGGCGA TGTGGCCATC GTCGTGTATT CCGGACAGTA CGTGGAAAAC
GGCGTCAAAA AGAACTTCCT GCCGGACAAC ACGATGGTGC TGGGGAACAC TCAGGCACGC
GGTCTGCGTA CCTATGGCTG CATTCAGGAT GCGGACGCAC AGCGCGAAGG CATTAACGCC
TCTGCCCGTT ACCCGAAAAA CTGGGTGACC ACCGGCGATC CGGCGCGTGA GTTCACCATG
ATTCAGTCAG CACCGCTGAT GCTGCTGGCT GACCCTGATG AGTTCGTGTC CGTACAACTG
GCGTAA
 
Protein sequence
MSMYTTAQLL AANEQKFKFD PLFLRLFFRE SYPFTTEKVY LSQIPGLVNM ALYVSPIVSG 
EVIRSRGGST SEFTPGYVKP KHEVNPQMTL RRLPDEDPQN LADPAYRRRR IIMQNMRDEE
LAIAQVEEMQ AVSAVLKGKY TMTGEAFDPV EVDMGRSAAN NITQSGGTEW SKRDKSTYDP
TDDIEAYALN ASGVVNIIVF DPKGWALFRS FKAVREKLDT RRGSHSELET AVKDLGKAVS
YKGMYGDVAI VVYSGQYVEN GVKKNFLPDN TMVLGNTQAR GLRTYGCIQD ADAQREGINA
SARYPKNWVT TGDPAREFTM IQSAPLMLLA DPDEFVSVQL A