Gene ECH74115_3504 details

Gene Information

Locus tag: ECH74115_3504
Symbol:
ID: 6971614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3249970
End bp: 3251595
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 51%
IMG OID: 643387306
Product: hypothetical protein
Protein accession: YP_002271769
Protein GI: 209400018
COG category:
COG ID:
TIGRFAM ID:
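
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the record does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(3249970, 3251595)
print(length, expected_protein_length_aa(length))  # prints "1626 541"
```

Under these assumptions the result agrees with the Gene Length (1626 bp) and Protein Length (541 aa) fields.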


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.619572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.985155
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTATA TCGATATCAC CACGATGCGT GGGATGATGC CGCGCGTTGT GACATCCATG 
CTGCCCGAGC ATTCCGCTGT ACTGGCGGAG GACTGCCATT TCCGGTTTGG TGTTATTACA
CCAGAACGTC AGATATCCGG GGTTGAGAAA ACATTCACAA TTAAGCCAAA AACAATTTTT
CATTACCGTG ACGATTTCTG GTTTGCATGG CCGGATGTGG TGGATGTGAT CCGCAGTCCG
ATCGCTCAGG ACCCCCACGG GCGTATTTAC TACACTGACG GGCGTTTTCC TAAAGTGACG
GATGCGACTA TTGCCACAAA AGGGGACGGG AATCACCCGA CATCATCGTA TCGTCTGGGG
ATCCCCGCGC CGACGACAGC TCCTGTCTGT ACTGTTCAGC AGGGCGGTGA TGTTTCTGAC
GATAACCCGA ATGATGACGA AACCCGGTTT TATACTGAAA CCTTTGTCTC AGATTATGGT
GAAGAAGGTC CGCCTGGTCC GGCGTCTCTG GAGGTAACAC TCCGTACTCC GGGGACTGCG
GTACAACTGA CGCTGGCTCC GGTGCCATTG CAGAATGCCA ATATTAAACG TCGCCGGATT
TATCGCTCTG CATCAGGTGG AGGAGAAGCG GATTTTTTAC TTGTGGCTGA ACTGGATGCG
TCAGTGCTCA GTTACACGGA CAAAATACCG ACGAAAAACC TTGGGCCTTC CCTGGCAACA
TGGGATTACC TGCCGCCACC GGAGAATATG ACGGGTCTTT GCCTGATGGC TAATGGTATT
GCTGCCGGGT TTGCCGGTAA TGAAGTGATG TTTTCGGAAG CGTATCTGCC GTATGCATGG
CCGGAAGTGA ATCGTCACAC GACGGCAGAA GATATTGTGG CTATCTGTCC GCTGGGAACG
TCACTGGTGG TGGCGACAAA GGGGGAGCCT TATCTGTTCA GTGGTGTATC GCCTTCCACA
ATTTCTGGCT CCAGAATTCC TTCCATGCAG GCATGCCTGA GCCGAAGAAG CATGGTGGCG
ATGGAGGGAT TCGTACTGTA TGCCGGGACA AACGGTCTGG TATCTGTTGA TGTAAACGGT
AATACAGCAC TGGCAACGGA AAAGATTATT TCACCAGAAC AGTGGCAGAG TCAGTTTAAC
CCGATGTCCA TTGTGGCTTA TTCCTGGCGT GGTGACTATA TCGGTTGTTA CACAAAACCG
GATGGTAAGC AGGATGTGTT TGTATTCAGT CCGGCGAACA TGGATATCCG TTATCTCAGC
ACGCCGTTTG ACTGTGCATG GATTGATCTT GCAAAAGATA TGATGCGCGT GGTGACAGGG
GACAAAATGT CAGTGCTTGC CGGGGACTCT CTGCCGTCCA TGATAAGGTG GCATTCAAAA
ATTTTTTCAT TACCTGAAAG AACCTCTTTT TCCTGTATCA GAGTGAAATC TCCGGCACCT
GAGCGGGTGG GGATCACTGT TATGGCTGAT GATGTTCCTG TGATTCATTT TGCGCCGGGT
ACGTTTAAGG GAAGTGTGGT GAGACTTCCG GCAGCAACCG GGCAAAACTG GCAGGTGATG
GTATCCGGAT TCGGGCAGGT GGAACGAATA ACCCTGAGTA CATCGATGTC GGAGATGCCG
GTATGA
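
The gene sequence above is displayed in blocks of 10 bases separated by spaces. As a rough sketch, the GC fraction of such a block-formatted sequence could be recomputed as follows. The function name gc_fraction is hypothetical, and how the database itself computes or rounds its GC content field is not stated in this record.

```python
def gc_fraction(block_formatted_seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    # Joining the split pieces removes spaces and line breaks between blocks.
    bases = "".join(block_formatted_seq.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence shown above.
print(gc_fraction("ATGCCCTATA TCGATATCAC"))
```

Passing the full gene sequence, with its spaces and line breaks intact, gives a value that can be compared with the GC content field.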
 
Protein sequence
MPYIDITTMR GMMPRVVTSM LPEHSAVLAE DCHFRFGVIT PERQISGVEK TFTIKPKTIF 
HYRDDFWFAW PDVVDVIRSP IAQDPHGRIY YTDGRFPKVT DATIATKGDG NHPTSSYRLG
IPAPTTAPVC TVQQGGDVSD DNPNDDETRF YTETFVSDYG EEGPPGPASL EVTLRTPGTA
VQLTLAPVPL QNANIKRRRI YRSASGGGEA DFLLVAELDA SVLSYTDKIP TKNLGPSLAT
WDYLPPPENM TGLCLMANGI AAGFAGNEVM FSEAYLPYAW PEVNRHTTAE DIVAICPLGT
SLVVATKGEP YLFSGVSPST ISGSRIPSMQ ACLSRRSMVA MEGFVLYAGT NGLVSVDVNG
NTALATEKII SPEQWQSQFN PMSIVAYSWR GDYIGCYTKP DGKQDVFVFS PANMDIRYLS
TPFDCAWIDL AKDMMRVVTG DKMSVLAGDS LPSMIRWHSK IFSLPERTSF SCIRVKSPAP
ERVGITVMAD DVPVIHFAPG TFKGSVVRLP AATGQNWQVM VSGFGQVERI TLSTSMSEMP
V