Gene ECH74115_1558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1558 
Symbol 
ID6966997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1520091 
End bp1522439 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content35% 
IMG OID643385525 
Producthypothetical protein 
Protein accessionYP_002270019 
Protein GI209400535 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000244044 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGCCCA CTACAAATAT CTCTGTAAAT TCTGGAGTAA TATCTTTTGA AAGTCCTGTA 
GATTCACCAT CTAACGAGGA TGTTGAAGTT GCCCTCGAAA AGTGGTGTGC TGAGGGAGAA
TTTAGCGAAA ATCGTCATGA GGTTGCATCA AAAATACTTG ATGTTATAAG TACTAATGGA
GAGACTTTAT CAATCAGTGA GCCAATAACA ACATTACCAG ACTTGCTTCC AGGTTCTCTG
AAAGAACTGG TTTTGAATGG ATGTACAGAG CTTAAATCAA TAAACTGCTT ACCCCCCAAC
TTATCTTCAT TAAGTATGGT TGGATGCTCA TCATTAGAGG TTATAAATTG CAGCATACCT
GAAAATGTCA TTAATTTATC TTTATGCCAT TGTAGTTCTT TGAAACATAT AGAAGGTTCC
TTTCCTGAGG CACTCAGAAA TTCCGTATAT TTAAATGGCT GTAATTCATT AAATGAATCG
CAATGTCAAT TCCTTGCATA TGATGTCAGT CAAGGCCGTG CCTGCCTGAG CAAAGCTGAG
CTTACTGCTG ACTTAATTTG GTTGTCAGCT AACCGAACGG GTGAAGAGTC TGCTGAAGAA
TTGAATTACT CTGGATGTGA CTTGTCAGGT CTAAGTCTTG TAGGGCTGAA TTTATCATCA
GTAAATTTTT CTGGAGCAGT GCTTGATGAT ACAGATCTCA GGATGAGTGA TTTGTCTCAG
GCTGTATTGG AAAACTGTTC TTTTAAAAAC TCGATTTTGA ATGAATGTAA TTTTTGTTAT
GCTAATTTAT CTAATTGTAT TATTAGGGCT TTGTTTGAAA ACTCTAATTT CAGCAATTCC
AATCTTAAAA ATGCATCATT TAAAGGATCT TCATATATAC AATATCCTCC AATTTTGAAC
GAGGCTGATT TAACAGGAGC TATTATAATT CCTGGAATGG TTTTAAGTGG TGCTATCTTA
GGTGATGTAA AGGAGCTCTT TAGTGAAAAA AGTAATACCA TTAATCTAGG AGGGTGTTAC
ATAGATCTAT CTGACATACA GGAAAATATA TTATCTGTGT TGGATAACTA TACAAAATCA
AATAAATCAA TTTTATTGAC TATGAATACA TCTGATGATA AGTATAACCA TGATAAAGTA
AGGGCCGCTG AAGAACTTAT CAAAAAAATA TCTCTTGACG AATTAGCGGC GTTCCGGCCC
TATGTTAAGA TGTCTTTGGC TGATTCATTT AGTATTCATC CTTATTTGAA CAACGCAAAT
ATACAGCAAT GGCTCGAGCC TATATGTGAT GACTTTTTTG ATACTATAAT GTCTTGGTTT
AATAATTCAA TAATGATGTA TATGGAGAAT GGTAGTTTAT TGCAGGCAGG GATGTATTTT
GAGCGACATC CAGGTGCGAT GGTATCTTAT AATAGTTCCT TTATACAAAT TGTAATGAAT
GGTTCACGGC GTGATGGAAT GCAGGAACGA TTTAGGGAAC TCTATGAAGT ATATTTAAAA
AATGAAAAAG TTTATCCTGT CACACAGCAG AGTGATTTTG GATTGTGCGA TGGCTCTGGG
AAGCCTGACT GGGATGATGA TTCCGATTTG GCTTATAACT GGGTTTTGTT ATCATCACAG
GATGATGGTA TGGCAATGAT GTGTTCTTTG AGTCATATGG TTGATATGTT ATCTCCTAAT
ACATCAACTA ATTGGATGTC CTTTTTTTTA TATAAGGATG GAGAAGTTCA AAATACATTT
GGGTATTCAT TGAGCAATCT TTTTTCTGAA TCATTTCCAA TTTTCAGTAT TCCTTATCAT
AAAGCTTTTT CCCAGAATTT CGTTTCTGGT ATTCTGGATA TACTCATTTC TGATAATGAA
CTCAAAGAGA GATTTATTGA GGCACTTAAT TCCAATAAAT CAGATTATAA AATGATTGCT
GATGATCAGC AAAGGAAACT TGCCTGTGTC TGGAATCCCT TTCTTGATGG TTGGGAACTG
AACGCTCAGC ATGTAGATAT GATTATGGGG AGCCATGTAT TGAAAGATAT GCCACTAAGA
AAACAGGCTG AAATATTATT TTGTTTAGGG GGGGTTTTCT GTAAATACTC ATCGAGTGAT
ATGTTTGGTA CAGAGTATGA TTCTCCTGAG ATTCTACGGA GATATGCAAA TGGATTGATT
GAACAAGCTT ATAAAACAGA TCCTCAGGTA TTTGGCTCAG TTTATTATTA CAATGATATT
TTAGACAGGC TACAAGGAAG AAATAATGTT TTTACTTGTA CCGCTGTGCT GACTGATATG
CTAACGGAGC ATGCAAAAGA ATCTTTTCCT GAAATATTTT CATTGTATTA TCCTGTTGCG
TGGCGTTGA
 
Protein sequence
MLPTTNISVN SGVISFESPV DSPSNEDVEV ALEKWCAEGE FSENRHEVAS KILDVISTNG 
ETLSISEPIT TLPDLLPGSL KELVLNGCTE LKSINCLPPN LSSLSMVGCS SLEVINCSIP
ENVINLSLCH CSSLKHIEGS FPEALRNSVY LNGCNSLNES QCQFLAYDVS QGRACLSKAE
LTADLIWLSA NRTGEESAEE LNYSGCDLSG LSLVGLNLSS VNFSGAVLDD TDLRMSDLSQ
AVLENCSFKN SILNECNFCY ANLSNCIIRA LFENSNFSNS NLKNASFKGS SYIQYPPILN
EADLTGAIII PGMVLSGAIL GDVKELFSEK SNTINLGGCY IDLSDIQENI LSVLDNYTKS
NKSILLTMNT SDDKYNHDKV RAAEELIKKI SLDELAAFRP YVKMSLADSF SIHPYLNNAN
IQQWLEPICD DFFDTIMSWF NNSIMMYMEN GSLLQAGMYF ERHPGAMVSY NSSFIQIVMN
GSRRDGMQER FRELYEVYLK NEKVYPVTQQ SDFGLCDGSG KPDWDDDSDL AYNWVLLSSQ
DDGMAMMCSL SHMVDMLSPN TSTNWMSFFL YKDGEVQNTF GYSLSNLFSE SFPIFSIPYH
KAFSQNFVSG ILDILISDNE LKERFIEALN SNKSDYKMIA DDQQRKLACV WNPFLDGWEL
NAQHVDMIMG SHVLKDMPLR KQAEILFCLG GVFCKYSSSD MFGTEYDSPE ILRRYANGLI
EQAYKTDPQV FGSVYYYNDI LDRLQGRNNV FTCTAVLTDM LTEHAKESFP EIFSLYYPVA
WR