Gene ECH74115_0243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0243 
Symbol 
ID6969966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp258862 
End bp260712 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content53% 
IMG OID643384314 
Producthypothetical protein 
Protein accessionYP_002268830 
Protein GI209396969 
COG category[S] Function unknown 
COG ID[COG3519] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03359] type VI secretion protein, VC_A0110 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.956418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTTG AAGAACGCTA TTTTCGGGAA GAACTCGATT ACCTGCGCCA GCTTAGCAAG 
CTGCTGGCAA CGGAAAAACC CCATCTGGCC CGCTTCCTGG CCGAAAAAGA TGCGGATCCG
GATATTGAAC GCCTGCTGGA AGGGGTGGCT TTTCTTACCG GCAATCTCCG CCAGAAAATT
GAGGATGAAT TCCCAGAACT GACGCACGGG CTTATTAAGA TGCTATGGCC TAATTACCTG
CGTCCGGTTC CGGCAATGAC CCTTATTGAA TATACGCCGG ATATGGATAA GTCTTCTGTA
CCGGTGTTAA TCCCCCGTAA TGAGCAGTTT ACAACCAACG CCGGGGAAAT CAGAGTTGAT
GAAGTGCTGC CCTCTGATGC TAAAAAGGAG GAGCCGCCTC CCTGTACCTT CACCCTCTGC
CGGGATATCT GGCTGCTGCC CGTTCGCCTG GAGCAGATTG AAAACCGCAG TACGACCCGT
AATGGTGTTA TCAACATCAC CTTTTCGGTC GCACCGGGAA CGGACTTCCG CACGCTGGAT
CTGAACAAAC TTCGCTTCTG GCTCGGCAAT GACGACAACT ATACCCGTGA CCAGCTTTAT
TTATGGTTCT GCGAATACTT GCAGGGTGCC GACCTGACTG TGGGTGAACA GCATATTCGC
CTGCCTGAGT TTATGCTAAA AGCTGTCGGT TTTGAGCCGC AGGATGCCAT GCTGCCCTGG
CCGAAAAACG TCCACAGCGG CTACCGGATC CTTCAGGAGT ATTTCTGTTA CCCCGATGCG
TTTCTCTTTT TTGATCTTTG TGGTTGTCCG GCTTTGCCTG ACGGATTGCA GGCGGAATTC
TTTACCCTGC AACTGCGTTT TTCGCGCCCT TTGCCCGTGG ACATCCGGCT GCGCCGCGAT
TCCCTGCGCC TGTATTGCGC ACCTGCCATT AATTTATTTA TCCACCATGC AGAAGCCATC
ACGCTGGACA ACCGGCGGGC AGACTATCCG CTGGTTCCCA GCCGCCATTA CCCACAACAT
TACGATGTAT TTTCCGTTAA CAGTGTGGTG AGCCAGGTCC AGGATATGTT CAGGAAAAAA
GATCTGGGGC GTCCTGTTTC GACGCAGGCC GCGCGCCAGT GGCCAGCCTT TGAAAGTTTC
AGCCATCAGA TGGAATACAG CCGGAAGCGG GAGGTGGTGT ACTGGCATCA CCGGACCAAA
ACATCCCTGT TCCATCGCGG CTTTGATCAT ACCCTTGCCT TTATACATGC TGATGGCAGT
TATCCGTCAG ACGAATCTCT GCTCAGTAAT GAAGTGGTTT CGGTATCGCT GACCTGTACC
AACCGTGAGC TTCCGTCACA AATTCGTTCC GGCGATATCA CCGGCACAAC CGGTAAAAAT
GCGGCTGTTG CTTCCTTTCG CAACATTACC CGCCCGACGC AACCACTCTG GCCGGTCATT
GATGGCAGCC TGCACTGGTC CCTGCTCTCC GCCATGAACC TGAATTATCT GTCATTACTG
GATACGGACG CGCTGAAGCA GGTCATCGCC AACTTTGATC GCCACGCAAT CCACCATCCG
CAGACGGCGC GGCTGTCACA ACAAAAGCTG GATGCCATTG AGCGTCTGGA GACCCGCCCC
GTTGATCGCC TGTTTACGGG TATTCCCGTC CGGGGACTGG CCTCCACGCT GTATCTGCAC
CCGGAGCCGT TTGTCTGTGA AGGGGAAATG TATCTGCTCG GTACGGTGCT TTCGCATTTT
CTGTCGCTGT ACGCCAGCGT TAACTCATTC CACATGCTGA CCGTTGTGAA CACAGAAAGC
CAGGAGACAT GGAAATGGAC GGAAAGAATC GGGCAGCATC CTCTTATCTG A
 
Protein sequence
MEFEERYFRE ELDYLRQLSK LLATEKPHLA RFLAEKDADP DIERLLEGVA FLTGNLRQKI 
EDEFPELTHG LIKMLWPNYL RPVPAMTLIE YTPDMDKSSV PVLIPRNEQF TTNAGEIRVD
EVLPSDAKKE EPPPCTFTLC RDIWLLPVRL EQIENRSTTR NGVINITFSV APGTDFRTLD
LNKLRFWLGN DDNYTRDQLY LWFCEYLQGA DLTVGEQHIR LPEFMLKAVG FEPQDAMLPW
PKNVHSGYRI LQEYFCYPDA FLFFDLCGCP ALPDGLQAEF FTLQLRFSRP LPVDIRLRRD
SLRLYCAPAI NLFIHHAEAI TLDNRRADYP LVPSRHYPQH YDVFSVNSVV SQVQDMFRKK
DLGRPVSTQA ARQWPAFESF SHQMEYSRKR EVVYWHHRTK TSLFHRGFDH TLAFIHADGS
YPSDESLLSN EVVSVSLTCT NRELPSQIRS GDITGTTGKN AAVASFRNIT RPTQPLWPVI
DGSLHWSLLS AMNLNYLSLL DTDALKQVIA NFDRHAIHHP QTARLSQQKL DAIERLETRP
VDRLFTGIPV RGLASTLYLH PEPFVCEGEM YLLGTVLSHF LSLYASVNSF HMLTVVNTES
QETWKWTERI GQHPLI