Gene ECH74115_B0001 details


Gene Information

Locus tag: ECH74115_B0001
Symbol:
ID: 6966416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011350
Strand:
Start bp: 78551
End bp: 81247
Gene Length: 2697 bp
Protein Length: 898 aa
Translation table: 11
GC content: 45%
IMG OID: 643384017
Product: hypothetical protein
Protein accession: YP_002268496
Protein GI: 209395628
COG category:
COG ID:
TIGRFAM ID:
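
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 78551
END_BP = 81247
GENE_LENGTH_BP = 2697
PROTEIN_LENGTH_AA = 898


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 81247 - 78551 + 1 = 2697 bp, and 2697 / 3 - 1 = 898 aa.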


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.362485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.442936
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACTA AAATGAATGA GAGATGGAGA ACACCGATGA AATTAAAGTA TCTGTCATGT 
ACGATCCTTG CCCCTCTGGC GATTGGGGTA TTTTCTGCAA CAGCTGCTGA TAATAATTCA
GCCATTTATT TCAATACCTC CCAGCCTATA AATGATCTGC AGGGTTCGTT GGCCGCAGAG
GTGAAATTTG CACAAAGCCA GATTTTACCC GCCCATCCTA AAGAAGGGGA TAGTCAACCA
CATCTGACCA GCCTGCGGAA AAGTCTGCTG CTTGTCCGTC CGGTGAAAGC TGATGATAAA
ACACCTGTTC AGGTGGAAGC CCGCGATGAT AATAATAAAA TTCTCGGTAC GTTAACCCTT
TATCCTCCTT CATCACTACC GGATACAATC TACCATCTGG ATGGTGTTCC GGAAGGTGGT
ATCGATTTCA CACCTCATAA TGGAACGAAA AAGATCATTA ATACGGTGGC TGAAGTAAAC
AAACTCAGTG ATGCCAGCGG GAGTTCTATT CATAGCCATC TAACAAATAA TGCACTGGTG
GAGATCCATA CTGCAAATGG TCGTTGGGTA AGAGACATTT ATCTGCCGCA GGGACCCGAC
CTTGAAGGTA AGATGGTTCG CTTTGTTTCG TCTGCAGGCT ATAGTTCAAC GGTTTTTTAT
GGTGATCGAA AAGTCACACT CTCGGTGGGT AACACTCTTC TGTTCAAATA TGTAAATGGT
CAGTGGTTCC GCTCCGGTGA ACTGGAGAAT AATCGAATCA CTTATGCTCA GCATATTTGG
AGTGCTGAAC TGCCTGCGCA CTGGATCGTG CCTGGTTTAA ACTTGGTGAT TAAACAGGGC
AATCTGAGCG GTCGCCTAAA TGATATCAAG ATTGGAGCAC CGGGTGAGCT GTTGTTGCAT
ACAATTGATA TCGGGATGTT GACCACTCCC CGGGATCGCT TTGATTTTGC CAAAGACAAA
GAAGCACATA GGGAATATTT CCAGACCATT CCTGTAAGTC GTATGATTGT TAATAATTAT
GCGCCTCTAC ACCTAAAGGA AGTTATGTTA CCAACCGGAG AGTTATTGAC AGATATGGAT
CCAGGAAATG GTGGGTGGCA TAGTGGTACA ATGCGTCAAA GAATAGGTAA AGAATTGGTT
TCGCATGGCA TTGATAATGC TAACTATGGT TTAAATAGTA CCGCAGGCTT AGGGGAGAAT
AGTCATCCAT ATGTAGTTGC GCAATTAGCG GCACATAATA GCCGCGGTAA TTATGCTAAT
GGCATCCAGG TTCATGGTGG CTCCGGAGGT GGGGGAATTG TTACTTTAGA TTCCACATTG
GGGAATGAGT TCAGTCATGA AGTTGGTCAT AATTATGGTC TTGGTCATTA TGTAGATGGT
TTCAAGGGTT CTGTACATCG TAGTGCAGAA AATAACAACT CAACTTGGGG ATGGGATGGT
GATAAAAAAC GGTTTATTCC TAACTTTTAT CCGTCTCAAA CAAATGAAAA GAGTTGTCTG
AATAATCAGT GTCAAGAACC GTTTGATGGA CACAAATTTG GTTTTGACGC CATGGCGGGA
GGCAGCCCTT TCTCTGCTGC AAACCGTTTC ACAATGTATA CTCCGAATTC ATCGGCTATC
ATCCAGCGTT TTTTTGAAAA TAAAGCTGTG TTCGATAGCC GTTCCTCCAC CGGCTTCAGC
AAGTGGAATG CAGATACGCA GGAAATGGAA CCGTATGAAC ACACCATTGA CCGTGCGGAG
CAGATTACGG CTTCAGTCAA TGAGCTAAGT GAAAGCAAAA TGGCTGAGCT GATGGCAGAG
TACGCTGTCG TCAAAGTGCA TATGTGGAAC GGTAACTGGA CAAGAAACAT CTATATCCCT
ACAGCCTCCG CAGATAATAG AGGCAGTATC CTGACCATCA ACCATGAGGC CGGTTATAAT
AGTTATCTGT TTATAAATGG TGACGAAAAG GTCGTTTCCC AGGGGTATAA AAAGAGCTTT
GTTTCCGATG GTCAGTTCTG GAAAGAACGT GATGTGGTTG ATACTCGTGA AGCGCGTAAG
CCAGAGCAGT TTGGTGTTCC TGTGACGACC CTGGTGGGGT ATTACGATCC GGAAGGCACG
CTGTCAAGCT ACATCTATCC TGCGATGTAT GGTGCCTATG GCTTCACTTA TTCCGATGAT
AGTCAGAATC TATCCGATAA CGACTGCCAG CTGCAGGTGG ATACGAAAGA AGGGCAGTTG
CGATTCAGAC TGGCTAATCA CCGGGCTAAC AACACTGTAA TGAATAAGTT CCATATTAAC
GTGCCAACAG AAAGTCAGCC CACACAGGCC ACATTGGTTT GCAATAACAA GATACTGGAT
ACCAAATCGC TCACACCTGC GCCAGAAGGA CTTACCTATA CTGTAAATGG GCAGGCACTT
CCAGCAAAAG AAAACGAGGG ATGCATCGTG TCCGTGAATT CAGGTAAACG TTACTGTTTG
CCGGTTGGTC AACGGTCAGG ATATAGCCTT CCTGACTGGA TTGTTGGGCA GGAAGTCTAT
GTCGACAGCG GGGCTAAAGC GAAAGTGCTG CTTTCTGACT GGGATAACCT GTCCTATAAC
AGGATTGGTG AGTTTGTAGG TAATGTGAAC CCAGCTGATA TGAAAAAAGT TAAAGCCTGG
AACGGACAGT ATTTGGACTT CAGTAAACCT AGGTCAATGA GGGTTGTATA TAAATAA
 
Protein sequence
MNTKMNERWR TPMKLKYLSC TILAPLAIGV FSATAADNNS AIYFNTSQPI NDLQGSLAAE 
VKFAQSQILP AHPKEGDSQP HLTSLRKSLL LVRPVKADDK TPVQVEARDD NNKILGTLTL
YPPSSLPDTI YHLDGVPEGG IDFTPHNGTK KIINTVAEVN KLSDASGSSI HSHLTNNALV
EIHTANGRWV RDIYLPQGPD LEGKMVRFVS SAGYSSTVFY GDRKVTLSVG NTLLFKYVNG
QWFRSGELEN NRITYAQHIW SAELPAHWIV PGLNLVIKQG NLSGRLNDIK IGAPGELLLH
TIDIGMLTTP RDRFDFAKDK EAHREYFQTI PVSRMIVNNY APLHLKEVML PTGELLTDMD
PGNGGWHSGT MRQRIGKELV SHGIDNANYG LNSTAGLGEN SHPYVVAQLA AHNSRGNYAN
GIQVHGGSGG GGIVTLDSTL GNEFSHEVGH NYGLGHYVDG FKGSVHRSAE NNNSTWGWDG
DKKRFIPNFY PSQTNEKSCL NNQCQEPFDG HKFGFDAMAG GSPFSAANRF TMYTPNSSAI
IQRFFENKAV FDSRSSTGFS KWNADTQEME PYEHTIDRAE QITASVNELS ESKMAELMAE
YAVVKVHMWN GNWTRNIYIP TASADNRGSI LTINHEAGYN SYLFINGDEK VVSQGYKKSF
VSDGQFWKER DVVDTREARK PEQFGVPVTT LVGYYDPEGT LSSYIYPAMY GAYGFTYSDD
SQNLSDNDCQ LQVDTKEGQL RFRLANHRAN NTVMNKFHIN VPTESQPTQA TLVCNNKILD
TKSLTPAPEG LTYTVNGQAL PAKENEGCIV SVNSGKRYCL PVGQRSGYSL PDWIVGQEVY
VDSGAKAKVL LSDWDNLSYN RIGEFVGNVN PADMKKVKAW NGQYLDFSKP RSMRVVYK
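
The gene and protein sequences above can be compared by translating the nucleotide sequence. The sketch below is one possible way to do that in plain Python, not the method used by the database. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch ignores). The helper names `translate` and `gc_content` are hypothetical. Whitespace is stripped so the 10-base blocks shown above can be pasted in directly.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). These are shared by
# the standard code and translation table 11.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a pasted sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; 'X' marks unknown codons."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAACACTA AAATGAATGA GAGATGGAGA"))
```

Applied to the first 30 bases of the gene sequence, `translate` yields `MNTKMNERWR`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.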