Gene ECH74115_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0641 
Symbol 
ID6970240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp662415 
End bp665387 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content55% 
IMG OID643384679 
Productbacteriophage N4 receptor, outer membrane subunit 
Protein accessionYP_002269192 
Protein GI209397557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.369188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA 
TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC
GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGATAAGG CGCTGAAGGC ACAGAAAAAT
AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAGCAGGT GCCGGATAAT
ATTCCGCTGA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCA
CGGCTGTTGC TTGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT
CTGGCGGCTA TTCCGGTTGA AGTGAAAAGC ATTACAACTG TTGAAGAACT GCTTGCCCAA
CAAAAAGCGT GCGATGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTCGG GCAGAATGCC
CTGCGGCTGG CACAGTTACC TGTCGCCAGA GCGCAACTGA ACGACGCGAC GTTTGCTGCA
TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GGGCAATCTA CCTGAAACAA
TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA
GAACGCCGTC AGTGGTTTGA CGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCTGGCA
CTGCAATCAC AGGGGATCTT CACCGATCCT CAGTCATATA TTACTTACGC GACCGCGCTG
GCTTATCGTG GCGAAAAAGC ACGCCTCCAG CATTATCTCA TTGAAAATAA GCCGCTGTTT
ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCCAACCCC
GTTCAGGCAT TGGCGAATTA TACGGTGCAG TTTGCCGATA ACCGCCAGTA TGTTGTTGGC
GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC
ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATGCTG TCAGCGTGGC GACCCGTAAC
AAGGCTGAAG CTCTGCGTCT GGCACGCTTG CTGTATCAGC AAGAATCGGC AAATCTTACC
CGCCTGGATC AACTAACCTG GCAACTGATG CAGAACGAGC AGTCACGGGA AGCTGCCGAT
TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTA TCAGCCAGAC TTTAATGGCG
CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CGCCGGCGAA GGTGGCGATT
TTATCGAAAC CCTTACCGCT GGCGGAGCAA CGTCAGTGGC AAAGTCAGTT GCCGGGTATT
GCAGATAATT GCCCGGCAAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC
GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCGTTGTAT
GCATGGCTTC AGGCCGAACA ACGACAACCG AACGCCTGGC AACATCGTGC GGTAGCCTAT
CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCAGGCAGAA AATCAGTCTT
CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT
GGTGCGGCTC GCGATCGCTG GCTACAACAG GCAGAACAAC GTGGACTGGG AAGCAATGCC
CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAAC
GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA
ATTTATCGCC AACGTCATAA TGTCCCGGCC GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA
CTGGAACCGA ATAATAGCAA CACCCAGGCA GCGCTTGGTT ACGCCTTGTG GGATAGCGGT
GATATCGCAC AGTCGCGGGA AATGCTCGAA CAGGCGCATA AAGGGCTACC GGACGATCCG
GCACTGATCC GACAACTGGC CTATGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG
CACTACGCCC GGCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAC CCCACTGACC
CCAGAGCAAA ATCAGCAACG CTTCAATTTC CGCCGTCTGC ATGAGGAGGT CGGTCGCCGC
TGGACGTTCA GTTTCGATTC TTCCATCGGC TTGCGTTCCG GGGCAATGAG TACCGCTAAC
AATAATGTTG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGGCA ACTGGAAGCC
GAATACCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCAGT TTATAGCCGC
TTCTTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC
ACCGGTCTGC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCCTCGCCGT CGAACAGCAG
TTGCCGCTGA ACGGGCAAAA TGGCGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC
TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA
AACCTGTACC TCGATGCGGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT
TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC
GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGTGTCGGG
GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT
CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC
AACGCGTTTC TCACCATTGG AGTGCACTGG TAA
 
Protein sequence
MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN 
NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS
LAAIPVEVKS ITTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA
SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRILA
LQSQGIFTDP QSYITYATAL AYRGEKARLQ HYLIENKPLF TTDAQEKSWL YLLSKYSANP
VQALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYAVSVATRN
KAEALRLARL LYQQESANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARISQTLMA
RLASLLESHP YLATPAKVAI LSKPLPLAEQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA
AAWNRLAKCY RDTLPGVALY AWLQAEQRQP NAWQHRAVAY QAYQVEDYAT ALAARQKISL
HDMSNEDLLA AANTAQAAGN GAARDRWLQQ AEQRGLGSNA LYWWLHAQRY IPGQPELALN
DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNTQA ALGYALWDSG
DIAQSREMLE QAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALITPLT
PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA
EYRIGRNMLL EGDLLSVYSR FFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFLAVEQQ
LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD
YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS
LGVEYQHTFK AINQRNGERN NAFLTIGVHW