Gene ECH74115_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1086 
Symbol 
ID6968180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1114057 
End bp1115892 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content53% 
IMG OID643385098 
Producthypothetical protein 
Protein accessionYP_002269597 
Protein GI209399966 
COG category[S] Function unknown 
COG ID[COG2989] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0169645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.732672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTGTG GTCGTCGGCT GTCGGCAATC AGTTTGTGCC TGGCCGTAAC ATTCGCTCCA 
CTGTTCAATG CGCAGGCCGA TGAGCCTGAA GTAATCCCTG GCGACAGCCC GGTGGCTGTC
AGTGAACAGG GCGAGGCACT GCCGCAGGCG CAAGCCACGG CAATAATGGC GGGGATCCTG
CCATTGCCTG AAGGTGCGGC AGAAAAAGCC CGCACGCAAA TCGAATCTCA ATTACCCGCA
GGTTACAAGC CGGTTTATCT TAACCAGCTT CAACTGTTGT ATGCCGCACG CGATATGCAA
CCCATGTGGG AAAACCGTGA TGCTGTTAAA GCCTTCCAGC AACAGCTGGC AGAGGTGGCG
ATTGCCGGTT TCCAGCCGCA GTTTAATAAA TGGGTAGAGT TACTGACCGA TCCTGGTGTT
AACGGGATGG CACGCGACGT GGTGCTCTCT GATGCGATGA TGGGCTATCT CCATTTCATT
GCAAATATTC CGGTCAAAGG CACTCGCTGG CTATATAGCA GTAAACCTTA TGCGCTTGCA
ACGCCGCCGC TTTCGGTGAT TAACCAATGG CAGCAGGCGC TGGATAAAGG TCAATTGCCT
ACGTTTGTTG CAGGACTGGC ACCGCAGCAT CCGCAATATG CGGTGATGCA TGAATCGTTA
CTGGCCTTAC TCAGTGACAC CAAACCGTGG CCCCAACTGA CCGGCAAAGC AACGTTGCGC
CCAGGGCAGT GGAGTAACGA CGTACCGGCG TTGCGTGAAA TATTGCAACG CACAGGCATG
TTGGACGGGG GGCCGAAAAT TACTCTACCT GGCGATGACA CGCCAACTGA CGCGGTAGTC
AGCCCATCCG CTGTTACTGT TGAAACAGCA GAAACTAAGC CGATGGATAA GCAAACGACG
TCTCGTAGTA AACCTGCGCC TGCCGTTCGC GCCGCCTACG ATAATGAACT GGTGGAAGCC
GTTAAACGTT TTCAGGCATG GCAAGGATTG GGGGCAGATG GTGCTATTGG CCCGGCAACG
CGTGACTGGT TAAACGTAAC GCCCGCCCAG CGTGCTGGCG TGTTGGCTCT CAACATCCAG
CGATTGCGCT TGCTGCCAAC AGAGCTTTCT ACCGGGATCA TGGTTAACAT TCCGGCCTAT
TCGCTGGTCT ACTATCAGAA CGGCAATCAG GTGCTGGATT CGCGAGTCAT TGTCGGTCGC
CCCGATCGCA AAACGCCGAT GATGAGCAGT GCCCTTAACA ATGTAGTGGT AAACCCGCCG
TGGAACGTAC CTCCAACTCT GGCACGCAAA GATATTCTGC CGAAAGTGCG CAACGATCCG
GGATATCTCG AAAGCCATGG CTATACGGTG ATGCGCGGCT GGAACAGCAG AGAAGCGATT
GACCCATGGC AGGTTGACTG GTCTACAATC ACGGCCTCGA ATTTACCGTT TCGCTTCCAA
CAGGCTCCAG GCCCACGGAA CTCGCTGGGG CGCTATAAAT TCAATATGCC GAGTTCAGAG
GCCATTTATT TGCATGACAC GCCGAACCAC AATCTGTTCA AGCGTGATAC ACGCGCATTG
AGCTCAGGCT GTGTACGAGT GAATAAAGCT TCCGATCTGG CGAATATGCT GTTGCAGGAT
GCAGGCTGGA ATGACAAACG TATTTCTGAT GCGCTGAAGC AGGGTGATAC ACGTTACGTC
AATATTCGGC AGTCGATTCC GGTGAATCTC TACTACCTGA CGGCCTTTGT TGGTGCAGAT
GGTCGTACCC AGTATCGTAC AGATATTTAC AATTATGATC TGCCTGCGCG ATCCAGCTCG
CAAATCGTAT CGAAAGCGGA ACAATTAATC AGGTAA
 
Protein sequence
MMCGRRLSAI SLCLAVTFAP LFNAQADEPE VIPGDSPVAV SEQGEALPQA QATAIMAGIL 
PLPEGAAEKA RTQIESQLPA GYKPVYLNQL QLLYAARDMQ PMWENRDAVK AFQQQLAEVA
IAGFQPQFNK WVELLTDPGV NGMARDVVLS DAMMGYLHFI ANIPVKGTRW LYSSKPYALA
TPPLSVINQW QQALDKGQLP TFVAGLAPQH PQYAVMHESL LALLSDTKPW PQLTGKATLR
PGQWSNDVPA LREILQRTGM LDGGPKITLP GDDTPTDAVV SPSAVTVETA ETKPMDKQTT
SRSKPAPAVR AAYDNELVEA VKRFQAWQGL GADGAIGPAT RDWLNVTPAQ RAGVLALNIQ
RLRLLPTELS TGIMVNIPAY SLVYYQNGNQ VLDSRVIVGR PDRKTPMMSS ALNNVVVNPP
WNVPPTLARK DILPKVRNDP GYLESHGYTV MRGWNSREAI DPWQVDWSTI TASNLPFRFQ
QAPGPRNSLG RYKFNMPSSE AIYLHDTPNH NLFKRDTRAL SSGCVRVNKA SDLANMLLQD
AGWNDKRISD ALKQGDTRYV NIRQSIPVNL YYLTAFVGAD GRTQYRTDIY NYDLPARSSS
QIVSKAEQLI R