Gene ECH74115_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1302 
Symbol 
ID6968636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1318160 
End bp1319545 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content43% 
IMG OID643385289 
Producthypothetical protein 
Protein accessionYP_002269784 
Protein GI209398579 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.414207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTTTAC TTGCACCAGA AAATCCTTAT CCGATATATG CACTACCACC GCTGGTGAGA 
AATGCAATAA TTGAAACTCA AAAAAATACA CAGGCACCTT TGGCTATGGT GGCAACATCC
GCATTAACAG CGATCTCAAT TGCCTGTCAG AATCAGATTG ACGTGTGCAG ACCTGGAAAT
TTACATGGGC CTGTTAATCT TTACTCTCTG ATTCTGGCTG ATTCCGGTGA AAGGAAAACG
ACCGTGGATA AAGTGTTTAT GAAAGCATTT TATCTCAGGG ATGAAGCCCT GGCGGATGAA
TACGCGAAAC TGGTTGAGAA TTACAGTACA GAAAAGGAAA TATGGGAGCA AAAACAAAAA
GCGCTGGAAT CAAAATTTCA TAAAGAGATT CGTGCCGGTA AAGATTACAA GGCAACAGAA
TCAGAGCTTG AAACGCATCT GAATAAACCT CCTGTTCCGC CGCAGATACG TCGAACAATT
TTTAATGAGA CAACGATAGA GGGAATGTTA AAATATTACT CCGATAGCAA TCGCTCTTTT
GCTCTTGTAT CCAGTGAAGG GGGGGTAATT TTTGACAGCC GGGCCATGAG TAAACTGGGA
ATTATTAACA CTTTGTGGGA TGGAGGTTCT CTTTTCATCG ACAGGAAATC ATCTCCCGGA
ATTAATTTGA AGGAGCCAAG ACTGACGATA TCGGTGATGA TTCAGCCTGA TGTTTACCAC
AAAGGTTTTT GTACGCGAAA AAAAGAAATT GTGAAAACGT CAGGACATCA TGCAAGGTTT
TTGATGTGTC AACCAACATC AACGCAGGGG ACAAGGATAA TGATCAATGA GCTTATTGAT
GAAAGTCTGG CGATGAGTGG TGAACGACGT TGCCTTCACT TTTCTCCTCA GGCTGCCAGA
ATCTGGACGG ATTATTACAA TGATGTGGAA TCGAAGCTGG GGGGATTGGG GCCTTTAAGA
CATTGCCGGG AATATGCTGC TAAAAATGCA GAGTATATGG CAAGACTGGC AGGACTTATT
TACCATTCGA GCGGTGAAGA GGGGGAGATT TCCCCTTACA CTGCAGAAAT GGCGAGAGAA
TTAGCAATAT GGTACGGTAA TGAGTATGTG CGGTTGTCTA ATCCGTTAAC TTTTGACAAC
TCTGCCCTGA CCGTACCTGT GCGACTCATT CCAGAGGAGC TTGAACTTTT CAACTGGATA
AAAAGCTATT GTATTGAGAA GGGGATCCTC TGTATGAAAA AAAATGACAT TTTACAGCGT
GGTCCGAACC GTTTCCGGAA AAAGGATAAA ATCAACTGGT TACTGGATTT ACTGTATGAA
CAAAACAGAG TTGTACCGGT TATTGAGGGA AAAACGTTGT GTGTCGCACC TAACTTTGAC
CTCTGA
 
Protein sequence
MCLLAPENPY PIYALPPLVR NAIIETQKNT QAPLAMVATS ALTAISIACQ NQIDVCRPGN 
LHGPVNLYSL ILADSGERKT TVDKVFMKAF YLRDEALADE YAKLVENYST EKEIWEQKQK
ALESKFHKEI RAGKDYKATE SELETHLNKP PVPPQIRRTI FNETTIEGML KYYSDSNRSF
ALVSSEGGVI FDSRAMSKLG IINTLWDGGS LFIDRKSSPG INLKEPRLTI SVMIQPDVYH
KGFCTRKKEI VKTSGHHARF LMCQPTSTQG TRIMINELID ESLAMSGERR CLHFSPQAAR
IWTDYYNDVE SKLGGLGPLR HCREYAAKNA EYMARLAGLI YHSSGEEGEI SPYTAEMARE
LAIWYGNEYV RLSNPLTFDN SALTVPVRLI PEELELFNWI KSYCIEKGIL CMKKNDILQR
GPNRFRKKDK INWLLDLLYE QNRVVPVIEG KTLCVAPNFD L