Gene Information |
Locus tag | ECH74115_5571 |
Symbol | |
ID | 6971233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5210129 |
End bp | 5211421 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643389210 |
Product | hypothetical protein |
Protein accession | YP_002273607 |
Protein GI | 209396870 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
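The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration; they are not part of any database API.

```python
# Field values copied from the Gene Information rows above.
START_BP = 5210129
END_BP = 5211421
GENE_LENGTH_BP = 1293
PROTEIN_LENGTH_AA = 430


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # coordinates agree with Gene Length
print(computed_protein_length == PROTEIN_LENGTH_AA)  # Gene Length agrees with Protein Length
```

Under these assumptions both comparisons hold for this record: 5211421 - 5210129 + 1 = 1293 bp, and 1293 / 3 - 1 = 430 aa.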
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.422571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTATA ATGGTTTAAA TAATATGTTT TTCCCTCTTT GCCTGATTAA CGATAACCAC TCTGTCACAA GTCTATCACA TACAAAGAAA ACAAAATCAG ATAATTACAG CAAACATCAT AAAAACACGT TAATTGACAA TAAAGCCCTC TCTCTTTTCA AAATGGATGA TCATGAAAAA GTGATAGACT TGATTCAGAA AATGAAAAGA ATTTATGATA GTTTACCATC AGGAAAAATC ACGAAAGAAA CGGACAGGAA AATACATAAA TATTTTATAG ATATAGCTTC ATATGCAAAT AATAAATGTG ACGATAGAAT TACGAGAAGA GTTTACCTTA ATAAAGATAA GGAAGTGTCA ATTAAGGTGG TATATTTTAT AAATAATGTC ACCGTCCATA ATAATACTAT CGAAATCCCA CAGACAGTGA ATGGTGGTTA CGATTTTTCA CACCTTAGCC TGAAAGGTAT CGTGATTAAA GATGAAGATT TATCCAATTC GAATTTTGCA GGTTGCAGAC TACAAAACGC TATTTTCCAG GACTGTAATA TGTATAAAAC GAATTTTAAT TTCGCCATAA TGGAAAAAAT ACTTTTTGAT AATTGTATTC TCGATGACTC ATATTTCGCT CAGATAAAAA TGACTGACGG AACTCTAAAT TCATGCTCCG CTATGCATGT TCAATTCTAC AATGCAACAA TGAATAGAGC CAATATTAAA AATACCTTCC TTGATTATTC AAATTTTTAT ATGGCATACA TGGCTGAGGT AAATCTTTAT AAAGTAATAG CGCCATATAT TAATTTATTT AGAGCCGACC TTAGCTTCTC TAAACTTGAT TTAATTAACT TTAAACATGC TGATCTGTCT CGCGTCAATC TGAACAAAGC AATCCTCCAG AATATAAACT TAATTGATAG TAAACTCTTT TTTACACGGC TAACAAATAC GTTCCTCGAA ATGGTTATAT GCACCGATTC TAATATGGCT AATGTTAATT TTAATAATGC CAATTTAAAC AATTGCCATT TCAACTGTTC TGTTTTAACA AAAGCCTGGA TGTTTAATAC CCGTCTCTAT CGGGTTAATT TCGATGAGGC TAGCGTCCAG GGAATGGGCA TTTCCATTCT CCGTGGGGAG GAGAATATTC CCATTAATAG TGATACCCTG GTAACACTAC AGAAATTCTT TGAAGAAGAT TGTACCTCTC ATACTGGCAT GTCACAAACT GAGAATAATA CTCATGAAGT AGCTATGAAG ATTACTGCAG ATATTATGCA ACACGCAGAT TGA
|
Protein sequence | MRYNGLNNMF FPLCLINDNH SVTSLSHTKK TKSDNYSKHH KNTLIDNKAL SLFKMDDHEK VIDLIQKMKR IYDSLPSGKI TKETDRKIHK YFIDIASYAN NKCDDRITRR VYLNKDKEVS IKVVYFINNV TVHNNTIEIP QTVNGGYDFS HLSLKGIVIK DEDLSNSNFA GCRLQNAIFQ DCNMYKTNFN FAIMEKILFD NCILDDSYFA QIKMTDGTLN SCSAMHVQFY NATMNRANIK NTFLDYSNFY MAYMAEVNLY KVIAPYINLF RADLSFSKLD LINFKHADLS RVNLNKAILQ NINLIDSKLF FTRLTNTFLE MVICTDSNMA NVNFNNANLN NCHFNCSVLT KAWMFNTRLY RVNFDEASVQ GMGISILRGE ENIPINSDTL VTLQKFFEED CTSHTGMSQT ENNTHEVAMK ITADIMQHAD
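The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, to compute a GC fraction, and to translate a coding sequence, using only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model). Only the first three blocks of the gene sequence are used here as an illustration; the full sequence can be pasted in the same way.

```python
# Standard codon assignments, built from the compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_prefix = "ATGCGCTATA ATGGTTTAAA TAATATGTTT"
protein_prefix = "MRYNGLNNMF"

print(translate(gene_prefix) == protein_prefix)  # the prefix translates to the listed residues
print(round(gc_fraction(gene_prefix), 3))        # GC fraction of the prefix only
```

Note that the GC fraction of a 30-base prefix is not expected to match the GC content reported for the whole gene.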