Gene Information |
Locus tag | EcE24377A_4621 |
Symbol | |
ID | 5590413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4620866 |
End bp | 4622158 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640928237 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001465569 |
Protein GI | 157158802 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
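The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length(4620866, 4622158)
print(length, protein_length(length))  # 1293 430
```

Under these assumptions the result agrees with the Gene Length (1293 bp) and Protein Length (430 aa) fields of the record.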
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTATA ATGGTTTAAA TAATATGTTT TTCTCTCTTT GCCAGATTAA CGATAACCAC TCTCTCACAA GTTCATCACA TACAAAGAAA ACAAAATCAT ATAATTACAG CAAACATCAT AAAAACACGT TAATTGACAA TAAAGCCCTC TCTCTTTTCA AAATGGATGA TCATGAAAAA GTGATAGGTT TGATTCAGAA AATGAAAAGA ATTTATGATA GTTTACCATC AGGAAAAATC ACGAAGGAAA CGGACAGGAA AATACATAAA CATTTTATAG ATATAGCTTT ATATGCAAAT AATAAATGTG ACGATAGAAT TACGAGAAGA GTTTACCTTA GTAAAGAAAA GGAAGTATCC ATTAAGGTGG TATATTTTAT AAATAATGTC GCCGCCCATA ATAATACTAT CGAAATTCCA CAGACAGTAA ATGGTGGTTA CGATTTTTCA CACCTTAGCC TGAAAGGTAT CGTGATTAAA GATGAAGATT TATCCAATTC GAATTTTGCA GGTTGCAGAC TACAAAACGC TATTTTCCAG GACTGTAATA TGTATAAAAC GAATTTTTAT TACGCCATAA TGGAAAAAAT ACTTTTTGAT AATTGTATTC TCGATGACTC AAATTTCGCT CAGATAAAAA TGGCCGACGG AACTCTAAAT GCATGCTCCG CTATGCATGT TCAATTCTAC AATGCAGCAA TGAATAGAGC CAATATTAAA AACACCTTCC TTGACTATTC AAATTTTTAT ATGGCGTACA TGGCTGAGGT AAATCTTTAT AAAGTAATAG CGCCATATGT TAATTTATTT AAAGCCGACC TTAGTTTCTC TAAACTCGAT TTAATTAACT TTGAACATGC TGATCTGTCT CGCGTCAATC TGAACAAAGC AATCCTCCAG AATATAAACT TAATTGATAG CAAACTCTTT TGTACGTGGC TGACAAATAC ATTCCTCGAA ATGGTTATAT GTACCGGCTC TAATATGGCT AATGTTAATT TTAATAATGC CAATTTAAGC AACTGCCATT TCAACTGTTC TATTTTAACA AAAGCCTGTA TGTTTAATAC CCGTCTCTAT CGGGTTAATT TTGATGAGGC TAGCGTCCAG GGAATGGGCA TTTCCATTCT CCGTGGGGAG GAAAATATCC CCATTGATAG TGATACCCTG GTAACACTAC AGAAATTCTT TGAAGAAGAT TGTACCTCTC ATACTGGCAT GTCACAAACT GAGGATAATA TTAATGCAGT CGCTATGAAG ATTACTGCAG ATATTATGCA ACACGCAGAT TGA
|
Protein sequence | MRYNGLNNMF FSLCQINDNH SLTSSSHTKK TKSYNYSKHH KNTLIDNKAL SLFKMDDHEK VIGLIQKMKR IYDSLPSGKI TKETDRKIHK HFIDIALYAN NKCDDRITRR VYLSKEKEVS IKVVYFINNV AAHNNTIEIP QTVNGGYDFS HLSLKGIVIK DEDLSNSNFA GCRLQNAIFQ DCNMYKTNFY YAIMEKILFD NCILDDSNFA QIKMADGTLN ACSAMHVQFY NAAMNRANIK NTFLDYSNFY MAYMAEVNLY KVIAPYVNLF KADLSFSKLD LINFEHADLS RVNLNKAILQ NINLIDSKLF CTWLTNTFLE MVICTGSNMA NVNFNNANLS NCHFNCSILT KACMFNTRLY RVNFDEASVQ GMGISILRGE ENIPIDSDTL VTLQKFFEED CTSHTGMSQT EDNINAVAMK ITADIMQHAD
|
| |
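The relation between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example and not the method used by the database: the helper names `clean`, `gc_content` and `translate` are hypothetical, and the codon table is built by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here. Whitespace is stripped first because the sequences above are printed in blocks of 10 characters.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes an unambiguous A/C/G/T sequence given on the coding strand;
    any trailing partial codon is ignored.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCGCTATA ATGGTTTAAA TAATATGTTT"))  # MRYNGLNNMF
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after removing its spaces) is one way to check the record for internal consistency; `gc_content` can be compared with the GC content field in the same way.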