Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0585 |
Symbol | |
ID | 5589845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 608971 |
End bp | 613773 |
Gene Length | 4803 bp |
Protein Length | 1600 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640924303 |
Product | YD repeat-containing protein |
Protein accession | YP_001461729 |
Protein GI | 157158186 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTCAC AGCAGGCGAT GGTCGCCAGC CAGATGAAAC GGGCAGCCAA CGACAAAGCC ATTGCACAGA TGTTGGCATC AAAAAAGTCC GGCCCCCCCG CCGCCAGGCT GGGCGATGAA ATTCAGCACA AGAGTTTTTT GGGGGCGCAG GCAGGGGCCG TGCTGGGGGC GATAGTGACC ATCGCAGAAG GTTGCCTGAT TATGGCCGCC TGTGCCACCG GCCCTTATGC GCTGGTTCTG GTGCCTGCGC TGATGTATGC CAGCTATAAG GCGAGTGATT ATGTGGAGGA GAAACAGAAC CAGCTTGAAT CATGGATAAA CAGCTTTTGT GACACGGACG GCGCCATCAA TACCGGTTCT GAAAACGTAA AAATTAACGG CGAGCTGGCC GCACGTGCAG CGGTCACCCT TCCCCCTCCT CCCCCACCTG GAGCAATACC TGAAGTCCCA CAGGGGGAAC CCTCATGGGG TGATATTGCC ACTGACCTGC TTGAATCGGC AGCGGAAAAA GCAGTACCAC TGGCGAAGGC CTGGGAGAAC GCTGTTATCA CCCTGACGGA CAGCAATGCC GGTTTTATGG ATCGCGTCAG CGCCGGCGCA TCGCTTCTGT TTCCCGCCGG TCCGGTATTA ATGGAGTTTG CCACCATGGT GGGCGGGCGT GGCGAAATCA AAAAAGAGGT GGATTTCCCG GAAGCCGGTG AGGACACGGC GCTCTGCGAC AAGGAGAACA AACCACCGAG GATAGCCCAG GGCAGCAGCA ACGTCTTTAT CAACAATCAG CCTGCCGCGC GCAAGGGCGA CAAACTGGAG TGCAGTGCGG CAATCGTGGG AGGTTCGCCG GACGTCTTTA TTGGCGGTGA GCAGGTCACA TATCTGGATA TCCAGCCGGA GTTCCCGCCG TGGCAGAGAA TGATCCTGGG GGGAATAACG ATAGCCAGCT ATCTTCTGCC GCCTGCGGGA CTGCTGGGAA AACTGAAGAA TCTGGCGAGA CTCGGCAAAC TGGGAAACCT GCTGGGGAAA AGCGGGAAGC TGCTGGGCGC AAAGCTCGGC GCGTTGCTGG GGAAAACAGG TAAGTCGTTA AAAAGTATTG CCAATAAAGT CATCAGATGG GTAACAGATC CTGTCGATCC GGTAACCGGC GCATACTGCG ACGAACGTAC CGACTTCACC CTGGGCCAGA CCCTCCCCCT CTCCTTCACC CGTTTCCACA GTTCTGTACT GCCGCTGCAT GGCCTGACGG GCGTGGGCTG GAGCGACTCC TGGAGCGAAT ACGCCTGGGT GCGTGAACAG GGAAACCGGG TGGATATCAT CAGCCAGGGA GCCACGCTGA GATTTGCCTT CGACGGTGAC AGTGATACGA CGGTTAACCC GTATCACGCC CAGTACATTC TGCGCCGCCG CGATGATTAT CTGGAGCTGT TCGACAGGGA TGCACTGAGC AGCCGCTTCT TTTATGACGC CTTTCCGGGA ATGCGTCTGC GCCACCCGGT GACTGACGAT ACCAGCGATG ACCGCCTGGC ACACAGCCCC AATGACCGGA TGTACATGCT GGGCGGGATG AGCGACACCG CCAGCAACCG CATCACGTTT GAGCGCGACA GCCAGTACCG GATCACGGGT GTCAGTCACA CCGACGGGAT CCGGCTTAAA CTGACGTACC ACGCCAGCGG CTACCTGAAA GCCATTCACC GCACGGATAA CGGCATACAG ACGCTGGCGA CCTACGAACA GGATGCGCGG GGGCGGCTGA CAGAAGCGGA TGCGCGGCTG GACTACCACC TGTTTTATGA GTACGACGCT GCGGACCGGA TCATCCGCTG GTCCGATAAC GACCAGACGT GGAGCCGTTT CACCTACGAT GAACAGGGCC GGTGCGTGAA TGTCACCGGG GCGGAGGGCT ATTACAACGC CACGCTGGAC TATGGTGACG GCTGCACCAC CGTGACGGAC GGCAAGGGCA CTCACCGTTA TTACTATGAT CCTGACGGCA ATATTCTGCG GGAAGAAGCG CCGGACGGCA GCACCACCAC GTATGAATGG GATGAATTCC ATCACCTGCT GGCCCGTCAC TCCCCTGCCG GGCGGGTGGA GAAGTTTGAA TACAACGCCG CACTCGGTCA GTTAAGCCGT TACACGGCGG CAGACGGCGC GGAGTGGCTG TACCGCTATG ATGAGCGCGG CCTGCTCAGC AACATCACCG ACCCAGCCGG GCAGACGTGG ACACAGCAGT GTGATGAACG CGGCCTGCCG GTAAGTCTGG TGTCGCCACA GGGCGAAGAG ACCCGGCTGG CATACACCGC TCAGGGGCTG CTGTCGGGGA TATTCCGCCA GGATGAACGG CGTCTGGGCA TAGAGTACGA CCACCACAAC CGGCCGGAAA CACTCACCGA CGTGATGGGC CGTGAACACC ACACCGAATA CAGCGGTCAC GACCTGCCGG TGAAGATGCG CGGCCCCGGC GGTCAGTCAG TGCGGTTACA GTGGCAGCAG CACCATAAAC TGAGCGGCAT TGAACGTGCT GGAACCGGCG CGGAAGGATT CCGCTATGAC CGCCACGGCA ACCTGCTGGC GTACACGGAC GGTAACGGCG TTGTCTGGAC AATGGAATAC GGCCCGTTCG ATTTGCCGGT GGCGCGAACG GACGGTGAAG GCCACCGCTG GCAGTACCGC TACGATAAAG ACACGCTGCA ACTGACAGAA GTCATCAATC CGCAGGGCGA GTCATACCGT TATATTCTGG ACAACTGTGG ACGGGTGACG GAAGAGCGTG ACTGGGGCGG CGTGGTCTGG CGTTACCGCT ATGACGCCGA TGGCCTCTGT ACCGCCAGGG TCAACGGTCT GGAGGAAACC ATCCTCTACA GCCGGGACGC CGCAGGCCGC CTGGCAGAAA TCATCACGCC GGAAGGCAAA ACGCAGTATG CGTATGACAA ATCCGGCAGG CTGACGGGTA TCTTCAGCCC GGACGGCATA TCACAGCGCA CCGGCTATGA CGAACGCGGG CGGGTGAATG TCACCACTCA GGGCCGACGG GCCATTGAAT ACCACCACCC CGATGAACAC ACCGTCATCC GCTGTATCCT GCCACCGGAA GATGAACGCG ACAGACATCC CGATGAATCC CTGCTGAAAA CCACCTACCG TTACAACGCC GCCGGAGAAC TGACGGAGAT TATCCTGCCG GGGGATGAGA CGCTGACGTT CAGCCGTGAT GAGGCGGGAC GTGAAGTGCT CCGGCACAGT AACCGGGGTT TTGCCTGTGA ACAGGGCTGG AATGCAGCCG GTCAGCCTGT CAGCCAGCGC GCCGGATTTT TTCCGGCAGA AGCCACATGG GGCGGGCTGG TTCCGTCACT GGTACGGGAG TACCGTTACG ACAGCGCGGG TAACGTGTCA GGCGTCACCA GCCGGGAAGA TTACGGACGG GAAACACGGC GGGAGTACCG GCTGGACCGG AACGGCCAGG TCACGGCGGT GACAGCCTCA GGCACCGGGC TGGGCTATGG CGAAGGCGAC GAGTCCTATG GCTATGACAG TTGTGGCTAC CTGAAGGCGC AGTCTGCGGG CAGGCACCGG ATAAGTGAAG AGACTGACCA GTATGCCGGA GGCCACCGGC TGAAACAGGC CGGAAACACG CAGTATGACT ATGACGCCGC AGGCCGGATG GTCAGCCGGA CAAAACACCG TGACGGCTAC CGCCCGGAAA CAGAGCGGTT CCGGTGGGAC AGCCTGGACC AGCTGACCGG GTATTGCAGC GCACAGGGGG AGCAGTGGGA ATACCGCTAC GACGCCAGCG GCAGGCGGAC GGAAAAACGC TGCGACCGGA AGAAAATCCG TTTTACGTAC CTGTGGGATG GCGACAGTAT TGCGGAAATC CGGGAATACC GCGATGATAA ACTGTACAGC GTAAGGCACC TGGTGTTTAA CGGCTTTGAG CTGATAAGCC AGCAGTTCAG CCGGGTACGG CAGCCGCACC CGTCCGTGGC CCCGCAGTGG GTGACGCGGA CGAATCACGC GGTGAGCGAC CTGACGGGCC GCCCGCTGAT GCTCTTTAAC AGTGAAGGTA AAACCGTCTG GCGACCGGGG CAGACCAGCC TGTGGGGGCT GGCACTCAGT CTGCCCGCAG ACACCGGCTA CCCGGACCCG CGCGGGGAAC TGGACCCGGA AGCCGACCCC GGCCTGCTGT ATGCAGGACA GTGGCAGGAT GCGGAATCGG GGCTGTGCTA TAACCGGTTC CGGTATTACG AGCCGGAAAC CGGAATGTAC CTGGTGAGTG ATCCGCTGGG GTTGCTGGGC GGGGAGCAGA CTTACCGGTA TGTGCCGAAT CCTTGTGGGT GGGTTGATCC GCTGAGATTG GCTGCAAGTT CTAAAATCAG CAGTTTGATG GACTATATTG GCGATGGTCG TCGTGTTAGT GGGCATACGG GTTTCCTGGA TGGGGTTCGT TTATCACGTA GTCAAATAAA CAATATTGCT AAAGAAATGG AGAAGCTAGG AATTAAAGTA ATAAGGAAAG CAGATAAATA TTTGCCACCA AATGCTAGGG CAGCTTTTGA TTATGGCCTT CGCAATATTT ATCTTAGGAA AAATGCTACC TTATATGAGG TGTATCATGA AGTGATTCAT GCTAAGCAAT TTGCGAAAAT TGGACGAGAA GCATACGAAG CACTAGGACG TTTATCTAGG GAGAAACACG TTCTAAATGA AATATTAAAA AGTAAAAATT TATTCAATGA AGCGGAAATA GCTCATGCCA TAAAATATGT TGAGGGATTG AGAGAAAAAT TCATGATGGG ACTAACAAAT TGA
|
Protein sequence | MLSQQAMVAS QMKRAANDKA IAQMLASKKS GPPAARLGDE IQHKSFLGAQ AGAVLGAIVT IAEGCLIMAA CATGPYALVL VPALMYASYK ASDYVEEKQN QLESWINSFC DTDGAINTGS ENVKINGELA ARAAVTLPPP PPPGAIPEVP QGEPSWGDIA TDLLESAAEK AVPLAKAWEN AVITLTDSNA GFMDRVSAGA SLLFPAGPVL MEFATMVGGR GEIKKEVDFP EAGEDTALCD KENKPPRIAQ GSSNVFINNQ PAARKGDKLE CSAAIVGGSP DVFIGGEQVT YLDIQPEFPP WQRMILGGIT IASYLLPPAG LLGKLKNLAR LGKLGNLLGK SGKLLGAKLG ALLGKTGKSL KSIANKVIRW VTDPVDPVTG AYCDERTDFT LGQTLPLSFT RFHSSVLPLH GLTGVGWSDS WSEYAWVREQ GNRVDIISQG ATLRFAFDGD SDTTVNPYHA QYILRRRDDY LELFDRDALS SRFFYDAFPG MRLRHPVTDD TSDDRLAHSP NDRMYMLGGM SDTASNRITF ERDSQYRITG VSHTDGIRLK LTYHASGYLK AIHRTDNGIQ TLATYEQDAR GRLTEADARL DYHLFYEYDA ADRIIRWSDN DQTWSRFTYD EQGRCVNVTG AEGYYNATLD YGDGCTTVTD GKGTHRYYYD PDGNILREEA PDGSTTTYEW DEFHHLLARH SPAGRVEKFE YNAALGQLSR YTAADGAEWL YRYDERGLLS NITDPAGQTW TQQCDERGLP VSLVSPQGEE TRLAYTAQGL LSGIFRQDER RLGIEYDHHN RPETLTDVMG REHHTEYSGH DLPVKMRGPG GQSVRLQWQQ HHKLSGIERA GTGAEGFRYD RHGNLLAYTD GNGVVWTMEY GPFDLPVART DGEGHRWQYR YDKDTLQLTE VINPQGESYR YILDNCGRVT EERDWGGVVW RYRYDADGLC TARVNGLEET ILYSRDAAGR LAEIITPEGK TQYAYDKSGR LTGIFSPDGI SQRTGYDERG RVNVTTQGRR AIEYHHPDEH TVIRCILPPE DERDRHPDES LLKTTYRYNA AGELTEIILP GDETLTFSRD EAGREVLRHS NRGFACEQGW NAAGQPVSQR AGFFPAEATW GGLVPSLVRE YRYDSAGNVS GVTSREDYGR ETRREYRLDR NGQVTAVTAS GTGLGYGEGD ESYGYDSCGY LKAQSAGRHR ISEETDQYAG GHRLKQAGNT QYDYDAAGRM VSRTKHRDGY RPETERFRWD SLDQLTGYCS AQGEQWEYRY DASGRRTEKR CDRKKIRFTY LWDGDSIAEI REYRDDKLYS VRHLVFNGFE LISQQFSRVR QPHPSVAPQW VTRTNHAVSD LTGRPLMLFN SEGKTVWRPG QTSLWGLALS LPADTGYPDP RGELDPEADP GLLYAGQWQD AESGLCYNRF RYYEPETGMY LVSDPLGLLG GEQTYRYVPN PCGWVDPLRL AASSKISSLM DYIGDGRRVS GHTGFLDGVR LSRSQINNIA KEMEKLGIKV IRKADKYLPP NARAAFDYGL RNIYLRKNAT LYEVYHEVIH AKQFAKIGRE AYEALGRLSR EKHVLNEILK SKNLFNEAEI AHAIKYVEGL REKFMMGLTN
|
| |