Gene EcE24377A_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0585 
Symbol 
ID5589845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp608971 
End bp613773 
Gene Length4803 bp 
Protein Length1600 aa 
Translation table11 
GC content58% 
IMG OID640924303 
ProductYD repeat-containing protein 
Protein accessionYP_001461729 
Protein GI157158186 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTCAC AGCAGGCGAT GGTCGCCAGC CAGATGAAAC GGGCAGCCAA CGACAAAGCC 
ATTGCACAGA TGTTGGCATC AAAAAAGTCC GGCCCCCCCG CCGCCAGGCT GGGCGATGAA
ATTCAGCACA AGAGTTTTTT GGGGGCGCAG GCAGGGGCCG TGCTGGGGGC GATAGTGACC
ATCGCAGAAG GTTGCCTGAT TATGGCCGCC TGTGCCACCG GCCCTTATGC GCTGGTTCTG
GTGCCTGCGC TGATGTATGC CAGCTATAAG GCGAGTGATT ATGTGGAGGA GAAACAGAAC
CAGCTTGAAT CATGGATAAA CAGCTTTTGT GACACGGACG GCGCCATCAA TACCGGTTCT
GAAAACGTAA AAATTAACGG CGAGCTGGCC GCACGTGCAG CGGTCACCCT TCCCCCTCCT
CCCCCACCTG GAGCAATACC TGAAGTCCCA CAGGGGGAAC CCTCATGGGG TGATATTGCC
ACTGACCTGC TTGAATCGGC AGCGGAAAAA GCAGTACCAC TGGCGAAGGC CTGGGAGAAC
GCTGTTATCA CCCTGACGGA CAGCAATGCC GGTTTTATGG ATCGCGTCAG CGCCGGCGCA
TCGCTTCTGT TTCCCGCCGG TCCGGTATTA ATGGAGTTTG CCACCATGGT GGGCGGGCGT
GGCGAAATCA AAAAAGAGGT GGATTTCCCG GAAGCCGGTG AGGACACGGC GCTCTGCGAC
AAGGAGAACA AACCACCGAG GATAGCCCAG GGCAGCAGCA ACGTCTTTAT CAACAATCAG
CCTGCCGCGC GCAAGGGCGA CAAACTGGAG TGCAGTGCGG CAATCGTGGG AGGTTCGCCG
GACGTCTTTA TTGGCGGTGA GCAGGTCACA TATCTGGATA TCCAGCCGGA GTTCCCGCCG
TGGCAGAGAA TGATCCTGGG GGGAATAACG ATAGCCAGCT ATCTTCTGCC GCCTGCGGGA
CTGCTGGGAA AACTGAAGAA TCTGGCGAGA CTCGGCAAAC TGGGAAACCT GCTGGGGAAA
AGCGGGAAGC TGCTGGGCGC AAAGCTCGGC GCGTTGCTGG GGAAAACAGG TAAGTCGTTA
AAAAGTATTG CCAATAAAGT CATCAGATGG GTAACAGATC CTGTCGATCC GGTAACCGGC
GCATACTGCG ACGAACGTAC CGACTTCACC CTGGGCCAGA CCCTCCCCCT CTCCTTCACC
CGTTTCCACA GTTCTGTACT GCCGCTGCAT GGCCTGACGG GCGTGGGCTG GAGCGACTCC
TGGAGCGAAT ACGCCTGGGT GCGTGAACAG GGAAACCGGG TGGATATCAT CAGCCAGGGA
GCCACGCTGA GATTTGCCTT CGACGGTGAC AGTGATACGA CGGTTAACCC GTATCACGCC
CAGTACATTC TGCGCCGCCG CGATGATTAT CTGGAGCTGT TCGACAGGGA TGCACTGAGC
AGCCGCTTCT TTTATGACGC CTTTCCGGGA ATGCGTCTGC GCCACCCGGT GACTGACGAT
ACCAGCGATG ACCGCCTGGC ACACAGCCCC AATGACCGGA TGTACATGCT GGGCGGGATG
AGCGACACCG CCAGCAACCG CATCACGTTT GAGCGCGACA GCCAGTACCG GATCACGGGT
GTCAGTCACA CCGACGGGAT CCGGCTTAAA CTGACGTACC ACGCCAGCGG CTACCTGAAA
GCCATTCACC GCACGGATAA CGGCATACAG ACGCTGGCGA CCTACGAACA GGATGCGCGG
GGGCGGCTGA CAGAAGCGGA TGCGCGGCTG GACTACCACC TGTTTTATGA GTACGACGCT
GCGGACCGGA TCATCCGCTG GTCCGATAAC GACCAGACGT GGAGCCGTTT CACCTACGAT
GAACAGGGCC GGTGCGTGAA TGTCACCGGG GCGGAGGGCT ATTACAACGC CACGCTGGAC
TATGGTGACG GCTGCACCAC CGTGACGGAC GGCAAGGGCA CTCACCGTTA TTACTATGAT
CCTGACGGCA ATATTCTGCG GGAAGAAGCG CCGGACGGCA GCACCACCAC GTATGAATGG
GATGAATTCC ATCACCTGCT GGCCCGTCAC TCCCCTGCCG GGCGGGTGGA GAAGTTTGAA
TACAACGCCG CACTCGGTCA GTTAAGCCGT TACACGGCGG CAGACGGCGC GGAGTGGCTG
TACCGCTATG ATGAGCGCGG CCTGCTCAGC AACATCACCG ACCCAGCCGG GCAGACGTGG
ACACAGCAGT GTGATGAACG CGGCCTGCCG GTAAGTCTGG TGTCGCCACA GGGCGAAGAG
ACCCGGCTGG CATACACCGC TCAGGGGCTG CTGTCGGGGA TATTCCGCCA GGATGAACGG
CGTCTGGGCA TAGAGTACGA CCACCACAAC CGGCCGGAAA CACTCACCGA CGTGATGGGC
CGTGAACACC ACACCGAATA CAGCGGTCAC GACCTGCCGG TGAAGATGCG CGGCCCCGGC
GGTCAGTCAG TGCGGTTACA GTGGCAGCAG CACCATAAAC TGAGCGGCAT TGAACGTGCT
GGAACCGGCG CGGAAGGATT CCGCTATGAC CGCCACGGCA ACCTGCTGGC GTACACGGAC
GGTAACGGCG TTGTCTGGAC AATGGAATAC GGCCCGTTCG ATTTGCCGGT GGCGCGAACG
GACGGTGAAG GCCACCGCTG GCAGTACCGC TACGATAAAG ACACGCTGCA ACTGACAGAA
GTCATCAATC CGCAGGGCGA GTCATACCGT TATATTCTGG ACAACTGTGG ACGGGTGACG
GAAGAGCGTG ACTGGGGCGG CGTGGTCTGG CGTTACCGCT ATGACGCCGA TGGCCTCTGT
ACCGCCAGGG TCAACGGTCT GGAGGAAACC ATCCTCTACA GCCGGGACGC CGCAGGCCGC
CTGGCAGAAA TCATCACGCC GGAAGGCAAA ACGCAGTATG CGTATGACAA ATCCGGCAGG
CTGACGGGTA TCTTCAGCCC GGACGGCATA TCACAGCGCA CCGGCTATGA CGAACGCGGG
CGGGTGAATG TCACCACTCA GGGCCGACGG GCCATTGAAT ACCACCACCC CGATGAACAC
ACCGTCATCC GCTGTATCCT GCCACCGGAA GATGAACGCG ACAGACATCC CGATGAATCC
CTGCTGAAAA CCACCTACCG TTACAACGCC GCCGGAGAAC TGACGGAGAT TATCCTGCCG
GGGGATGAGA CGCTGACGTT CAGCCGTGAT GAGGCGGGAC GTGAAGTGCT CCGGCACAGT
AACCGGGGTT TTGCCTGTGA ACAGGGCTGG AATGCAGCCG GTCAGCCTGT CAGCCAGCGC
GCCGGATTTT TTCCGGCAGA AGCCACATGG GGCGGGCTGG TTCCGTCACT GGTACGGGAG
TACCGTTACG ACAGCGCGGG TAACGTGTCA GGCGTCACCA GCCGGGAAGA TTACGGACGG
GAAACACGGC GGGAGTACCG GCTGGACCGG AACGGCCAGG TCACGGCGGT GACAGCCTCA
GGCACCGGGC TGGGCTATGG CGAAGGCGAC GAGTCCTATG GCTATGACAG TTGTGGCTAC
CTGAAGGCGC AGTCTGCGGG CAGGCACCGG ATAAGTGAAG AGACTGACCA GTATGCCGGA
GGCCACCGGC TGAAACAGGC CGGAAACACG CAGTATGACT ATGACGCCGC AGGCCGGATG
GTCAGCCGGA CAAAACACCG TGACGGCTAC CGCCCGGAAA CAGAGCGGTT CCGGTGGGAC
AGCCTGGACC AGCTGACCGG GTATTGCAGC GCACAGGGGG AGCAGTGGGA ATACCGCTAC
GACGCCAGCG GCAGGCGGAC GGAAAAACGC TGCGACCGGA AGAAAATCCG TTTTACGTAC
CTGTGGGATG GCGACAGTAT TGCGGAAATC CGGGAATACC GCGATGATAA ACTGTACAGC
GTAAGGCACC TGGTGTTTAA CGGCTTTGAG CTGATAAGCC AGCAGTTCAG CCGGGTACGG
CAGCCGCACC CGTCCGTGGC CCCGCAGTGG GTGACGCGGA CGAATCACGC GGTGAGCGAC
CTGACGGGCC GCCCGCTGAT GCTCTTTAAC AGTGAAGGTA AAACCGTCTG GCGACCGGGG
CAGACCAGCC TGTGGGGGCT GGCACTCAGT CTGCCCGCAG ACACCGGCTA CCCGGACCCG
CGCGGGGAAC TGGACCCGGA AGCCGACCCC GGCCTGCTGT ATGCAGGACA GTGGCAGGAT
GCGGAATCGG GGCTGTGCTA TAACCGGTTC CGGTATTACG AGCCGGAAAC CGGAATGTAC
CTGGTGAGTG ATCCGCTGGG GTTGCTGGGC GGGGAGCAGA CTTACCGGTA TGTGCCGAAT
CCTTGTGGGT GGGTTGATCC GCTGAGATTG GCTGCAAGTT CTAAAATCAG CAGTTTGATG
GACTATATTG GCGATGGTCG TCGTGTTAGT GGGCATACGG GTTTCCTGGA TGGGGTTCGT
TTATCACGTA GTCAAATAAA CAATATTGCT AAAGAAATGG AGAAGCTAGG AATTAAAGTA
ATAAGGAAAG CAGATAAATA TTTGCCACCA AATGCTAGGG CAGCTTTTGA TTATGGCCTT
CGCAATATTT ATCTTAGGAA AAATGCTACC TTATATGAGG TGTATCATGA AGTGATTCAT
GCTAAGCAAT TTGCGAAAAT TGGACGAGAA GCATACGAAG CACTAGGACG TTTATCTAGG
GAGAAACACG TTCTAAATGA AATATTAAAA AGTAAAAATT TATTCAATGA AGCGGAAATA
GCTCATGCCA TAAAATATGT TGAGGGATTG AGAGAAAAAT TCATGATGGG ACTAACAAAT
TGA
 
Protein sequence
MLSQQAMVAS QMKRAANDKA IAQMLASKKS GPPAARLGDE IQHKSFLGAQ AGAVLGAIVT 
IAEGCLIMAA CATGPYALVL VPALMYASYK ASDYVEEKQN QLESWINSFC DTDGAINTGS
ENVKINGELA ARAAVTLPPP PPPGAIPEVP QGEPSWGDIA TDLLESAAEK AVPLAKAWEN
AVITLTDSNA GFMDRVSAGA SLLFPAGPVL MEFATMVGGR GEIKKEVDFP EAGEDTALCD
KENKPPRIAQ GSSNVFINNQ PAARKGDKLE CSAAIVGGSP DVFIGGEQVT YLDIQPEFPP
WQRMILGGIT IASYLLPPAG LLGKLKNLAR LGKLGNLLGK SGKLLGAKLG ALLGKTGKSL
KSIANKVIRW VTDPVDPVTG AYCDERTDFT LGQTLPLSFT RFHSSVLPLH GLTGVGWSDS
WSEYAWVREQ GNRVDIISQG ATLRFAFDGD SDTTVNPYHA QYILRRRDDY LELFDRDALS
SRFFYDAFPG MRLRHPVTDD TSDDRLAHSP NDRMYMLGGM SDTASNRITF ERDSQYRITG
VSHTDGIRLK LTYHASGYLK AIHRTDNGIQ TLATYEQDAR GRLTEADARL DYHLFYEYDA
ADRIIRWSDN DQTWSRFTYD EQGRCVNVTG AEGYYNATLD YGDGCTTVTD GKGTHRYYYD
PDGNILREEA PDGSTTTYEW DEFHHLLARH SPAGRVEKFE YNAALGQLSR YTAADGAEWL
YRYDERGLLS NITDPAGQTW TQQCDERGLP VSLVSPQGEE TRLAYTAQGL LSGIFRQDER
RLGIEYDHHN RPETLTDVMG REHHTEYSGH DLPVKMRGPG GQSVRLQWQQ HHKLSGIERA
GTGAEGFRYD RHGNLLAYTD GNGVVWTMEY GPFDLPVART DGEGHRWQYR YDKDTLQLTE
VINPQGESYR YILDNCGRVT EERDWGGVVW RYRYDADGLC TARVNGLEET ILYSRDAAGR
LAEIITPEGK TQYAYDKSGR LTGIFSPDGI SQRTGYDERG RVNVTTQGRR AIEYHHPDEH
TVIRCILPPE DERDRHPDES LLKTTYRYNA AGELTEIILP GDETLTFSRD EAGREVLRHS
NRGFACEQGW NAAGQPVSQR AGFFPAEATW GGLVPSLVRE YRYDSAGNVS GVTSREDYGR
ETRREYRLDR NGQVTAVTAS GTGLGYGEGD ESYGYDSCGY LKAQSAGRHR ISEETDQYAG
GHRLKQAGNT QYDYDAAGRM VSRTKHRDGY RPETERFRWD SLDQLTGYCS AQGEQWEYRY
DASGRRTEKR CDRKKIRFTY LWDGDSIAEI REYRDDKLYS VRHLVFNGFE LISQQFSRVR
QPHPSVAPQW VTRTNHAVSD LTGRPLMLFN SEGKTVWRPG QTSLWGLALS LPADTGYPDP
RGELDPEADP GLLYAGQWQD AESGLCYNRF RYYEPETGMY LVSDPLGLLG GEQTYRYVPN
PCGWVDPLRL AASSKISSLM DYIGDGRRVS GHTGFLDGVR LSRSQINNIA KEMEKLGIKV
IRKADKYLPP NARAAFDYGL RNIYLRKNAT LYEVYHEVIH AKQFAKIGRE AYEALGRLSR
EKHVLNEILK SKNLFNEAEI AHAIKYVEGL REKFMMGLTN