Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3079 |
Symbol | |
ID | 6066190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3367048 |
End bp | 3371565 |
Gene Length | 4518 bp |
Protein Length | 1505 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641602495 |
Product | YD repeat-containing protein |
Protein accession | YP_001726030 |
Protein GI | 170021076 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00723203 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAG GACCAGGCGG GCCACAGGGA GCGACCGCAG GCGGTACGCT GGCAATGCGA ATGCTGTCAC AGCAGGCGAT GGTCGCCAGC CAGATGAAAC GGGCAGCCAA CGACAAAGCC ATTGCACAGA TGCTGGCAGC AAAGAAGTCC GGCCCACCTG CCGCCAGGCT GGGCGATGAA ATTCAGCATA AAAGTTTTCT GGGGGCACTG GCAGGGGCCG TGCTGGGGGC GATAGTGACC ATCGCAGAAG GTTGCCTGAT TATGGCCGCC TGCGCCACCG GCCCTTATGC GCTGGTTCTG GTGCCTGCGC TGATGTATGC CAGCTATAAG GCGAGTGATT ATGTGGAGGA GAAACAGAAC CAGCTTGAAT CATGGATAAA CAGCTTTTGT GACACGGACG GCGCCATCAA TACCGGTTCT GAAAATGTAA ACATTAACGG AAAGCCCGCT GCAAGGGCCG CCGTCACCCT TCCCCCTCCT CCCCCACCTG GAGCAATACC TGAAATCCCA CAGGGGGAAC CCTCATGGGG TGATATTGCC ACTGACCTGC TTGAATCGGC AGCGGAAAAA GCAGTACCAC TGGCGAAGGC CTGGGGGAAC GCTGTTATCA CCCTGACGGA AAGCAATGCC GGTTTTATGG ATCGCGTCAG CGCCGGCGCA TCGCTTCTGT TTCCCGCCGG TCCGGTATTA ATGGAGTTTG CCACCATGGT GGGCGGGCGT GGCGAAATCA AAAAAGATGT GGATTTCCCG GAAGCCGGTG AGGACACGGC GCTCTGCGAC AAGGAGAACA AACCACCGAG GATAGCCCAG GGCAGTAGCA ACGTCTTTAT CAACAATCAG CCTGCCGCGC GCAAGGGCGA CAAACTGGAG TGCAGTGCGG CGATCGTGGA AGGTTCGCCG GACGTCTTTA TTGGGGGTGA GCAGGTCACC TATCTGGATA TCCAGCCGGA GTTCCCGCCA TGGCAGAGAA TGATCCTGGG AGGAATAACG ATAGCCAGCT ATCTTCTGCC GCCAGCAGGA CTGCTGGGAA AACTGGGGAA TCTGGCGAAA CTGGGCAAAC TGGGAAACCT GCTGGGGAAA AGCGGGAAGC TGCTGGGCGC AAAGCTCGGC GCGTTGCTGG GGAAAACAGG TAAGTCGTTA AAAAGTATTG CCAATAAAGT CATCAGATGG GTAACAGATC CTGTCGATCC GGTAACCGGC GCGTACTGCG ACGAACGTAC CGACTTCACC CTGGGCCAGA CCCTCCCCCT CTCCTTCACC CGTTTCCACA GTTCGGTACT GCCACTGCAT GGCCTGACGG GCGTGGGCTG GAGTGACTCC TGGAGCGAAT ACGCCTGGGT GCGTGAACAG GGAAACCGGG TGGATATCAT CAGCCTGGGA GCCACGCTGA ACTTCGCCTT CGACGGTGAA AGTGATACGG CGGTTAACCC GTATCACGCC CAGTACATTC TGCGCCGCCG TGATGATTAT CTGGAGCTGT TCGACAGGGA TGCACTGAGC AGCCGCTTCT TTTATGACGC CTTTCCGGGA ATGCGTCTGC GCCACCCGGT GACTGACGAT ACCAGCGATG ACCGCCTGGC ACACAGCCCC GCAGACCGGA TGTACATGCT GGGCGGGATG AGCGACACCG CCAGCAACCG CATCACGTTT GAGCGCGACA GCCAGTACCG GATCACGGGT GTCAGTCACA CCGACGGGAT CCGGCTTAAA CTGACGTACC ACGCCAGCGG CTACCTGAAA GCCATTCACC GCACGGATAA CGGCATACAG ACGCTGGCGA CCTACGAACA GGATGCGCGG GGGCGGCTGA CAGAAGCGGA TGCGCGGCTG GACTACCACC TGTTTTATGA GTACGACGCT GCGGACCGGA TCATCCGCTG GTCCGATAAC GACCAGACGT GGAGCCGTTT CACCTACGAT GCACAGGGCC GGTGCGTGAC CGTCACCGGG GCGGAGGGCT ATTACAACGC CACGCTGGAC TATGGTGACG GCTGCACCAC CGTGACGGAC GGCAAGGGCA TTCACTGTTA TTACTATGAT CCTGACGGCA ATATTCTGCG GGAAGCAGCG CCGGACGGCA GTACCACCAC GTATGAATGG GATGAATTCC ATCACCTGCT GGCCCGCCAC TCCCCTGCCG GACGGGTGGA GAAATTTGAA TACAACGCCG CACACGGTCA GTTAAGCCGT TATACGGCGG CAGACGGCGC GGAGTGGCAG TACCGCTATG ATGAGCGCGG CCTGCTCAGC AACATCACCG ACCCTGCCGG ACAGACGTGG ACACAGCAGT GCGATGAACG CGGCCTGCCG GTGAGTCTGG TATCGCCACA GGGCGAAGAG ACCCGGCTGG CGTACACCGC TCAGGGGCTG CTATCGGGGA TATTCCGCCA GGATGAACGG CGTCTGGGCA TAGAGTACGA CCACCACAAC CGGCCGGAAA CACTCACCGA CGTGATGGGC CGTGAACACC ACACCGAATA CAGCGGTCAC GACCTGCCGG TGAAGATGCG CGGCCCCGGC GGTCAGTCAG TGCGGTTGCA GTGGCAGCAG CACCATAAAC TGAGTGGCAT TGAGCGGGCA GAAACCGGCG CAGAAGGATT CCGCTATGAC CGCCACGGCA ACCTGCTGGC GTACACGGAC GGTAACGGCG TTGTCTGGAC AATGGAATAC GGCCCGTTCG ATTTGCCGGT GGCGCGAACG GACGGTGAAG GCCACCGCTG GCAGTACCGC TACGATAAAG ACACGCTGCA GCTCACAGAA GTCATTAACC CGCAGGGCGA GTCATACCGT TATATTCTGG ACAACTGTGG CCGGGTGACG GAAGAGCGTG ACTGGGGCGG CGTGGTCTGG CGTTACCGCT ATGACGCTGA TGGCCTGTGT ACCGCCAGGG TCAACGGCCT GGAGGAAACC ATCCTCTACA GCCGGGACGC CGCAGGCCGC CTGGCAGAAG TCATCACGCC GGAAGGCAAA ACGCAGTATG CCTATGACAA ATCCGGCAGG CTGACGGGTA TCTTCAGCCC GGACGGTACA TCACAGCGCA CCGGCTATGA CGAACGCGGG CGGGTGAATG TCACCACTCA GGGCCGACGG GCCATTGAAT ACCACTACCC CGATGAACAC ACCGTTATCC GCTGTATCCT GCCACCGGAA GATGAACGCG ACAGACACCC CGATGAATCC CTGCTGAAAA CCACGTACCG TTATAACGCC GCCGGAGAAC TGACGGAGGT CATTCTGCCG GGGGATGAGA CGCTGACGTT CAGCCGTGAT GAGGCGGGAC GTGAAGTGTT CCGGCACAGT AACCGGGGTT TTGCCTGTGA GCAGGGCTGG AATGCAGCCA GCCAGCTTGT CACCCAGCGC GCCGGATTTT TCCCGGAGGA AACCACATGG GGCGGGCTGC TCCCCTCACT GGTACGGGAG TACCGTTACG ACAGCGCGGG CAATGTGTCG GCTGTCACCA GCCGGGAAGA TTACGGACGG GAAACACGGC GGGAATACCG GCTGGACCGG AACGGTCAGG TCACGGCGGT GACAGCCTCA GGCACCGGGC TGGGCTATGG CGAAGGCGAT GAGTCCTATG GCTATGACAG TTGTGGCTAC CTGAAGGCGC AGTCTGCGGG CAGGCACCGG ATAAGTGAAG AGACTGAGCG GTATGCCGGA GGCCACCGGC TGAAACAGGC CGGAAACATG CAGTATGACT ATGACGCCGC AGGCCGGATG GTCAGCCGGA CAAAACACCG TGACGGCTAC CGCCCGGAAA CAGAGCGGTT CCGGTGGGAC AGCCGGGACC AGCTGACCGG GTATTGCAGC GCACAGGGTG AGCAGTGGGA ATACCGCCAC GACGCCAGCG GCAGACGAAC GGAAAAACGC TGCGACCGGA AGAAAATCCG TTTTACGTAC CTGTGGGACG GCGACAGTAT TGGGGAAATC CGGGAATACC GCGATGATAA ACTGTACAGC GTACGGCACC TGGTGTTTAA CAGCTTTGAG CTGATAAGCC AGCAGTTCAG CCGGGTACGA CAGCCGCACC CGTCCGTGGC CCCGCAGTGG GTGACGCGGA CGAATCATGC GGTGAACGAC CTGACGGGCC GTCCGCTGAT GCTCTTTAAC AGTGAAGGTA AAACCGTCTG GCGACCGGGA CAGACCAGCC TGTGGGGGCT GGCACTCAGC CTGCCCGCAG ACACCGGCTA CCCGGACCCG CGCGGGGAAC TGGACCCGGA AGCCGACCCC GGCCTGCTGT ATGCGGGACA GTGGCGGGAT GGAGAATCAG GGCTGTGCTA TAACCGGTTC CGGTATTACG AGCCGGAAAC CGGGATGTAC CTGGTGAGTG ATCCACTGGG GTTGCAGGGA GGGGAGCAGA CTTACCAGTA TGTGCCGAAT CCTTTAAGAT GGATAGATCC CTTAGGATTA AATAAAGGAG CTTCATTATC TAAAATGATG AATAGCTCCA GTGATCTCAT GGGGTTGAGA AGGCAGCCCC AGAACTTCTG GCGGCTATAT CGCGGAAAAG ACATTTAA
|
Protein sequence | MSEGPGGPQG ATAGGTLAMR MLSQQAMVAS QMKRAANDKA IAQMLAAKKS GPPAARLGDE IQHKSFLGAL AGAVLGAIVT IAEGCLIMAA CATGPYALVL VPALMYASYK ASDYVEEKQN QLESWINSFC DTDGAINTGS ENVNINGKPA ARAAVTLPPP PPPGAIPEIP QGEPSWGDIA TDLLESAAEK AVPLAKAWGN AVITLTESNA GFMDRVSAGA SLLFPAGPVL MEFATMVGGR GEIKKDVDFP EAGEDTALCD KENKPPRIAQ GSSNVFINNQ PAARKGDKLE CSAAIVEGSP DVFIGGEQVT YLDIQPEFPP WQRMILGGIT IASYLLPPAG LLGKLGNLAK LGKLGNLLGK SGKLLGAKLG ALLGKTGKSL KSIANKVIRW VTDPVDPVTG AYCDERTDFT LGQTLPLSFT RFHSSVLPLH GLTGVGWSDS WSEYAWVREQ GNRVDIISLG ATLNFAFDGE SDTAVNPYHA QYILRRRDDY LELFDRDALS SRFFYDAFPG MRLRHPVTDD TSDDRLAHSP ADRMYMLGGM SDTASNRITF ERDSQYRITG VSHTDGIRLK LTYHASGYLK AIHRTDNGIQ TLATYEQDAR GRLTEADARL DYHLFYEYDA ADRIIRWSDN DQTWSRFTYD AQGRCVTVTG AEGYYNATLD YGDGCTTVTD GKGIHCYYYD PDGNILREAA PDGSTTTYEW DEFHHLLARH SPAGRVEKFE YNAAHGQLSR YTAADGAEWQ YRYDERGLLS NITDPAGQTW TQQCDERGLP VSLVSPQGEE TRLAYTAQGL LSGIFRQDER RLGIEYDHHN RPETLTDVMG REHHTEYSGH DLPVKMRGPG GQSVRLQWQQ HHKLSGIERA ETGAEGFRYD RHGNLLAYTD GNGVVWTMEY GPFDLPVART DGEGHRWQYR YDKDTLQLTE VINPQGESYR YILDNCGRVT EERDWGGVVW RYRYDADGLC TARVNGLEET ILYSRDAAGR LAEVITPEGK TQYAYDKSGR LTGIFSPDGT SQRTGYDERG RVNVTTQGRR AIEYHYPDEH TVIRCILPPE DERDRHPDES LLKTTYRYNA AGELTEVILP GDETLTFSRD EAGREVFRHS NRGFACEQGW NAASQLVTQR AGFFPEETTW GGLLPSLVRE YRYDSAGNVS AVTSREDYGR ETRREYRLDR NGQVTAVTAS GTGLGYGEGD ESYGYDSCGY LKAQSAGRHR ISEETERYAG GHRLKQAGNM QYDYDAAGRM VSRTKHRDGY RPETERFRWD SRDQLTGYCS AQGEQWEYRH DASGRRTEKR CDRKKIRFTY LWDGDSIGEI REYRDDKLYS VRHLVFNSFE LISQQFSRVR QPHPSVAPQW VTRTNHAVND LTGRPLMLFN SEGKTVWRPG QTSLWGLALS LPADTGYPDP RGELDPEADP GLLYAGQWRD GESGLCYNRF RYYEPETGMY LVSDPLGLQG GEQTYQYVPN PLRWIDPLGL NKGASLSKMM NSSSDLMGLR RQPQNFWRLY RGKDI
|
| |