Gene EcolC_3079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3079 
Symbol 
ID6066190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3367048 
End bp3371565 
Gene Length4518 bp 
Protein Length1505 aa 
Translation table11 
GC content60% 
IMG OID641602495 
ProductYD repeat-containing protein 
Protein accessionYP_001726030 
Protein GI170021076 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00723203 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAG GACCAGGCGG GCCACAGGGA GCGACCGCAG GCGGTACGCT GGCAATGCGA 
ATGCTGTCAC AGCAGGCGAT GGTCGCCAGC CAGATGAAAC GGGCAGCCAA CGACAAAGCC
ATTGCACAGA TGCTGGCAGC AAAGAAGTCC GGCCCACCTG CCGCCAGGCT GGGCGATGAA
ATTCAGCATA AAAGTTTTCT GGGGGCACTG GCAGGGGCCG TGCTGGGGGC GATAGTGACC
ATCGCAGAAG GTTGCCTGAT TATGGCCGCC TGCGCCACCG GCCCTTATGC GCTGGTTCTG
GTGCCTGCGC TGATGTATGC CAGCTATAAG GCGAGTGATT ATGTGGAGGA GAAACAGAAC
CAGCTTGAAT CATGGATAAA CAGCTTTTGT GACACGGACG GCGCCATCAA TACCGGTTCT
GAAAATGTAA ACATTAACGG AAAGCCCGCT GCAAGGGCCG CCGTCACCCT TCCCCCTCCT
CCCCCACCTG GAGCAATACC TGAAATCCCA CAGGGGGAAC CCTCATGGGG TGATATTGCC
ACTGACCTGC TTGAATCGGC AGCGGAAAAA GCAGTACCAC TGGCGAAGGC CTGGGGGAAC
GCTGTTATCA CCCTGACGGA AAGCAATGCC GGTTTTATGG ATCGCGTCAG CGCCGGCGCA
TCGCTTCTGT TTCCCGCCGG TCCGGTATTA ATGGAGTTTG CCACCATGGT GGGCGGGCGT
GGCGAAATCA AAAAAGATGT GGATTTCCCG GAAGCCGGTG AGGACACGGC GCTCTGCGAC
AAGGAGAACA AACCACCGAG GATAGCCCAG GGCAGTAGCA ACGTCTTTAT CAACAATCAG
CCTGCCGCGC GCAAGGGCGA CAAACTGGAG TGCAGTGCGG CGATCGTGGA AGGTTCGCCG
GACGTCTTTA TTGGGGGTGA GCAGGTCACC TATCTGGATA TCCAGCCGGA GTTCCCGCCA
TGGCAGAGAA TGATCCTGGG AGGAATAACG ATAGCCAGCT ATCTTCTGCC GCCAGCAGGA
CTGCTGGGAA AACTGGGGAA TCTGGCGAAA CTGGGCAAAC TGGGAAACCT GCTGGGGAAA
AGCGGGAAGC TGCTGGGCGC AAAGCTCGGC GCGTTGCTGG GGAAAACAGG TAAGTCGTTA
AAAAGTATTG CCAATAAAGT CATCAGATGG GTAACAGATC CTGTCGATCC GGTAACCGGC
GCGTACTGCG ACGAACGTAC CGACTTCACC CTGGGCCAGA CCCTCCCCCT CTCCTTCACC
CGTTTCCACA GTTCGGTACT GCCACTGCAT GGCCTGACGG GCGTGGGCTG GAGTGACTCC
TGGAGCGAAT ACGCCTGGGT GCGTGAACAG GGAAACCGGG TGGATATCAT CAGCCTGGGA
GCCACGCTGA ACTTCGCCTT CGACGGTGAA AGTGATACGG CGGTTAACCC GTATCACGCC
CAGTACATTC TGCGCCGCCG TGATGATTAT CTGGAGCTGT TCGACAGGGA TGCACTGAGC
AGCCGCTTCT TTTATGACGC CTTTCCGGGA ATGCGTCTGC GCCACCCGGT GACTGACGAT
ACCAGCGATG ACCGCCTGGC ACACAGCCCC GCAGACCGGA TGTACATGCT GGGCGGGATG
AGCGACACCG CCAGCAACCG CATCACGTTT GAGCGCGACA GCCAGTACCG GATCACGGGT
GTCAGTCACA CCGACGGGAT CCGGCTTAAA CTGACGTACC ACGCCAGCGG CTACCTGAAA
GCCATTCACC GCACGGATAA CGGCATACAG ACGCTGGCGA CCTACGAACA GGATGCGCGG
GGGCGGCTGA CAGAAGCGGA TGCGCGGCTG GACTACCACC TGTTTTATGA GTACGACGCT
GCGGACCGGA TCATCCGCTG GTCCGATAAC GACCAGACGT GGAGCCGTTT CACCTACGAT
GCACAGGGCC GGTGCGTGAC CGTCACCGGG GCGGAGGGCT ATTACAACGC CACGCTGGAC
TATGGTGACG GCTGCACCAC CGTGACGGAC GGCAAGGGCA TTCACTGTTA TTACTATGAT
CCTGACGGCA ATATTCTGCG GGAAGCAGCG CCGGACGGCA GTACCACCAC GTATGAATGG
GATGAATTCC ATCACCTGCT GGCCCGCCAC TCCCCTGCCG GACGGGTGGA GAAATTTGAA
TACAACGCCG CACACGGTCA GTTAAGCCGT TATACGGCGG CAGACGGCGC GGAGTGGCAG
TACCGCTATG ATGAGCGCGG CCTGCTCAGC AACATCACCG ACCCTGCCGG ACAGACGTGG
ACACAGCAGT GCGATGAACG CGGCCTGCCG GTGAGTCTGG TATCGCCACA GGGCGAAGAG
ACCCGGCTGG CGTACACCGC TCAGGGGCTG CTATCGGGGA TATTCCGCCA GGATGAACGG
CGTCTGGGCA TAGAGTACGA CCACCACAAC CGGCCGGAAA CACTCACCGA CGTGATGGGC
CGTGAACACC ACACCGAATA CAGCGGTCAC GACCTGCCGG TGAAGATGCG CGGCCCCGGC
GGTCAGTCAG TGCGGTTGCA GTGGCAGCAG CACCATAAAC TGAGTGGCAT TGAGCGGGCA
GAAACCGGCG CAGAAGGATT CCGCTATGAC CGCCACGGCA ACCTGCTGGC GTACACGGAC
GGTAACGGCG TTGTCTGGAC AATGGAATAC GGCCCGTTCG ATTTGCCGGT GGCGCGAACG
GACGGTGAAG GCCACCGCTG GCAGTACCGC TACGATAAAG ACACGCTGCA GCTCACAGAA
GTCATTAACC CGCAGGGCGA GTCATACCGT TATATTCTGG ACAACTGTGG CCGGGTGACG
GAAGAGCGTG ACTGGGGCGG CGTGGTCTGG CGTTACCGCT ATGACGCTGA TGGCCTGTGT
ACCGCCAGGG TCAACGGCCT GGAGGAAACC ATCCTCTACA GCCGGGACGC CGCAGGCCGC
CTGGCAGAAG TCATCACGCC GGAAGGCAAA ACGCAGTATG CCTATGACAA ATCCGGCAGG
CTGACGGGTA TCTTCAGCCC GGACGGTACA TCACAGCGCA CCGGCTATGA CGAACGCGGG
CGGGTGAATG TCACCACTCA GGGCCGACGG GCCATTGAAT ACCACTACCC CGATGAACAC
ACCGTTATCC GCTGTATCCT GCCACCGGAA GATGAACGCG ACAGACACCC CGATGAATCC
CTGCTGAAAA CCACGTACCG TTATAACGCC GCCGGAGAAC TGACGGAGGT CATTCTGCCG
GGGGATGAGA CGCTGACGTT CAGCCGTGAT GAGGCGGGAC GTGAAGTGTT CCGGCACAGT
AACCGGGGTT TTGCCTGTGA GCAGGGCTGG AATGCAGCCA GCCAGCTTGT CACCCAGCGC
GCCGGATTTT TCCCGGAGGA AACCACATGG GGCGGGCTGC TCCCCTCACT GGTACGGGAG
TACCGTTACG ACAGCGCGGG CAATGTGTCG GCTGTCACCA GCCGGGAAGA TTACGGACGG
GAAACACGGC GGGAATACCG GCTGGACCGG AACGGTCAGG TCACGGCGGT GACAGCCTCA
GGCACCGGGC TGGGCTATGG CGAAGGCGAT GAGTCCTATG GCTATGACAG TTGTGGCTAC
CTGAAGGCGC AGTCTGCGGG CAGGCACCGG ATAAGTGAAG AGACTGAGCG GTATGCCGGA
GGCCACCGGC TGAAACAGGC CGGAAACATG CAGTATGACT ATGACGCCGC AGGCCGGATG
GTCAGCCGGA CAAAACACCG TGACGGCTAC CGCCCGGAAA CAGAGCGGTT CCGGTGGGAC
AGCCGGGACC AGCTGACCGG GTATTGCAGC GCACAGGGTG AGCAGTGGGA ATACCGCCAC
GACGCCAGCG GCAGACGAAC GGAAAAACGC TGCGACCGGA AGAAAATCCG TTTTACGTAC
CTGTGGGACG GCGACAGTAT TGGGGAAATC CGGGAATACC GCGATGATAA ACTGTACAGC
GTACGGCACC TGGTGTTTAA CAGCTTTGAG CTGATAAGCC AGCAGTTCAG CCGGGTACGA
CAGCCGCACC CGTCCGTGGC CCCGCAGTGG GTGACGCGGA CGAATCATGC GGTGAACGAC
CTGACGGGCC GTCCGCTGAT GCTCTTTAAC AGTGAAGGTA AAACCGTCTG GCGACCGGGA
CAGACCAGCC TGTGGGGGCT GGCACTCAGC CTGCCCGCAG ACACCGGCTA CCCGGACCCG
CGCGGGGAAC TGGACCCGGA AGCCGACCCC GGCCTGCTGT ATGCGGGACA GTGGCGGGAT
GGAGAATCAG GGCTGTGCTA TAACCGGTTC CGGTATTACG AGCCGGAAAC CGGGATGTAC
CTGGTGAGTG ATCCACTGGG GTTGCAGGGA GGGGAGCAGA CTTACCAGTA TGTGCCGAAT
CCTTTAAGAT GGATAGATCC CTTAGGATTA AATAAAGGAG CTTCATTATC TAAAATGATG
AATAGCTCCA GTGATCTCAT GGGGTTGAGA AGGCAGCCCC AGAACTTCTG GCGGCTATAT
CGCGGAAAAG ACATTTAA
 
Protein sequence
MSEGPGGPQG ATAGGTLAMR MLSQQAMVAS QMKRAANDKA IAQMLAAKKS GPPAARLGDE 
IQHKSFLGAL AGAVLGAIVT IAEGCLIMAA CATGPYALVL VPALMYASYK ASDYVEEKQN
QLESWINSFC DTDGAINTGS ENVNINGKPA ARAAVTLPPP PPPGAIPEIP QGEPSWGDIA
TDLLESAAEK AVPLAKAWGN AVITLTESNA GFMDRVSAGA SLLFPAGPVL MEFATMVGGR
GEIKKDVDFP EAGEDTALCD KENKPPRIAQ GSSNVFINNQ PAARKGDKLE CSAAIVEGSP
DVFIGGEQVT YLDIQPEFPP WQRMILGGIT IASYLLPPAG LLGKLGNLAK LGKLGNLLGK
SGKLLGAKLG ALLGKTGKSL KSIANKVIRW VTDPVDPVTG AYCDERTDFT LGQTLPLSFT
RFHSSVLPLH GLTGVGWSDS WSEYAWVREQ GNRVDIISLG ATLNFAFDGE SDTAVNPYHA
QYILRRRDDY LELFDRDALS SRFFYDAFPG MRLRHPVTDD TSDDRLAHSP ADRMYMLGGM
SDTASNRITF ERDSQYRITG VSHTDGIRLK LTYHASGYLK AIHRTDNGIQ TLATYEQDAR
GRLTEADARL DYHLFYEYDA ADRIIRWSDN DQTWSRFTYD AQGRCVTVTG AEGYYNATLD
YGDGCTTVTD GKGIHCYYYD PDGNILREAA PDGSTTTYEW DEFHHLLARH SPAGRVEKFE
YNAAHGQLSR YTAADGAEWQ YRYDERGLLS NITDPAGQTW TQQCDERGLP VSLVSPQGEE
TRLAYTAQGL LSGIFRQDER RLGIEYDHHN RPETLTDVMG REHHTEYSGH DLPVKMRGPG
GQSVRLQWQQ HHKLSGIERA ETGAEGFRYD RHGNLLAYTD GNGVVWTMEY GPFDLPVART
DGEGHRWQYR YDKDTLQLTE VINPQGESYR YILDNCGRVT EERDWGGVVW RYRYDADGLC
TARVNGLEET ILYSRDAAGR LAEVITPEGK TQYAYDKSGR LTGIFSPDGT SQRTGYDERG
RVNVTTQGRR AIEYHYPDEH TVIRCILPPE DERDRHPDES LLKTTYRYNA AGELTEVILP
GDETLTFSRD EAGREVFRHS NRGFACEQGW NAASQLVTQR AGFFPEETTW GGLLPSLVRE
YRYDSAGNVS AVTSREDYGR ETRREYRLDR NGQVTAVTAS GTGLGYGEGD ESYGYDSCGY
LKAQSAGRHR ISEETERYAG GHRLKQAGNM QYDYDAAGRM VSRTKHRDGY RPETERFRWD
SRDQLTGYCS AQGEQWEYRH DASGRRTEKR CDRKKIRFTY LWDGDSIGEI REYRDDKLYS
VRHLVFNSFE LISQQFSRVR QPHPSVAPQW VTRTNHAVND LTGRPLMLFN SEGKTVWRPG
QTSLWGLALS LPADTGYPDP RGELDPEADP GLLYAGQWRD GESGLCYNRF RYYEPETGMY
LVSDPLGLQG GEQTYQYVPN PLRWIDPLGL NKGASLSKMM NSSSDLMGLR RQPQNFWRLY
RGKDI