Gene Information |
Locus tag | EcolC_0234 |
Symbol | |
ID | 6067783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 270033 |
End bp | 274268 |
Gene Length | 4236 bp |
Protein Length | 1411 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641599633 |
Product | YD repeat-containing protein |
Protein accession | YP_001723240 |
Protein GI | 170018286 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR01643] YD repeat (two copies) |
|
|
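The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes a single terminal stop codon; neither convention is stated in the record, though both agree with the listed values. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 270033
end_bp = 274268
gene_length_bp = 4236
protein_length_aa = 1411


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the stop codon encodes no residue.
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, 274268 - 270033 + 1 gives 4236 bp, and 4236 / 3 - 1 gives 1411 residues, matching the Gene Length and Protein Length fields.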
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAA AACCGGCGGC GCGTCAGGGT GACATGACGC AGTATGGCGG TAGCATTGTT CAGGGTTCAG CCGGGGTACG TATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC GGCGGGGTGA CGTCCGGCCA TCCGGTCAAT CCGCTGCTGG GTGCAAAGGT CCTTCCCGGT GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATTC TCTCCCGCAC CTACAGCAGT TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA ATCACTCTGG CTGGTGCGCG GCGGTGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG GCGCTGCCGG AAGAGCTTCG CCTGAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG GGGCCGTGGT GGGTACTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG ACGTTCCACC ACGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT GCCGGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG CAGCAGGCCT CTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCACGCGGC GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGG ACCGCCGTGA AGTGCTGCAC ACGCAGGGCG AAGCCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC ACGCAGAGTC AGTTTGACGC CGTGGGCAGG CTCAGGACAC AGACGGATGC CGCAGGCCGG ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT GACGGGCTGG AAATACGCCG GGAATATGAT GAATTGGGCC GTCTGATTCA GGAAACTGCC CCTGACGGCG ATATCACCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA ACGGAAGATG CCACCGGCAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CATCGCCGGT 
GACCTGACCG CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG GGAAAGGCCG TCCGTACCAC GCAGGGCGGG CTGACCCGTA GTATGGAATA CGATGCTGCC GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA CTCGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGGCTGG TCACCCACTG GTACTATGAC GAAGCAGACC GCCTCACTCA CCGCACCGTG AAGGGTGAAA CCGCAGAGCA GTGGCAGTAT GACGAACGCG GCTGGCTGAC AGACATCAGC CATATCAGCG AAGGGCACCG GGTGGCGGTG CACTATGGTT ATGACAGTAA AGGCCGCCTC GCCAGTGAAC ACCTGACGGT GCATCATCCG CAGACGAATG AACTGCTCTG GCAGCATGAG ACCAGACATG CGTACAACGC ACAGGGACTG GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGACCTA CGGCAGCGGC TGGCTGGCAG GCATGAAGCT CGGCGACACA CCGCTGGTGG ATTTCACCCG CGACCGCCTG CACCGGGAAA CGCTGCGCAG CTTCGGCCGT TATGAACTCA CCACCGCTTA TACCCCTGCC GGGCAGTTAC AGAGCCAGCA CCTGAACAGC CTGCTGTCTG ACCGCGATTA CACCTGGAAC GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC ACCACCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ATCTGGATAT CCGCATCCCG TATGCCACAG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CCGGTATGAC CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGTTAT CCGCACGGAT GATGAGCGCA CCCACCAGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG ACACAATATG CAGAGCCGCT GGTCGAAAGT CGCTATCTTT ACGACCCGCT GGGCCGCAGG GTGGCAAAAC GGGTATGGCG ACGTGAACGG GACCTGACGG GCTGGATGTC GCTGTCACGG AAACCGGAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACGAT ACAGAACGAC AGGAGCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGGGTCGAA ACTGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATGC CCTTCAGCAG TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA TCGTGCGGCC TGACCGTGGA GCAGATGCAA AACCAGATGG ACCCGGTGTA CACGCCGGCG CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CGCTGGCCCT TATCAGCAAG GAAGGGACAA CAGAATGGTG CGCAGAATAC GATGAATGGG GCAACCTGCT GAATGAAGAG AACCCGCATC AGCTGCAGCA GCTTATCCGC CTGCCGGGGC AGCAGTATGA TGAGGAGTCC GGCCTGTATT 
ACAACCGCCA CCGCTATTAT GACCCGCTGC AGGGGCGATA TATCACTCAG GATCCGATTG GACTGAAGGG GGGATGGAAC CTGTATGGAT ATCAATTGAA TCCGATATCA GACATCGACC CCCTGGGTTT ATCTATGTGG GAGGATGCAA AATCGGGGGC ATGTACTAAT GGTCTTTGCG GCACACTATC CGCTATGATA GGTCCAGATA AATTTGATTC TATAGATAGC ACCGCATATG ACGCCTTAAA TAAAATAAAT AGCCAATCTA TTTGCGAAGA TAAAGAGTTC GCTGGTTTAA TATGTAAGGA TAATAGTGGC AGATATTTCT CAACAGCACC TAACCGAGGA GAAAGAAAAG GATCATATCC ATTCAATAGC CCTTGCCCTA ATGGTACTGA GAAAGTATCA GCTTATCATA CTCATGGTGC AGATAGTCAT GGAGAATATT GGGACGAAAT ATTTTCAGGT AAAGATGAGA AAATAGTTAA AAGTAAAGAT AACAATATCA AGTCATTTTA TTTAGGTACG CCCAGTGGTA ATTTTAAAGC AATAGATAAC CACGGGAAGG AAATAACAAA CAGAAAAGGA TTACCTAATG TCTGCAGAGT TCATGGTAAT ATGTAA
|
Protein sequence | MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ GPWWVLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHHEAAGEF SGEITGVTDG AGRHFRLVLT TQAQRAEEAR QQASSGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLDRREVLH TQGEAGLKRV VKKEHADGSV TQSQFDAVGR LRTQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP DGLEIRREYD ELGRLIQETA PDGDITRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNIAG DLTAVIAPDG SRNGTQYDAW GKAVRTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWYYD EADRLTHRTV KGETAEQWQY DERGWLTDIS HISEGHRVAV HYGYDSKGRL ASEHLTVHHP QTNELLWQHE TRHAYNAQGL ANRCIPDSLP AVEWLTYGSG WLAGMKLGDT PLVDFTRDRL HRETLRSFGR YELTTAYTPA GQLQSQHLNS LLSDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGVIRTD DERTHQYHYD SQHRLVHYTR TQYAEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR KPEVTWYGWD GDRLTTIQND RSRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADALQQ SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVEQMQ NQMDPVYTPA RKIHLYHCDH RGLPLALISK EGTTEWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN LYGYQLNPIS DIDPLGLSMW EDAKSGACTN GLCGTLSAMI GPDKFDSIDS TAYDALNKIN SQSICEDKEF AGLICKDNSG RYFSTAPNRG ERKGSYPFNS PCPNGTEKVS AYHTHGADSH GEYWDEIFSG KDEKIVKSKD NNIKSFYLGT PSGNFKAIDN HGKEITNRKG LPNVCRVHGN M
|
| |