Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0120 |
Symbol | |
ID | 6068552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 128509 |
End bp | 132642 |
Gene Length | 4134 bp |
Protein Length | 1377 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641599522 |
Product | YD repeat-containing protein |
Protein accession | YP_001723131 |
Protein GI | 170018177 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAA AACCGGCAGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT CAGGGTTCAG CCGGGGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC GGCGGAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTGTGG CTGGTGCGCG GCGGCGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG GGGCCGTGGT GGCTGCTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT GCCTGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCACGCGGC GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGG ACCGCCGTGA AGTGCTGCAC ACGCAGGGCG AAGCCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC ACGCAGAGTC AGTTTGACGC CGTGGGCAGG CTCAGGGCAC AGACGGATGC CGCAGGCAGG ACAACAGAGT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATAAC CACGCCGGAT GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT GACGGGCTGG AAATACGCCG GGAATATGAT GAATTGGGCC GTCTGATTCA GGAAACTGCC CCTGACGGCG ATATCACCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA ACGGAAGATG CCACCGGCAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CATCGCCGGT GACCTGACCG CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG GGAAAGGCCG TCCGTACCAC GCAGGGCGGG CTGACCCGTA GTATGGAATA CGATGCTGCC GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA CTCGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGGCTGG TCACCCACTG GTACTATGAC GAAGCAGACC GCCTCACTCA CCGCACCGTG AAGGGTGAAA CCGCAGAGCA GTGGCAGTAT GACGAACGCG GCTGGCTGAC AGACATCAGC CATATCAGCG AAGGGCACCG GGTGGCGGTG CACTATGGTT ATGACAGTAA AGGCCGCCTC GCCAGTGAAC ACCTGACGGT GCATCATCCG CAGACGAATG AACTGCTCTG GCAGCATGAG ACCAGACATG CGTACAACGC ACAGGGACTG GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGACCTA CGGCAGCGGC TGGCTGGCAG GCATGAAGCT CGGCGACACA CCGCTGGTGG ATTTCACCCG CGACCGCCTG CACCGGGAAA CGCTGCGCAG CTTCGGCCGT TATGAACTCA CCACCGCTTA TACCCCTGCC GGGCAGTTAC AGAGCCAGCA CCTGAACAGC CTGCTGTCTG ACCGCGATTA CACCTGGAAC GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC ACCACCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ATCTGGATAT CCGCATCCCG TATGCCACAG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CCGGTATGAC CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGTTAT CCGCACGGAT GATGAGCGGA CTCACCGGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG ACACAATATG AAGAGCCGCT GGTCGAAAGT CGCTATCTTT ACGACCCGCT GGGCCGCAGG GTGGCAAAAC GGGTGTGGCG GCGTGAACGG GACCTGACGG GCTGGATGTC GCTGTCACGG AAACCGCAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACGAT ACAGAACGAC AGGAGCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGGGTCGAA ACTGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATGC CCTTCAGCAG TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA TCGTGCGGCC TGACCGTGGA GCAGATGCAA AACCAGATGG ACCCGGTGTA CACGCCGGCG CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CGCTGGCCCT TATCAGCAAG GAAGGGACAA CAGAATGGTG CGCAGAATAC GATGAATGGG GCAACCTGCT GAATGAAGAG AACCCGCATC AGCTGCAGCA GCTTATCCGC CTGCCGGGGC AGCAGTATGA TGAGGAGTCC GGCCTGTATT ACAACCGCCA CCGCTATTAT GACCCGCTGC AGGGGCGGTA TATCACTCAG GATCCGATTG GGCTGAAGGG GGGATGGAAT TTTTATCAGT ATCCGTTGAA TCCAGTTACG AATACAGATC CTCTGGGGTT AGAAGTTTTT CCTAGACCAT TCCCCTTGCC AATTCCATGG CCCAAAAGCC CTGCACAGCA GCAAGCAGAT GATAATGCTG CAAAAGCATT GACAAAATGG TGGAACGATA CAGCATCACA AAGAATATTT GACTCTCTAA TATTGAATAA TCCGGGACTA GCATTAGATA TAACAATGAT AGCTTCTCGT GGAAATGTTG CAGACACAGG GATAACTGAT CGTGTCAATG ACATAATAAA TGACAGATTC TGGAGTGATG GGAAAAAACC CGACAGATGT GACGTACTTC AGGAACTAAT TGATTGTGGT GATATTAGTG CTAAAGATGC AAAAAGCACA CAGAAAGCCT GGAATTGTCG TCACTCCAGA CAGTCAAACG ATAAAAAAAG ATAG
|
Protein sequence | MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG AWRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLDRREVLH TQGEAGLKRV VKKEHADGSV TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP DGLEIRREYD ELGRLIQETA PDGDITRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNIAG DLTAVIAPDG SRNGTQYDAW GKAVRTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWYYD EADRLTHRTV KGETAEQWQY DERGWLTDIS HISEGHRVAV HYGYDSKGRL ASEHLTVHHP QTNELLWQHE TRHAYNAQGL ANRCIPDSLP AVEWLTYGSG WLAGMKLGDT PLVDFTRDRL HRETLRSFGR YELTTAYTPA GQLQSQHLNS LLSDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGVIRTD DERTHRYHYD SQHRLVHYTR TQYEEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR KPQVTWYGWD GDRLTTIQND RSRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADALQQ SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVEQMQ NQMDPVYTPA RKIHLYHCDH RGLPLALISK EGTTEWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN FYQYPLNPVT NTDPLGLEVF PRPFPLPIPW PKSPAQQQAD DNAAKALTKW WNDTASQRIF DSLILNNPGL ALDITMIASR GNVADTGITD RVNDIINDRF WSDGKKPDRC DVLQELIDCG DISAKDAKST QKAWNCRHSR QSNDKKR
|
| |