Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4668 |
Symbol | |
ID | 6969052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4309789 |
End bp | 4311879 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388372 |
Product | integral membrane protein, YccS/YhfK family |
Protein accession | YP_002272800 |
Protein GI | 209397460 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01667] integral membrane protein, YccS/YhfK family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0557749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.293941 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCGCA GACTGATTTA TCACCCCGAT ATTAACTATG CACTGCGACA AACGCTGGTG CTATGTTTGC CCGTGGCCGT TGGGTTAATG CTTGGCGAAT TACGATTCGG TCTGCTCTTC TCTCTCGTTC CTGCCTGTTG CAATATTGCG GGCCTTGATA CGCCTCATAA ACGTTTTTTC AAACGCTTAA TCATTGGTGC GTCGCTGTTT GCCACCTGTA GCTTACTGAC ACAGCTGCTA CTGGCAAAAG ATGTTCCCCT GCCTTTTTTG CTGACCGGAT TAACGCTGGT ACTTGGCGTC ACTGCTGAGT TGGGGCCATT GCACGCAAAA TTGCTTCCCG CATCGCTGCT GGCAGCCATT TTTACCCTCA GTCTGGCGGG ATACATGCCG GTCTGGGAAC CGTTGCTCAT CTATGCGTTG GGCACTCTCT GGTACGGATT GTTTAACTGG TTTTGGTTCT GGATCTGGCG CGAACAACCG CTGCGCGAGT CACTAAGTCT GCTGTACCGT GAACTGGCAG ATTATTGTGA AGCCAAATAC AGCCTGCTTA CCCAGCACAC CGACCCTGAA AAAGCGCTGC CGCCGCTGCT GGTGCGCCAG CAAAAAGCGG TCGATCTAAT TACCCAGTGC TATCAGCAAA TGCATATGCT TTCCGCGCAA AATAATACTG ACTACAAGCG GATGCTGCGT ATTTTCCAGG AGGCGCTGGA TTTACAGGAA CATATTTCGG TCAGTTTGCA TCAGCCGGAA GAGGTGCAAA AGCTGGTCGA GCGTAGCCAT GCGGAAGAAG TTATCCGCTG GAATGCGCAA ACCGTCGCCG CTCGCCTGCG CGTGCTGGCT GATGACATTC TTTACCATCG CCTGCCGACG CGTTTTACGA TGGAAAAGCA AATTGGCGCA CTGGAAAAAA TCGCCCGCCA GCATCCGGAT AATCCGGTTG GGCAATTCTG CTATTGGCAC TTCAGCCGCA TCGCCCGCGT GCTGCGCACC CAAAAACCGC TCTATGCCCG TGACTTACTG GCCGATAAAC AGCGGCGAAT GCCGTTACTT CCGGCGCTGA AAAGTTATTT GTCACTAAAG TCTCCGGCGC TACGCAATGC CGGACGACTC AGTGTGATGT TAAGCGTTGC CAGCTTGATG GGCACCGCGC TGCATCTGCC GAAGTCGTAC TGGATCCTGA TGACGGTATT GCTGGTGACG CAAAATGGCT ATGGCGCAAC CCGTCTGAGG ATTGTGAATC GCTCCGTGGG AACCGTGGTC GGGTTAATCA TTGCGGGCGT GGCGCTGCAC TTTAAAATTC CCGAAGGTTA CACCCTGACC TTGATGCTGA TTACCACCCT TGCCAGCTAC CTGATATTGC GCAAAAACTA CGGCTGGGCG ACGGTCGGTT TTACTATTAC CGCAGTGTAT ACCCTGCAAC TGTTGTGGCT GAACGGTGAG CAGTACATCC TTCCGCGTCT TATCGATACC ATTATTGGTT GTTTAATTGC TTTCGGTGGT ACCGTCTGGC TGTGGCCGCA GTGGCAGAGC GGGTTATTGC GTAAAAACGC CCATGACGCT TTAGAAGCCT ATCAGGAAGC GATTCGCTTG ATTCTGAGCG AGGATCCGCA ACCAACGCCA CTGGCCTGGC AGCGAATGCG GGTAAATCAG GCGCATAACA CTCTGTATAA CTCATTGAAT CAGGCGATGC AGGAACCAGC GTTCAACAGC CATTATCTGG CAGATATGAA ACTGTGGGTA ACGCACAGTC AGTTTATTGT TGAACATATT AATGCCATGA CCACGCTGGC GCGGGAACAC CGGGCATTGC CACCTGAACT GGCACAAGAG TATTTACAGT CTTGTGAAAT CGCCATTCAG CGTTGTCAGC AGCGACTGGA GTATGACGAA CCGGGTAGTT CTGGCGATGC CAATATCATG GATGCGCCGG AGATGCAGCC GCACGAAGGC GCGGCAGGTA CGCTGGAGCA GCATTTACAG CGGGTTATTG GTCATCTGAA CACCATGCAC ACCATTTCGT CGATGGCATG GCGTCAGCGA CCGCATCACG GGATTTGGCT GAGTCGCAAG TTGCGGGATT CGAAGGCGTA A
|
Protein sequence | MWRRLIYHPD INYALRQTLV LCLPVAVGLM LGELRFGLLF SLVPACCNIA GLDTPHKRFF KRLIIGASLF ATCSLLTQLL LAKDVPLPFL LTGLTLVLGV TAELGPLHAK LLPASLLAAI FTLSLAGYMP VWEPLLIYAL GTLWYGLFNW FWFWIWREQP LRESLSLLYR ELADYCEAKY SLLTQHTDPE KALPPLLVRQ QKAVDLITQC YQQMHMLSAQ NNTDYKRMLR IFQEALDLQE HISVSLHQPE EVQKLVERSH AEEVIRWNAQ TVAARLRVLA DDILYHRLPT RFTMEKQIGA LEKIARQHPD NPVGQFCYWH FSRIARVLRT QKPLYARDLL ADKQRRMPLL PALKSYLSLK SPALRNAGRL SVMLSVASLM GTALHLPKSY WILMTVLLVT QNGYGATRLR IVNRSVGTVV GLIIAGVALH FKIPEGYTLT LMLITTLASY LILRKNYGWA TVGFTITAVY TLQLLWLNGE QYILPRLIDT IIGCLIAFGG TVWLWPQWQS GLLRKNAHDA LEAYQEAIRL ILSEDPQPTP LAWQRMRVNQ AHNTLYNSLN QAMQEPAFNS HYLADMKLWV THSQFIVEHI NAMTTLAREH RALPPELAQE YLQSCEIAIQ RCQQRLEYDE PGSSGDANIM DAPEMQPHEG AAGTLEQHLQ RVIGHLNTMH TISSMAWRQR PHHGIWLSRK LRDSKA
|
| |