Gene Information |
Locus tag | ECH74115_4809 |
Symbol | |
ID | 6971264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4444161 |
End bp | 4446479 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643388501 |
Product | hypothetical protein |
Protein accession | YP_002272929 |
Protein GI | 209396647 |
COG category | [R] General function prediction only |
COG ID | [COG4258] Predicted exporter |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
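
The coordinate and length fields above agree with each other if one assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. A minimal sketch of that arithmetic follows, with the values copied from the record; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(4444161, 4446479)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2319 772
```

The result matches the Gene Length (2319 bp) and Protein Length (772 aa) fields, which is what one would expect for a gene that is neither spliced nor a pseudogene.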
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACG CCAACGTTTT GCCGCCCAGT AAACGCCCCG CGCTGTTATG GGGGCTAGTC TGCCTGGTCA TGGCGGTGGC GTTGCTGATC CTGCTGCCGC AATCACGGCT GAACAGTAGC GTGCTGGCTA TGTTACCCAA ACAGACGATG GGCGATATTC CTCCGGCGCT GAATGACGGC TTTATGCAGC GTCTTGACCG CCAACTGGTG TGGCTGGTCA GCCCCGGTAA AGAGGCTAAT CCTTTGGTCG CTCAGGAGTG GCTGACGCTG CTGCAAAAAT CCGCTGCGCT CGGCGACGTT AAAGGGCCAA TGGATGCCGC CAGCCAGCAG GCGTGGGGAG CGTTTTTCTG GCAGCATCGC AACGGCCTGA TTGACCCCAA CACCCGCGCC CGCCTGCAAA ACGGCGGCGA AGCGCAGGCA CAGTGGATCC TCTCCCAGCT TTATTCCGCA TTCTCCGGCG TAAGCGGCAA GGAGCTGCAA AACGATCCGC TGATGTTAAT GCGCGGCTCG CAGCTGGCAA TGGCGAAAAA CGGCCAGCGT TTGCGGCTGA TGGACGGTTG GCTGGTGACG CAGGATCCCC AGGGCAACTA CTGGTATCTG CTGCACGGCG AACTGGCGGG ATCGTCGTTT GATATGCAGC AAACCCACCA GCTGATCACG ACCCTGAATA CGCTGGAAAA GGATCTGAAA ACGCGTTACC CGCAGGCACA GTTGCTCTCG CGCGGCACGG TGTTTTACAG CGATTACGCC AGCCAACAGG CGAAGCAGGA TATCTCCACC CTGGGCGTGG CTACGCTGCT GGGGGTGATA TTGCTGATTG TGGCGGTGTT CCGCTCTTTA CGCCCGTTGC TGCTTTGCGT GATTTCCATC GGCATCGGCG CGCTGGCGGG AACGGTCGCC ACTTTATTGA TTTTCGGTGA ATTACACCTG ATGACGCTGG TGATGAGCAT GAGCGTTATC GGCATTTCCG CTGACTACAC GCTCTATTAT CTCACCGAGC GGATGGTTCA CGGCAACGAC GTTTCGCCGT GGCAAAGCCT GGCGAAAGTA CGCAATGCCT TGCTACTGGC GCTGCTCACC ACCGTGGCGG CGTATCTGAT TATGATGCTC GCCCCCTTCC CCGGCATTCG CCAGATGGCG ATTTTTGCCG CCGTCGGGTT GAGCGCCTCC TGTCTGACCG TCCTGTTCTG GCATCCGTGG CTGTGCCGTG GCCTGCCGGT GCGTCCGGTT CCGGCGATGG CGCTGATGCT ACGCTGGCTG GCAGCGTGGC GGCGTAATAA AAAACTGTCG CTGGGTCTGC CCGTCGCGCT GGCGCTGTTT TCGCTGGCGG GGATGTCAAT GCTACGCGTC GATGACGATA TCTCGCAGTT ACAGGCGCTA CCGCAGCATA TTCTGGCGCA GGAAAAAGCC ATTACCGCCC TGACCGGGCA GAGCGTCGAT CAAAAATGGT TTGTGGTTTA CGGCGATTCG CCACAGCAAA CATTGCGGCG ACTGGAGAAA TATACCGCCT CACTTGAGTA TGCGAAAAAA GAGGGGCTTA TCAGCAACTA CCGCACCATT CCGCTGAACT CCCTTGCGCG GCAGGAGGAA GATTTACAAC TGCTGAAAAC GGCGGCCCCG ACAGTAACAA AAGCGCTGCA AAATGCCGGG CTGACGGCAG TGAACCCGGA TCTCAACGCC ATGCCAGTGA ACGTTGATGA ATGGCTGGCA AGCCCCGCCA GTGAAGGCTG GCGTCTGCTG TGGCTGACGC TGGAAAACGG CGAAAGCGGC GTACTGGTGC CGGTTGAAGG GGTTAAAAGT 
AGCGCGTTGA TGCAGGAAAT CGCCACATAT TACCCTTGCG GCATTGCCTG GGTTGATCGC AAAAGCACCT TTGATGAATT GTTCGCACTT TACCGCTACG TCTTAACCGG CTTGTTGCTG GTGGCGCTGG CAGTGATTGC CTGCGGCGCA GTGGCCCGTC TCGGCTGGCG CAAAGGGCTT ATCAGCCTGG TGCCTTCGGT GCTTTCGCTG GGCTGTGGTC TGGCGGTGCT GGCGATGAGC GGGCAGGCGG TGAATCTCTT TTCGCTGCTG GCGCTGGTGC TGGTGCTTGG CATCGGTATC AACTACACGC TGTTTTTCAG TAATCCGCGC GGTACACCGT TAACTTCGCT ACTGGCGATC GCGCTGGCAA TGCTCACCAC CTTGCTGACG CTGGGTATGC TGGTATTCAG CGCCACCCAG GCCATCAGCA GTTTTGGCAT TGTGCTGGTG AGCGGTATTT TCACCGCCTT CCTGCTTTCG CCGCTGGCTA TGCCCGATAA AAAGAGAACA AAAAAATGA
|
Protein sequence | MTNANVLPPS KRPALLWGLV CLVMAVALLI LLPQSRLNSS VLAMLPKQTM GDIPPALNDG FMQRLDRQLV WLVSPGKEAN PLVAQEWLTL LQKSAALGDV KGPMDAASQQ AWGAFFWQHR NGLIDPNTRA RLQNGGEAQA QWILSQLYSA FSGVSGKELQ NDPLMLMRGS QLAMAKNGQR LRLMDGWLVT QDPQGNYWYL LHGELAGSSF DMQQTHQLIT TLNTLEKDLK TRYPQAQLLS RGTVFYSDYA SQQAKQDIST LGVATLLGVI LLIVAVFRSL RPLLLCVISI GIGALAGTVA TLLIFGELHL MTLVMSMSVI GISADYTLYY LTERMVHGND VSPWQSLAKV RNALLLALLT TVAAYLIMML APFPGIRQMA IFAAVGLSAS CLTVLFWHPW LCRGLPVRPV PAMALMLRWL AAWRRNKKLS LGLPVALALF SLAGMSMLRV DDDISQLQAL PQHILAQEKA ITALTGQSVD QKWFVVYGDS PQQTLRRLEK YTASLEYAKK EGLISNYRTI PLNSLARQEE DLQLLKTAAP TVTKALQNAG LTAVNPDLNA MPVNVDEWLA SPASEGWRLL WLTLENGESG VLVPVEGVKS SALMQEIATY YPCGIAWVDR KSTFDELFAL YRYVLTGLLL VALAVIACGA VARLGWRKGL ISLVPSVLSL GCGLAVLAMS GQAVNLFSLL ALVLVLGIGI NYTLFFSNPR GTPLTSLLAI ALAMLTTLLT LGMLVFSATQ AISSFGIVLV SGIFTAFLLS PLAMPDKKRT KK
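
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The sketch below strips the whitespace and computes a GC percentage. The helper names are hypothetical, and how the database itself derives the GC content field is not stated in the record, so the simple G+C fraction used here is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full Gene sequence field to check it against the reported values.
fragment = clean_sequence("ATGACGAACG CCAACGTTTT")
print(len(fragment), gc_percent(fragment))
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the rounded percentage with the GC content field.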