Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0111 |
Symbol | guaC |
ID | 6970056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 117914 |
End bp | 118957 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643384188 |
Product | guanosine 5'-monophosphate oxidoreductase |
Protein accession | YP_002268711 |
Protein GI | 209397931 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01305] guanosine monophosphate reductase, eukaryotic |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000028675 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATTG AAGAAGATCT GAAGTTAGGT TTTAAAGACG TTCTCATCCG CCCTAAACGC TCCACTCTTA AAAGCCGTTC CGATGTTGAA CTGGAACGTC AATTCACCTT CAAACATTCA GGTCAGAGCT GGTCCGGCGT GCCGATTATC GCCGCAAATA TGGACACCGT AGGCACATTT TCTATGGCCT CTGCGCTGGC TTCTTTTGAT ATTTTGACTG CTGTGCATAA ACACTATTCT GTCGAAGAGT GGCAAGCGTT TATCAACAAT TCTTCCGCTG ATGTGCTGAA ACATGTGATG GTTTCTACCG GTACGTCTGA TGCGGATTTC GAAAAAACTA AACAGATTCT CGACCTGAAC CCGGCATTAA ACTTCGTTTG TATTGACGTG GCGAATGGTT ATTCCGAACA CTTCGTGCAG TTCGTTGCGA AAGCGCGTGA AGCGTGGCCG ACCAAAACCA TTTGTGCTGG TAACGTAGTG ACTGGTGAAA TGTGTGAGGA GCTTATCCTC TCAGGTGCCG ATATCGTTAA AGTTGGCATT GGCCCAGGTT CTGTTTGTAC AACTCGCGTC AAAACAGGCG TCGGTTATCC GCAACTTTCT GCGGTAATCG AATGTGCCGA TGCTGCGCAC GGTCTGGGCG GAATGATCAT CAGCGATGGT GGCTGCACCA CGCCGGGCGA TGTGGCGAAA GCCTTTGGTG GCGGTGCCGA TTTCGTCATG CTTGGCGGCA TGCTGGCGGG CCACGAAGAG AGCGGCGGTC GCATCGTTGA GGAGAACGGC GAGAAATTTA TGCTGTTCTA CGGCATGAGC TCCGAGTCTG CGATGAAACG TCACGTTGGC GGCGTTGCTG AATATCGCGC AGCAGAAGGT AAAACCGTTA AGCTGCCGCT GCGAGGCCCG GTTGAAAATA CCGCGCGTGA TATTTTGGGC GGCCTGCGTT CAGCTTGTAC ATACGTTGGG GCTTCACGCC TGAAAGAACT GACCAAGCGC ACCACATTTA TCCGTGTGCA GGAACAAGAA AACCGCATCT TCAACAACCT GTAA
|
Protein sequence | MRIEEDLKLG FKDVLIRPKR STLKSRSDVE LERQFTFKHS GQSWSGVPII AANMDTVGTF SMASALASFD ILTAVHKHYS VEEWQAFINN SSADVLKHVM VSTGTSDADF EKTKQILDLN PALNFVCIDV ANGYSEHFVQ FVAKAREAWP TKTICAGNVV TGEMCEELIL SGADIVKVGI GPGSVCTTRV KTGVGYPQLS AVIECADAAH GLGGMIISDG GCTTPGDVAK AFGGGADFVM LGGMLAGHEE SGGRIVEENG EKFMLFYGMS SESAMKRHVG GVAEYRAAEG KTVKLPLRGP VENTARDILG GLRSACTYVG ASRLKELTKR TTFIRVQEQE NRIFNNL
|
| |