Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0109 |
Symbol | guaC |
ID | 5591082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 115083 |
End bp | 116126 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640919297 |
Product | guanosine 5'-monophosphate oxidoreductase |
Protein accession | YP_001456892 |
Protein GI | 157159574 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01305] guanosine monophosphate reductase, eukaryotic |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0000000081181 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATTG AAGAAGATCT GAAGTTAGGT TTTAAAGACG TTCTCATCCG CCCTAAACGC TCCACTCTTA AAAGCCGTTC CGATGTTGAA CTGGAACGTC AATTCACCTT CAAACATTCA GGTCAGAGCT GGTCCGGCGT GCCAATTATC GCCGCAAATA TGGACACCGT AGGTACATTT TCTATGGCCT CTGCGCTGGC TTCTTTTGAT ATTTTGACTG CTGTGCATAA ACACTATTCT GTCGAAGAGT GGCAAGCGTT TATCAACAAT TCTTCCGCTG ATGTGCTGAA ACATGTGATG GTTTCTACCG GTACGTCTGA TGCGGATTTC GAAAAAACTA AACAGATTCT CGACCTGAAC CCGGCATTAA ACTTCGTTTG TATTGATGTG GCGAATGGTT ATTCCGAACA CTTCGTGCAG TTCGTTGCGA AAGCGCGTGA AGCGTGGCCG ACCAAAACCA TTTGTGCTGG CAACGTAGTG ACTGGTGAAA TGTGTGAGGA GCTTATCCTC TCCGGTGCTG ATATCGTTAA AGTTGGCATT GGCCCAGGTT CTGTTTGTAC AACTCGCGTC AAAACAGGCG TCGGTTATCC GCAGCTTTCT GCGGTAATCG AATGTGCCGA TGCGGCGCAC GGTCTGGGCG GAATGATCGT CAGCGACGGT GGCTGCACCA CGCCGGGCGA TGTCGCAAAA GCCTTTGGCG GCGGTGCCGA TTTCGTCATG CTTGGCGGCA TGCTGGCGGG CCACGAAGAG AGCGGCGGTC GCATCGTTGA GGAGAACGGC GAGAAATTTA TGCTGTTCTA CGGCATGAGC TCCGAGTCTG CGATGAAACG TCACGTTGGC GGCGTTGCGG AATATCGCGC AGCAGAAGGT AAAACCGTTA AGCTGCCGCT GCGAGGCCCG GTTGAAAATA CCGCGCGTGA TATTTTGGGC GGCCTGCGTT CAGCTTGTAC ATACGTTGGG GCTTCACGCC TGAAAGAACT GACCAAGCGC ACCACATTTA TCCGTGTGCA GGAACAAGAA AACCGCATCT TCAACAACCT GTAA
|
Protein sequence | MRIEEDLKLG FKDVLIRPKR STLKSRSDVE LERQFTFKHS GQSWSGVPII AANMDTVGTF SMASALASFD ILTAVHKHYS VEEWQAFINN SSADVLKHVM VSTGTSDADF EKTKQILDLN PALNFVCIDV ANGYSEHFVQ FVAKAREAWP TKTICAGNVV TGEMCEELIL SGADIVKVGI GPGSVCTTRV KTGVGYPQLS AVIECADAAH GLGGMIVSDG GCTTPGDVAK AFGGGADFVM LGGMLAGHEE SGGRIVEENG EKFMLFYGMS SESAMKRHVG GVAEYRAAEG KTVKLPLRGP VENTARDILG GLRSACTYVG ASRLKELTKR TTFIRVQEQE NRIFNNL
|
| |