Gene Information |
Locus tag | EcSMS35_0108 |
Symbol | guaC |
ID | 6143417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 119960 |
End bp | 121003 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615009 |
Product | guanosine 5'-monophosphate oxidoreductase |
Protein accession | YP_001742225 |
Protein GI | 170683674 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01305] guanosine monophosphate reductase, eukaryotic |
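
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed 1044 bp) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 119960
END_BP = 121003


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1044 347
```

Both results agree with the Gene Length and Protein Length fields.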
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000115881 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.600048 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATTG AAGAAGATCT GAAGTTAGGT TTTAAAGACG TTCTCATCCG CCCTAAACGC TCCACTCTTA AAAGCCGTTC CGATGTTGAA CTGGAACGTC AATTCACCTT CAAACATTCA GGTCAGAGCT GGTCCGGCGT GCCGATTATC GCCGCAAATA TGGACACCGT AGGCACATTT TCTATGGCCT CTGCGCTGGC TTCTTTTGAT ATTTTGACTG CTGTGCATAA ACACTATTCT GTCGAAGAGT GGCAAGCGTT TATCAACAAT TCTTCCGCTG ATGTGCTGAA ACATGTGATG GTTTCTACCG GCACGTCTGA TGCGGATTTC GAAAAAACTA AACAGATTCT CGACCTGAAC CCGGCATTAA ACTTCGTTTG TATTGACGTG GCGAATGGTT ATTCCGAACA CTTCGTGCAG TTCGTTGCGA AAGCGCGTGA AGCGTGGCCG ACCAAAACCA TTTGTGCTGG CAACGTAGTG ACTGGTGAAA TGTGTGAGGA GCTTATCCTC TCCGGTGCCG ATATCGTTAA AGTTGGCATT GGCCCAGGTT CTGTTTGTAC AACTCGCGTC AAAACAGGCG TCGGTTATCC GCAACTTTCT GCGGTAATCG AATGTGCCGA TGCTGCGCAC GGTCTGGGCG GAATGATCGT CAGCGACGGT GGCTGCACCA CGCCGGGCGA TGTCGCGAAA GCCTTTGGCG GCGGTGCCGA TTTCGTCATG CTTGGCGGCA TGCTGGCGGG CCACGAAGAG AGCGGCGGTC GCATCGTTGA GGAGAACGGC GAGAAATTTA TGCTGTTCTA CGGCATGAGC TCAGAGTCTG CGATGAAACG TCACGTTGGC TGCGTTGCGG AATATCGCGC AGCAGAAGGT AAAACCGTTA AGCTGCCGCT GCGAGGCCCG GTTGAAAATA CCGCGCGTGA TATTTTGGGC GGCCTGCGTT CAGCTTGTAC ATACGTTGGG GCTTCACGCC TGAAAGAGCT GACCAAGCGC ACCACATTTA TCCGTGTGCA GGAACAAGAA AACCGCATCT TCAACAACCT GTAA
|
Protein sequence | MRIEEDLKLG FKDVLIRPKR STLKSRSDVE LERQFTFKHS GQSWSGVPII AANMDTVGTF SMASALASFD ILTAVHKHYS VEEWQAFINN SSADVLKHVM VSTGTSDADF EKTKQILDLN PALNFVCIDV ANGYSEHFVQ FVAKAREAWP TKTICAGNVV TGEMCEELIL SGADIVKVGI GPGSVCTTRV KTGVGYPQLS AVIECADAAH GLGGMIVSDG GCTTPGDVAK AFGGGADFVM LGGMLAGHEE SGGRIVEENG EKFMLFYGMS SESAMKRHVG CVAEYRAAEG KTVKLPLRGP VENTARDILG GLRSACTYVG ASRLKELTKR TTFIRVQEQE NRIFNNL
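
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is one way to do this with the Python standard library only. It assumes the spaces in the sequence fields are display grouping, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the display spacing and normalize case.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGTATTG AAGAAGATCT GAAGTTAGGT"))  # MRIEEDLKLG
```

The output matches the first block of the protein sequence listed above. Applying `translate` to the full gene sequence gives a way to compare it with the full protein sequence.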