Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2657 |
Symbol | guaB |
ID | 6142770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2716901 |
End bp | 2718436 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617528 |
Product | inosine 5'-monophosphate dehydrogenase |
Protein accession | YP_001744693 |
Protein GI | 170684009 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01302] inosine-5'-monophosphate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000195261 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCGG TTACGCTCTG TATAATGCCG CGGCAATATT TATTAACCAC TCTGGTCGAG ATATTGCCCA TGCTACGTAT CGCTAAAGAA GCTCTGACGT TTGACGACGT TCTCCTCGTT CCTGCTCATT CTACCGTTCT GCCGAATACT GCTGACCTCA GCACCCAGCT GACGAAAACT ATTCGTCTGA ATATCCCTAT GCTTTCCGCA GCAATGGATA CCGTAACGGA AGCGCGCCTG GCTATTGCTC TGGCTCAGGA AGGCGGTATC GGCTTTATCC ACAAAAACAT GTCCATTGAA CGCCAGGCAG AAGAAGTTCG CCGTGTGAAA AAACACGAAT CTGGCGTGGT AACTGATCCG CAGACCGTGC TGCCGACAAC CACCCTGCGT GAAGTGAAAG AACTGACCGA GCGTAACGGC TTTGCGGGCT ACCCGGTGGT TACCGAAGAA AACGAACTGG TCGGCATCAT CACCGGTCGT GACGTGCGTT TTGTGACCGA TCTGAATCAA CCGGTTAGCG TTTACATGAC GCCGAAAGAG CGTCTGGTAA CCGTGCGTGA AGGCGAAGCC CGTGAAGTGG TGCTGGCAAA AATGCACGAA AAACGCGTTG AAAAGGCGCT GGTGGTTGAT GACGAATTCC ACCTGATCGG CATGATCACT GTGAAGGACT TCCAGAAAGC GGAACGTAAA CCGAACGCGT GTAAAGACGA ACAAGGCCGT CTGCGTGTAG GTGCAGCGGT TGGCGCGGGT GCGGGTAACG AAGAGCGTGT TGATGCGCTG GTTGCCGCAG GCGTTGACGT TCTGCTGATC GACTCCTCTC ACGGTCACTC TGAAGGTGTT CTGCAGCGTA TCCGTGAAAC TCGCGCTAAA TATCCTGACC TGCAAATCAT CGGCGGCAAC GTGGCAACAG CAGCAGGTGC CCGCGCGCTG GCAGAAGCCG GTTGCAGTGC CGTTAAAGTG GGTATCGGCC CTGGTTCTAT TTGTACTACC CGCATTGTTA CTGGCGTCGG TGTTCCGCAG ATCACCGCCG TTGCTGACGC AGTAGAAGCC CTGGAAGGCA CCGGAATTCC GGTTATCGCT GACGGTGGTA TTCGTTTCTC CGGCGACATC GCCAAAGCTA TCGCCGCTGG CGCAAGCGCG GTAATGGTGG GTTCCATGCT GGCAGGTACT GAAGAATCTC CGGGTGAAAT TGAACTCTAC CAGGGCCGTT CTTACAAATC TTACCGTGGT ATGGGTTCCC TGGGCGCGAT GTCCAAAGGT TCTTCTGACC GTTACTTCCA GAGCGACAAC GCTGCCGACA AACTGGTGCC GGAAGGTATC GAAGGTCGCG TGGCCTATAA AGGTCGCCTG AAAGAGATCA TTCACCAGCA GATGGGCGGC CTGCGCTCCT GTATGGGCCT GACCGGCTGT GGTACTATCG ACGAACTGCG TACTAAAGCG GAGTTTGTAC GTATCAGCGG TGCGGGCATT CAGGAAAGCC ACGTTCACGA CGTGACCATT ACTAAAGAGT CCCCGAACTA CCGTCTGGGC TCCTGA
|
Protein sequence | MQSVTLCIMP RQYLLTTLVE ILPMLRIAKE ALTFDDVLLV PAHSTVLPNT ADLSTQLTKT IRLNIPMLSA AMDTVTEARL AIALAQEGGI GFIHKNMSIE RQAEEVRRVK KHESGVVTDP QTVLPTTTLR EVKELTERNG FAGYPVVTEE NELVGIITGR DVRFVTDLNQ PVSVYMTPKE RLVTVREGEA REVVLAKMHE KRVEKALVVD DEFHLIGMIT VKDFQKAERK PNACKDEQGR LRVGAAVGAG AGNEERVDAL VAAGVDVLLI DSSHGHSEGV LQRIRETRAK YPDLQIIGGN VATAAGARAL AEAGCSAVKV GIGPGSICTT RIVTGVGVPQ ITAVADAVEA LEGTGIPVIA DGGIRFSGDI AKAIAAGASA VMVGSMLAGT EESPGEIELY QGRSYKSYRG MGSLGAMSKG SSDRYFQSDN AADKLVPEGI EGRVAYKGRL KEIIHQQMGG LRSCMGLTGC GTIDELRTKA EFVRISGAGI QESHVHDVTI TKESPNYRLG S
|
| |