Gene EcSMS35_2657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2657 
SymbolguaB 
ID6142770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2716901 
End bp2718436 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content55% 
IMG OID641617528 
Productinosine 5'-monophosphate dehydrogenase 
Protein accessionYP_001744693 
Protein GI170684009 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000195261 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCGG TTACGCTCTG TATAATGCCG CGGCAATATT TATTAACCAC TCTGGTCGAG 
ATATTGCCCA TGCTACGTAT CGCTAAAGAA GCTCTGACGT TTGACGACGT TCTCCTCGTT
CCTGCTCATT CTACCGTTCT GCCGAATACT GCTGACCTCA GCACCCAGCT GACGAAAACT
ATTCGTCTGA ATATCCCTAT GCTTTCCGCA GCAATGGATA CCGTAACGGA AGCGCGCCTG
GCTATTGCTC TGGCTCAGGA AGGCGGTATC GGCTTTATCC ACAAAAACAT GTCCATTGAA
CGCCAGGCAG AAGAAGTTCG CCGTGTGAAA AAACACGAAT CTGGCGTGGT AACTGATCCG
CAGACCGTGC TGCCGACAAC CACCCTGCGT GAAGTGAAAG AACTGACCGA GCGTAACGGC
TTTGCGGGCT ACCCGGTGGT TACCGAAGAA AACGAACTGG TCGGCATCAT CACCGGTCGT
GACGTGCGTT TTGTGACCGA TCTGAATCAA CCGGTTAGCG TTTACATGAC GCCGAAAGAG
CGTCTGGTAA CCGTGCGTGA AGGCGAAGCC CGTGAAGTGG TGCTGGCAAA AATGCACGAA
AAACGCGTTG AAAAGGCGCT GGTGGTTGAT GACGAATTCC ACCTGATCGG CATGATCACT
GTGAAGGACT TCCAGAAAGC GGAACGTAAA CCGAACGCGT GTAAAGACGA ACAAGGCCGT
CTGCGTGTAG GTGCAGCGGT TGGCGCGGGT GCGGGTAACG AAGAGCGTGT TGATGCGCTG
GTTGCCGCAG GCGTTGACGT TCTGCTGATC GACTCCTCTC ACGGTCACTC TGAAGGTGTT
CTGCAGCGTA TCCGTGAAAC TCGCGCTAAA TATCCTGACC TGCAAATCAT CGGCGGCAAC
GTGGCAACAG CAGCAGGTGC CCGCGCGCTG GCAGAAGCCG GTTGCAGTGC CGTTAAAGTG
GGTATCGGCC CTGGTTCTAT TTGTACTACC CGCATTGTTA CTGGCGTCGG TGTTCCGCAG
ATCACCGCCG TTGCTGACGC AGTAGAAGCC CTGGAAGGCA CCGGAATTCC GGTTATCGCT
GACGGTGGTA TTCGTTTCTC CGGCGACATC GCCAAAGCTA TCGCCGCTGG CGCAAGCGCG
GTAATGGTGG GTTCCATGCT GGCAGGTACT GAAGAATCTC CGGGTGAAAT TGAACTCTAC
CAGGGCCGTT CTTACAAATC TTACCGTGGT ATGGGTTCCC TGGGCGCGAT GTCCAAAGGT
TCTTCTGACC GTTACTTCCA GAGCGACAAC GCTGCCGACA AACTGGTGCC GGAAGGTATC
GAAGGTCGCG TGGCCTATAA AGGTCGCCTG AAAGAGATCA TTCACCAGCA GATGGGCGGC
CTGCGCTCCT GTATGGGCCT GACCGGCTGT GGTACTATCG ACGAACTGCG TACTAAAGCG
GAGTTTGTAC GTATCAGCGG TGCGGGCATT CAGGAAAGCC ACGTTCACGA CGTGACCATT
ACTAAAGAGT CCCCGAACTA CCGTCTGGGC TCCTGA
 
Protein sequence
MQSVTLCIMP RQYLLTTLVE ILPMLRIAKE ALTFDDVLLV PAHSTVLPNT ADLSTQLTKT 
IRLNIPMLSA AMDTVTEARL AIALAQEGGI GFIHKNMSIE RQAEEVRRVK KHESGVVTDP
QTVLPTTTLR EVKELTERNG FAGYPVVTEE NELVGIITGR DVRFVTDLNQ PVSVYMTPKE
RLVTVREGEA REVVLAKMHE KRVEKALVVD DEFHLIGMIT VKDFQKAERK PNACKDEQGR
LRVGAAVGAG AGNEERVDAL VAAGVDVLLI DSSHGHSEGV LQRIRETRAK YPDLQIIGGN
VATAAGARAL AEAGCSAVKV GIGPGSICTT RIVTGVGVPQ ITAVADAVEA LEGTGIPVIA
DGGIRFSGDI AKAIAAGASA VMVGSMLAGT EESPGEIELY QGRSYKSYRG MGSLGAMSKG
SSDRYFQSDN AADKLVPEGI EGRVAYKGRL KEIIHQQMGG LRSCMGLTGC GTIDELRTKA
EFVRISGAGI QESHVHDVTI TKESPNYRLG S