Gene EcHS_A2659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2659 
SymbolguaB 
ID5594971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2670244 
End bp2671710 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content54% 
IMG OID640921774 
Productinosine 5'-monophosphate dehydrogenase 
Protein accessionYP_001459301 
Protein GI157161983 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000118294 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACGTA TCGCTAAAGA AGCTCTGACG TTTGACGACG TTCTCCTCGT TCCTGCTCAT 
TCTACCGTTC TGCCGAATAC TGCTGACCTC AGCACCCAGC TGACGAAAAC TATTCGTCTG
AATATCCCTA TGCTTTCCGC AGCAATGGAT ACCGTAACGG AAGCGCGCCT GGCTATTGCT
CTGGCTCAGG AAGGCGGTAT CGGCTTTATC CACAAAAACA TGTCCATTGA ACGCCAGGCA
GAAGAAGTTC GCCGTGTGAA AAAACACGAA TCTGGTGTGG TGACTGATCC GCAGACTGTT
CTGCCAACCA CGACGCTGCG CGAAGTGAAA GAACTGACCG AGCGTAACGG TTTTGCGGGC
TATCCGGTCG TTACCGAAGA AAACGAACTG GTGGGTATTA TCACCGGTCG TGACGTGCGT
TTTGTTACCG ACCTGAACCA GCCGGTTAGC GTTTACATGA CGCCGAAAGA GCGTCTGGTC
ACCGTGCGTG AAGGTGAAGC CCGTGAAGTG GTGCTGGCAA AAATGCACGA AAAACGCGTT
GAAAAAGCGC TGGTGGTTGA TGACGAATTC CACCTGATCG GCATGATCAC CGTGAAAGAT
TTCCAGAAAG CGGAACGTAA ACCGAACGCC TGTAAAGACG AGCAAGGCCG TCTGCGTGTT
GGTGCTGCAG TTGGCGCAGG TGCGGGTAAC GAAGAGCGTG TTGACGCGCT GGTTGCCGCT
GGCGTTGACG TTCTGCTGAT CGACTCCTCC CACGGTCACT CAGAAGGCGT TTTGCAGCGT
ATCCGTGAAA CCCGTGCTAA ATATCCGGAT CTGCAAATCA TCGGCGGCAA CGTGGCAACA
GCTGCAGGTG CACGCGCTTT GGCAGAAGCT GGTTGCAGCG CGGTTAAAGT CGGTATCGGC
CCTGGTTCTA TCTGTACTAC TCGTATCGTT ACTGGCGTAG GTGTTCCGCA GATCACTGCC
GTTGCTGACG CAGTTGAAGC GCTGGAAGGC ACAGGTATTC CGGTTATCGC AGATGGTGGT
ATTCGCTTCT CCGGCGACAT CGCCAAAGCT ATCGCCGCTG GCGCAAGCGC GGTAATGGTG
GGTTCCATGC TGGCGGGTAC TGAAGAATCT CCGGGTGAAA TCGAACTCTA CCAGGGCCGT
TCTTACAAAT CTTACCGTGG TATGGGTTCC CTGGGCGCGA TGTCCAAAGG TTCCTCTGAC
CGTTATTTCC AGAGCGATAA CGCTGCCGAC AAACTGGTGC CGGAAGGTAT CGAAGGTCGC
GTAGCTTATA AAGGTCGCCT GAAAGAGATC ATTCACCAGC AGATGGGTGG CCTGCGCTCC
TGTATGGGCC TGACCGGCTG TGGTACTATC GACGAACTGC GTACTAAAGC GGAGTTTGTA
CGTATCAGCG GTGCGGGCAT TCAGGAAAGC CACGTTCACG ACGTGACCAT TACTAAAGAG
TCCCCGAACT ACCGTCTGGG CTCCTGA
 
Protein sequence
MLRIAKEALT FDDVLLVPAH STVLPNTADL STQLTKTIRL NIPMLSAAMD TVTEARLAIA 
LAQEGGIGFI HKNMSIERQA EEVRRVKKHE SGVVTDPQTV LPTTTLREVK ELTERNGFAG
YPVVTEENEL VGIITGRDVR FVTDLNQPVS VYMTPKERLV TVREGEAREV VLAKMHEKRV
EKALVVDDEF HLIGMITVKD FQKAERKPNA CKDEQGRLRV GAAVGAGAGN EERVDALVAA
GVDVLLIDSS HGHSEGVLQR IRETRAKYPD LQIIGGNVAT AAGARALAEA GCSAVKVGIG
PGSICTTRIV TGVGVPQITA VADAVEALEG TGIPVIADGG IRFSGDIAKA IAAGASAVMV
GSMLAGTEES PGEIELYQGR SYKSYRGMGS LGAMSKGSSD RYFQSDNAAD KLVPEGIEGR
VAYKGRLKEI IHQQMGGLRS CMGLTGCGTI DELRTKAEFV RISGAGIQES HVHDVTITKE
SPNYRLGS