Gene ECH74115_2961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2961 
Symbolugd 
ID6970139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2735345 
End bp2736511 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content43% 
IMG OID643386801 
ProductUDP-glucose 6-dehydrogenase 
Protein accessionYP_002271269 
Protein GI209398959 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000204361 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATCA CCATTTCCGG TACTGGCTAT GTAGGCTTGT CAAACGGGCT TCTAATCGCT 
CAAAATCATG AGGTTGTGGC ATTAGATATT TTACCGTCAC GCGTTGCTAT GCTGAATGAT
CGGATATCTC CTATTGTTGA TAAGGAAATT CAGCAGTTTT TGCAATCAGA TAAAATACAC
TTTAATGCCA CATTAGATAA AAATGAAGCC TACCGGGATG CTGATTATGT CATCATCGCC
ACTCCAACCG ACTATGATCC TAAAACTAAT TATTTCAATA CATCCAGTGT AGAATCAGTA
ATTAAAGACG TAGTTGAGAT AAATCCTTAT GCGGTTATGG TCATCAAATC AACGGTTCCC
GTTGGTTTTA CCGAAGCGAT GCATAAGAAA TATCGTACTG AAAATATTAT TTTCTCTCCG
GAATTTCTCC GCGAAGGTAA AGCTCTTTAC GATAACCTTC ATCCGTCACG TATTGTAATC
GGTGAGCGTT CGGAACGCGC AGAACGTTTT GCTGCGTTAT TACAGGAAGG CGCGATTAAG
CAAAATATCC CAACCCTGTT TACCGACTCC ACTGAAGCAG AAGCGATTAA ACTTTTTGCA
AACACCTACC TCGCGATGCG CGTGGCGTAC TTTAACGAAC TGGATAGCTA CGCAGAAAGT
TTAGGTCTGA ATTCCCGTCA AATAATCGAA GGCGTTTGTC TCGACCCACG TATTGGCAAC
CATTACAACA ATCCGTCGTT TGGTTATGGT GGTTATTGTT TGCCGAAAGA TACCAAGCAG
TTACTGGCGA ACTACCAGTC TGTGCCGAAT AACCTGATCT CGGCAATTGT CGATGCTAAC
CGCACGCGTA AAGATTTTAT TGCCGATGCC ATTTTGTCAC GCAAGCCGCA AGTGGTAGGT
ATTTATCGTC TGATTATGAA GAGCGGTTCA GATAACTTCC GTGCGTCTTC TATTCAGGGG
ATTATGAAGC GTATCAAGGC GAAAGGTGTT GAAGTAATCA TCTACGAGCC AGTGATGAAA
GAAGACTCAT TCTTCAACTC TCGCCTGGAA CGTGATCTCG CCACCTTCAA ACAACAAGCC
GACGTCATTA TTTCCAACCG TATGGCAGAA GAGCTTAGGG ATGTGGCAGA TAAGGTCTAC
ACCCGCGATC TCTTTGGCAG CGACTAA
 
Protein sequence
MKITISGTGY VGLSNGLLIA QNHEVVALDI LPSRVAMLND RISPIVDKEI QQFLQSDKIH 
FNATLDKNEA YRDADYVIIA TPTDYDPKTN YFNTSSVESV IKDVVEINPY AVMVIKSTVP
VGFTEAMHKK YRTENIIFSP EFLREGKALY DNLHPSRIVI GERSERAERF AALLQEGAIK
QNIPTLFTDS TEAEAIKLFA NTYLAMRVAY FNELDSYAES LGLNSRQIIE GVCLDPRIGN
HYNNPSFGYG GYCLPKDTKQ LLANYQSVPN NLISAIVDAN RTRKDFIADA ILSRKPQVVG
IYRLIMKSGS DNFRASSIQG IMKRIKAKGV EVIIYEPVMK EDSFFNSRLE RDLATFKQQA
DVIISNRMAE ELRDVADKVY TRDLFGSD