Gene EcHS_A2178 details

Gene Information

Locus tag: EcHS_A2178
Symbol: ugd
ID: 5594708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2156840
End bp: 2158006
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 51%
IMG OID: 640921311
Product: UDP-glucose 6-dehydrogenase
Protein accession: YP_001458850
Protein GI: 157161532
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
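
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose
    final codon is a stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record above.
    length = gene_length(2156840, 2158006)
    print(length)                  # 1167
    print(protein_length(length))  # 388
```

Under these assumptions the computed values agree with the Gene Length (1167 bp) and Protein Length (388 aa) listed in the record.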


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.112663
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTA CTATTTCCGG TACAGGTTAT GTTGGTTTAT CGAACGGTGT CCTGATTGCG 
CAAAACCACG AAGTGGTTGC TCTGGATATC GTACAGGCCA AAGTGGATAT GCTTAACCAG
AAGATCTCTC CGATTGTGGA TAAAGAGATT CAGGAATATC TGGCAGAAAA ACCGTTAAAT
TTCCGTGCCA CCACGGACAA GCACGACGCC TATCGTAATG CCGACTACGT GATCATTGCG
ACGCCGACCG ATTACGATCC CAAAACCAAC TACTTCAACA CCTCTACAGT GGAAGCGGTT
ATTCGCGATG TCACAGAGAT CAACCCGAAC GCGGTGATGA TCATTAAATC GACCATCCCG
GTGGGGTTCA CCCGCGACAT CAAAGAACGT TTAGGGATTG ATAATGTTAT TTTCTCTCCT
GAGTTCCTGC GTGAAGGCCG TGCGCTGTAC GACAACCTGC ACCCATCGCG CATTGTTATT
GGTGAGCGCT CTGCGCGTGC CGAGCGTTTC GCAGACCTGC TGAAAGAAGG CGCGATTAAG
CAGGATATCC CGACCCTGTT TACCGACTCC ACTGAAGCGG AAGCGATCAA ACTGTTCGCG
AACACCTATC TGGCGCTGCG TGTTGCCTAT TTCAATGAGC TCGACAGCTA TGCTGAAAGC
CAGGGGCTGA ACAGCAAGCA GATTATCGAA GGGGTATGCC TGGATCCGCG TATCGGCAAC
CACTACAACA ACCCGTCCTT TGGCTATGGC GGCTACTGCC TGCCGAAAGA TACCAAGCAG
CTGCTGGCGA ACTACGAATC GGTCCCGAAC AATATCATCG CGGCTATCGT GGATGCTAAC
CGTACCCGTA AAGACTTTAT CGCGGATTCT ATTCTCGCCC GTAAGCCGAA AGTGGTGGGT
GTGTATCGCC TGATCATGAA GAGTGGTTCG GACAACTTCC GTGCTTCTTC TATTCAGGGC
ATTATGAAGC GCATCAAGGC GAAAGGTATT CCGGTTATTA TCTATGAACC GGTGATGCAG
GAAGATGAGT TCTTTAACTC CCGCGTCGTG CGCGACCTGG ATACCTTCAA ACAAGAGGCG
GATGTGATCA TCTCTAACCG TATGGCGGAA GAGCTGGCGG ATGTGGCGGA CAAGGTATAC
ACCCGCGATC TGTTTGGTAA CGATTAA
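
The gene sequence above is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is used. The sketch below is one way to do that and to compute a GC fraction for comparison with the GC content field in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence
    and return it in upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, used as a small sample.
    sample = "ATGAAAATTA CTATTTCCGG"
    print(clean_sequence(sample))
    print(len(clean_sequence(sample)))
    print(gc_fraction(sample))
```

Passing the full block-formatted gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the listed GC content.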
 
Protein sequence
MKITISGTGY VGLSNGVLIA QNHEVVALDI VQAKVDMLNQ KISPIVDKEI QEYLAEKPLN 
FRATTDKHDA YRNADYVIIA TPTDYDPKTN YFNTSTVEAV IRDVTEINPN AVMIIKSTIP
VGFTRDIKER LGIDNVIFSP EFLREGRALY DNLHPSRIVI GERSARAERF ADLLKEGAIK
QDIPTLFTDS TEAEAIKLFA NTYLALRVAY FNELDSYAES QGLNSKQIIE GVCLDPRIGN
HYNNPSFGYG GYCLPKDTKQ LLANYESVPN NIIAAIVDAN RTRKDFIADS ILARKPKVVG
VYRLIMKSGS DNFRASSIQG IMKRIKAKGI PVIIYEPVMQ EDEFFNSRVV RDLDTFKQEA
DVIISNRMAE ELADVADKVY TRDLFGND