Gene EcolC_1614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1614 
Symbol 
ID6065807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1795759 
End bp1796925 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content44% 
IMG OID641601029 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_001724599 
Protein GI170019645 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.812894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA CCATTTCCGG TACTGGCTAT GTAGGCTTGT CAAACGGGCT TCTAATCGCA 
CAAAATCATG AGGTTGTGGC ATTAGATATT TTACCGTCAC GTGTTGCTAT GCTGAATGAT
CGGATATCTC CTATTGTTGA TAAGGAAATT CAGCAGTTTT TGCAATCAGA TAAAATACAC
TTTAATGCCA CATTAGATAA AAATGAAGCC TACCGGGGGG CTGATTATGT CATCATCGCT
ACTCCAACCG ACTATGATCC TAAAACTAAT TATTTCAATA CATCCAGTGT AGAATCAGTA
ATTAAAGACG TAGTTGAGAT AAATCCTTAT GCGGTTATGG TCATCAAATC AACGGTTCCC
GTTGGTTTTA CCGCAGCGAT GCATAAGAAA TATCGTACTG AAAATATTAT ATTCTCACCC
GAATTTCTCC GTGAGGGTAA AGCCCTTTAC GATAACCTTC ATCCGTCACG TATTGTCATC
GGTGAGCGTT CAGAACGCGC AGAACGTTTC GCTGCGTTAT TACAGGAAGG CGCGATTAAG
CAAAATATCC CAACCCTGTT TACCGACTCC ACTGAAGCAG AAGCGATTAA ACTTTTCGCT
AATACCTATC TGGCGATGCG CGTAGCATAC TTTAATGAAC TGGATAGCTA TGCAGAAAGT
TTAGGTCTGA ATACTCGCCA GATTATCGAA GGCGTTTGTC TCGATCCGCG TATTGGCAAC
CATTACAACA ATCCGTCGTT TGGTTATGGT GGTTATTGTC TGCCGAAAGA TACCAAGCAG
TTACTGGCGA ACTACCAGTC TGTGCCGAAT AACCTGATCT CGGCAATCGT AGACGCTAAC
CGCACGCGTA AAGATTTTAT TGCCGATGCC ATTTTGTCAC GCAAACCGCA AGTGGTGGGT
ATTTATCGTC TGATTATGAA GAGCGGTTCA GATAACTTTC GCGCGTCTTC CATTCAGGGG
ATTATGAAGC GTATCAAGGC GAAAGGCGTT GAAGTGATCA TCTACGAACC GGTGATGAAA
GAAGACTTAT TCTTCAACTC TCGCCTGGAA CGTGATCTCG CCACCTTCAA ACAACAAGCC
GACGTCATTA TTTCCAACCG TATGGCAGAA GAGCTTAAGG ATGTGGCAGA CAAAGTCTAC
ACCCGCGATC TCTTTGGCAG TGACTAA
 
Protein sequence
MKITISGTGY VGLSNGLLIA QNHEVVALDI LPSRVAMLND RISPIVDKEI QQFLQSDKIH 
FNATLDKNEA YRGADYVIIA TPTDYDPKTN YFNTSSVESV IKDVVEINPY AVMVIKSTVP
VGFTAAMHKK YRTENIIFSP EFLREGKALY DNLHPSRIVI GERSERAERF AALLQEGAIK
QNIPTLFTDS TEAEAIKLFA NTYLAMRVAY FNELDSYAES LGLNTRQIIE GVCLDPRIGN
HYNNPSFGYG GYCLPKDTKQ LLANYQSVPN NLISAIVDAN RTRKDFIADA ILSRKPQVVG
IYRLIMKSGS DNFRASSIQG IMKRIKAKGV EVIIYEPVMK EDLFFNSRLE RDLATFKQQA
DVIISNRMAE ELKDVADKVY TRDLFGSD