Gene EcolC_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1046 
Symbol 
ID6066419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1132920 
End bp1134188 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content57% 
IMG OID641600459 
Producthydroxyglutarate oxidase 
Protein accessionYP_001724042 
Protein GI170019088 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATT TTGTGATTAT TGGCGGTGGC ATCATCGGCA TGTCGACCGC CATGCAACTG 
ATTGATGTCT ACCCGGACGC CCGCATTGCG TTGCTGGAAA AAGAGTCCGG CCCGGCCTGT
CACCAGACGG GCCACAACAG CGGCGTGATC CATGCCGGGG TTTATTACAC GCCCGGTAGC
CTGAAGGCAC AGTTTTGCCT GGCGGGAAAC CGCGCCACTA AAGCCTTTTG CGATCAAAAC
GGCATTCGCT ATGACAACTG CGGCAAGATG CTGGTCGCCA CCTCTGAACT CGAAATGGAA
CGGATGCGCG CGTTGTGGGA ACGCACGGCG GCAAACGGTA TCGAGCGCGA GTGGTTAAAC
GCCGATGAAC TGCGCGAGCG CGAACCGAAT ATCACCGGGC TTGGCGGTAT TTTTGTGCCG
TCCAGCGGCA TTGTCAGCTA CCGCGAAGTA ACGGCGGCGA TGGCAAAAAT TTTCCAGGCC
AGAGGCGGCG AGATTATCTA TAACGCCGAA GTCAGCGGCC TCAGTGAGCA TAAAAACGGC
GTGGTGATAC GTACCCGCCA GGGCGGCGAA TATGAAGCAT CGACGCTGAT TAGCTGTTCC
GGGCTGATGG CTGACCGGCT GGTGAAAATG CTCGGACTCG AACCGGGCTT TATCATCTGC
CCGTTCCGTG GTGAGTATTT CCGCCTTGCG CCGGAGCATA ACCAGATTGT TAACCACCTG
ATTTACCCCA TTCCCGACCC CGCGATGCCA TTTTTGGGCG TGCATCTCAC CCGAATGATC
GATGGCAGCG TGACCGTCGG GCCAAACGCG GTGCTGGCTT TTAAACGCGA AGGCTACCAC
AAGCGCGACT TCTCGTTTAG CGACACGCTG GAAATTTTGG GCTCGTCGGG GATTCGCCGG
GTGCTGCAAA ACCATCTACG CTCAGGACTG GGCGAGATGA AAAACTCGCT GTGCAAAAGC
GGCTATCTGC GGCTGGTGCA AAAGTATTGT CCCCGGCTTT CGTTAAGCGA TCTCCAGCCC
TGGCCCGCCG GTGTGCGGGC GCAGGCGGTA TCGCCGGACG GCAAGCTGAT TGACGATTTT
CTGTTTGTCA CCACCCCGCG CACGATCCAC ACCTGCAATG CGCCCTCCCC GGCAGCGACA
TCAGCAATTC CTATTGGTGC GCATATTGTC AGCAAGGTAC AAACGCTGTT GGCAAGCCAG
AGTAACCCCG GACGCACGCT GCGAGCGGCA CGTAGTGTGG ATGCCTTACA CGCCGCATTT
AATCAATAA
 
Protein sequence
MYDFVIIGGG IIGMSTAMQL IDVYPDARIA LLEKESGPAC HQTGHNSGVI HAGVYYTPGS 
LKAQFCLAGN RATKAFCDQN GIRYDNCGKM LVATSELEME RMRALWERTA ANGIEREWLN
ADELREREPN ITGLGGIFVP SSGIVSYREV TAAMAKIFQA RGGEIIYNAE VSGLSEHKNG
VVIRTRQGGE YEASTLISCS GLMADRLVKM LGLEPGFIIC PFRGEYFRLA PEHNQIVNHL
IYPIPDPAMP FLGVHLTRMI DGSVTVGPNA VLAFKREGYH KRDFSFSDTL EILGSSGIRR
VLQNHLRSGL GEMKNSLCKS GYLRLVQKYC PRLSLSDLQP WPAGVRAQAV SPDGKLIDDF
LFVTTPRTIH TCNAPSPAAT SAIPIGAHIV SKVQTLLASQ SNPGRTLRAA RSVDALHAAF
NQ