Gene EcSMS35_2780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2780 
Symbol 
ID6146911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2863598 
End bp2864866 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content57% 
IMG OID641617649 
Producthydroxyglutarate oxidase 
Protein accessionYP_001744809 
Protein GI170682247 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.582782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATT TTGTGATTAT TGGCGGCGGC ATCATCGGCA TGTCGACCGC CATGCAACTG 
ATTGATGTTT ATCCGGATGC CCGCATTGCG TTGCTGGAAA AAGAGTCCGG CCCGGCCTTT
CATCAGACGG GCCACAACAG CGGCGTGATC CATGCCGGGG TCTATTACAC GCCCGGTAGC
CTGAAGGCAC AGTTTTGCCT GGCGGGAAAC CGCGCCACCA AAGCCTTTTG CGATCAAAAC
GGCATTCGCT ATGACAACTG CGGCAAGATG CTGGTCGCCA CCTCTGAACT CGAAATGGAA
CGGATGCGCG CGTTGTGGGA ACGCACGGCG GCGAACGGTA TCGAGCGCGA GTGGTTAAAC
GCCGAGGAAC TGCGCGAGCG CGAACCGAAT ATCACCGGGC TCGGCGGCAT TTTTGTGCCG
TCCAGCGGCA TTGTCAGCTA TCGCGAAGTG ACGGCGGCGA TGGCAAAAAT CTTCCAGGCC
AGAGGCGGCG AGATTATTTA TAACGCCGAA GTCAGCGCCC TCAGTGAGCA TAAAAACGGC
GTGGTGATAC GTACCCGTCA GGGCGGTGAA TATGAAGCAT CGACGCTGAT TAGCTGTTCC
GGGCTGATGG CTGACCGGCT GGTGAAAATG CTCGGCCTCG AACCGGGCTT TATTATCTGC
CCGTTCCGTG GCGAGTATTT CCGCCTTGCG CCGGAGCATA ACCAGATTGT TAACCACCTG
ATTTACCCCA TTCCCGACCC CGCGATGCCA TTTTTGGGCG TGCATCTCAC CCGCATGATC
GATGGCAGCG TGACCGTCGG GCCAAACGCG GTGCTGGCTT TTAAACGCGA AGGCTATCGC
AAGCGCGATT TCTCGTTTAG TGACACGCTG GAGATTTTAG GCTCGTCGGG GATTCGCCGG
GTGCTGCAAA ACCATCTACG CTCAGGATTG GGCGAGATGA AAAACTCGCT GTGCAAAAGC
GGCTATCTGC GGCTGGTGCA AAAGTATTGT CCCCGGCTTT CGTTAAGCGA TCTCCAGCCC
TGGCCCGCAG GTGTGCGGGC GCAGGCGGTA TCGCCGGACG GCAAGCTGAT TGACGATTTT
CTGTTTGTCA CCACCCCGCG CACGATCCAC ACCTGCAATG CGCCCTCCCC GGCAGCGACA
TCAGCAATTC CTATTGGTGC GCATATTGTC AGCAAGGTAC AAACGCTGTT GGCAAGCCAG
AGTAACCCCG GACGCACGCT GCGAGCGGCA CGTAGTGTGG ATGCCTTACA CGCCGCATTT
AATCAATAA
 
Protein sequence
MYDFVIIGGG IIGMSTAMQL IDVYPDARIA LLEKESGPAF HQTGHNSGVI HAGVYYTPGS 
LKAQFCLAGN RATKAFCDQN GIRYDNCGKM LVATSELEME RMRALWERTA ANGIEREWLN
AEELREREPN ITGLGGIFVP SSGIVSYREV TAAMAKIFQA RGGEIIYNAE VSALSEHKNG
VVIRTRQGGE YEASTLISCS GLMADRLVKM LGLEPGFIIC PFRGEYFRLA PEHNQIVNHL
IYPIPDPAMP FLGVHLTRMI DGSVTVGPNA VLAFKREGYR KRDFSFSDTL EILGSSGIRR
VLQNHLRSGL GEMKNSLCKS GYLRLVQKYC PRLSLSDLQP WPAGVRAQAV SPDGKLIDDF
LFVTTPRTIH TCNAPSPAAT SAIPIGAHIV SKVQTLLASQ SNPGRTLRAA RSVDALHAAF
NQ