Gene EcSMS35_2174 details

Gene Information

Locus tag: EcSMS35_2174
Symbol: pyrD
ID: 6143184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2181670
End bp: 2182680
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 47%
IMG OID: 641617050
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_001744224
Protein GI: 170682541
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
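
The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based inclusive positions and the gene length includes the stop codon. The record does not state either convention, so both are assumptions here, and the function names are illustrative only. A minimal sketch of the arithmetic:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2181670, 2182680)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both printed values match the Gene Length and Protein Length fields of this record.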


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000792039
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.661729
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACTACC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCAGAGCG CGCTCATGAG 
TTTACTTTTC AGCAATTACG CCGTATTACC GGAACGCCGT TTGAAGCACT GGTGCGGCAG
AAAGTGCCTG CGAAACCTGT TAACTGCATG GGCCTGACGT TTAAAAATCC GCTTGGTCTG
GCAGCCGGTC TTGATAAAGA CGGGGAGTGC ATTGACGCGT TAGGCGCGAT GGGTTTTGGA
TCTATCGAGA TCGGTACCGT CACGCCACGT CCACAGCCAG GTAATGACAA GCCGCGTCTC
TTTCGTCTGG TAGATGCCGA AGGTTTGATC AACCGTATGG GCTTTAATAA TCTTGGCGTT
GATAACCTCG TAGAGAACGT AAAAAAAGCC CATTATGATG GCGTCCTGGG TATTAACATC
GGCAAAAATA AAGATACGCC GGTGGAGCAG GGTAAAGATG ACTATCTGAT TTGTATGGAA
AAAATCTATG CTTATGCGGG ATATATCGCC ATCAATATTT CATCGCCGAA TACCCCAGGA
CTACGCACAC TGCAATATGG CGAAGCGCTG GATGATCTCT TAACCGCGAT TAAAAATAAG
CAAAATGATT TGCAAGCGAT GCACCATAAA TATGTGCCGA TCGCAGTGAA GATCGCGCCG
GATCTTTCTG AAGAAGAATT GATCCAGGTT GCCGATAGTT TAGTTCGCCA TAATATTGAT
GGCGTTATTG CAACCAATAC CACGCTCGAT CGTTCTCTGG TTCAGGGAAT GAAAAATTGT
GATCAAACCG GTGGCTTAAG TGGTCGTCCG CTTCAGTTAA AAAGCACAGA AATTATTCGC
CGCTTGTCAC AGGAATTAAA CGGTCGCTTA CCGATCATCG GTGTTGGCGG CATTGACTCG
GTTATCGCTG CGCGTGAAAA GATTGCTGCG GGGGCCTCAC TGGTGCAAAT TTATTCTGGT
TTTATTTTTA AAGGTCCGCC GCTGATTAAA GAAATCGTTA CCCATATCTA A
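
The GC content field in the Gene Information section can in principle be recomputed from a sequence laid out in 10-base groups like the one above. The sketch below is one way to do that; the function name is hypothetical, and how the database rounds its percentage is not stated in the record, so the result may differ from the listed value in the last digit.

```python
def gc_content_percent(sequence_text: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between 10-base groups) is
    removed first. Assumes the sequence contains only A, C, G and T.
    """
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Example on the first two 10-base groups of the gene sequence only;
# paste the full sequence text to get the figure for the whole gene.
fragment = "ATGTACTACC CCTTCGTTCG"
print(gc_content_percent(fragment))
```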
 
Protein sequence
MYYPFVRKAL FQLDPERAHE FTFQQLRRIT GTPFEALVRQ KVPAKPVNCM GLTFKNPLGL 
AAGLDKDGEC IDALGAMGFG SIEIGTVTPR PQPGNDKPRL FRLVDAEGLI NRMGFNNLGV
DNLVENVKKA HYDGVLGINI GKNKDTPVEQ GKDDYLICME KIYAYAGYIA INISSPNTPG
LRTLQYGEAL DDLLTAIKNK QNDLQAMHHK YVPIAVKIAP DLSEEELIQV ADSLVRHNID
GVIATNTTLD RSLVQGMKNC DQTGGLSGRP LQLKSTEIIR RLSQELNGRL PIIGVGGIDS
VIAAREKIAA GASLVQIYSG FIFKGPPLIK EIVTHI
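
The protein sequence corresponds to the gene sequence translated with translation table 11 (the NCBI bacterial, archaeal and plant plastid code). The following is a minimal sketch of such a translation, not the database's own method. It assumes the sequence is in frame from its first base, contains only A, C, G and T, and starts with ATG, so the alternative start codons allowed by table 11 need no special handling. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
# Amino acids for translation table 11 in that order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate an in-frame DNA sequence up to the first stop codon."""
    seq = "".join(dna_text.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence (ten complete codons).
print(translate("ATGTACTACC CCTTCGTTCG TAAAGCCCTT"))  # MYYPFVRKAL
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence text would translate the whole coding region up to the terminal TAA.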