Gene ECH74115_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1441 
SymbolpyrC 
ID6969193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1424291 
End bp1425337 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content53% 
IMG OID643385414 
Productdihydroorotase 
Protein accessionYP_002269908 
Protein GI209397912 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.426148 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0213207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCAC CATCCCAGGT ATTAAAGATC CGCCGCCCAG ACGACTGGCA CCTTCACCTC 
CGCGATGGCG ACATGTTAAA AACTGTCGTG CCGTATACCA GCGAAATTTA TGGACGGGCT
ATCGTAATGC CCAATCTGGC TCCGCCCGTG ACCACCGTTG AGGCTGCCGT GGCGTATCGC
CAACGTATTC TTCACGCCGT ACCTGCCGGG CACGATTTCA CCCCATTGAT GACCTGTTAT
TTAACAGATT CGCTGGATCC TAATGAGCTG GAGCGCGGAT TTAACGAAGG CGTGTTCACC
GCTGCAAAAC TTTACCCGGC AAACGCAACC ACTAACTCCA GCCACGGCGT GACGTCAGTT
GACGCAATCA TGCCGGTATT AGAGCGCATG GAAAAAATCG GTATGCCGCT ACTGGTGCAT
GGTGAAGTGA CACATGCAGA TATCGACATT TTTGATCGTG AAGCGCGCTT TATAGAAAGC
GTGATGGAAC CTCTGCGCCA GCGCCTGACC GCGCTGAAAG TCGTTTTTGA GCACATCACC
ACCAAAGACG CAGCCGACTA TGTCCGTGAC GGAAATGAAC GGCTGGCTGC TACCATCACT
CCGCAACATC TGATGTTTAA CCGCAACCAT ATGCTGGTTG GTGGCGTGCG TCCGCACCTG
TATTGTCTAC CCATCCTCAA ACGCAATATT CACCAACAGG CATTGCGTGA ACTAGTCGCC
AGCGGTTTTA ATCGAGTATT CCTCGGTACG GATTCTGCGC CACATGCACG TCATCGCAAA
GAGAGCAGCT GCGGCTGCGC GGGCTGCTTC AACGCCCCAA CCGCGCTGGG AAGTTACGCT
ACCGTCTTTG AAGAGATGAA TGCTCTGCAG CACTTTGAAG CATTCTGTTC TGTAAACGGC
CCGCAGTTCT ATGGGTTACC GGTCAACGAC ACATTCATCG AACTGGTACG TGAAGAGCAT
CAGGTTGCTG AAAGCATCGC ACTGACTGAT GACACGCTGG TGCCATTCCT CGCCGGGGAA
ACGGTACGCT GGTCTGTTAA ACAATAA
 
Protein sequence
MTAPSQVLKI RRPDDWHLHL RDGDMLKTVV PYTSEIYGRA IVMPNLAPPV TTVEAAVAYR 
QRILHAVPAG HDFTPLMTCY LTDSLDPNEL ERGFNEGVFT AAKLYPANAT TNSSHGVTSV
DAIMPVLERM EKIGMPLLVH GEVTHADIDI FDREARFIES VMEPLRQRLT ALKVVFEHIT
TKDAADYVRD GNERLAATIT PQHLMFNRNH MLVGGVRPHL YCLPILKRNI HQQALRELVA
SGFNRVFLGT DSAPHARHRK ESSCGCAGCF NAPTALGSYA TVFEEMNALQ HFEAFCSVNG
PQFYGLPVND TFIELVREEH QVAESIALTD DTLVPFLAGE TVRWSVKQ