Gene EcSMS35_2067 details

Gene Information

Locus tag: EcSMS35_2067
Symbol: pyrC
ID: 6147117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2084687
End bp: 2085733
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 53%
IMG OID: 641616943
Product: dihydroorotase
Protein accession: YP_001744119
Protein GI: 170680153
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0418] Dihydroorotase
TIGRFAM ID: [TIGR00856] dihydroorotase, homodimeric type
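
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute an amino acid. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
start_bp = 2084687
end_bp = 2085733

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # assumed: the stop codon is not translated

print(gene_length, protein_length)    # prints: 1047 348
```

Both values match the Gene Length and Protein Length fields reported for this gene.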


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.863228
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.103848
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCAC CATCCCAGGT ATTAAAGATC CGCCGCCCAG ACGACTGGCA CCTTCACCTC 
CGCGATGGCG ACATGTTAAA AACTGTCGTG CCGTATACCA GCGAAATTTA TGGACGGGCT
ATCGTAATGC CCAATCTGGC TCCGCCCGTG ACCACCGTTG AGGCTGCCGT GGCGTATCGC
CAGCGCATTC TTGACGCCGT ACCTGCCGGG CACGATTTCA CCCCGCTGAT GACCTGTTAT
TTAACAGATT CGCTGGATCC TAATGAGCTG GAGCGCGGAT TTAACGAAGG CGTGTTCACC
GCTGCAAAAC TTTATCCGGC AAACGCAACC ACTAACTCCA GCCACGGCGT TACGTCAGTT
GACGCAATCA TGCCGGTACT TGAGCGCATG GAAAAAATCG GTATGCCGCT ACTGGTGCAT
GGTGAAGTGA CACATGCAGA TATCGACATT TTTGATCGTG AAGCGCGCTT TATAGAAAGC
GTGATGGAAC CGCTACGCCA GCGTCTGACT GCGCTGAAAG TCGTTTTTGA GCACATCACC
ACCAAAGATG CTGCCGACTA TGTCCGTGAC GGAAATGAAC GGCTGGCTGC CACCATCACT
CCGCAGCATC TGATGTTTAA CCGCAACCAT ATGCTGGTTG GTGGCGTGCG TCCGCACCTG
TATTGTCTAC CCATCCTCAA ACGCAATATC CACCAACAGG CATTGCGTGA ACTGGTCGCC
AGCGGTTTTA ATCGTGTATT CCTCGGAACG GATTCTGCGC CACATGCACG TCATCGCAAA
GAGAGCAGCT GCGGCTGCGC GGGCTGTTTC AACGCCCCAA CCGCGCTGGG CAGTTACGCT
ACCGTCTTTG AAGAGATGAA TGCTTTGCAG CACTTTGAAG CATTCTGTTC TGTAAACGGC
CCGCAGTTCT ATGGCTTGCC GGTCAACGAC ACATTCATCG AACTGGTACG TGAAGAGCAA
CAGGTTGCTG AAAGCATCGC ACTGACTGAT GACACCCTGG TGCCATTCCT CGCTGGGGAA
ACGGTACGCT GGTCCGTTAA ACAATAA
 
Protein sequence
MTAPSQVLKI RRPDDWHLHL RDGDMLKTVV PYTSEIYGRA IVMPNLAPPV TTVEAAVAYR 
QRILDAVPAG HDFTPLMTCY LTDSLDPNEL ERGFNEGVFT AAKLYPANAT TNSSHGVTSV
DAIMPVLERM EKIGMPLLVH GEVTHADIDI FDREARFIES VMEPLRQRLT ALKVVFEHIT
TKDAADYVRD GNERLAATIT PQHLMFNRNH MLVGGVRPHL YCLPILKRNI HQQALRELVA
SGFNRVFLGT DSAPHARHRK ESSCGCAGCF NAPTALGSYA TVFEEMNALQ HFEAFCSVNG
PQFYGLPVND TFIELVREEQ QVAESIALTD DTLVPFLAGE TVRWSVKQ
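
The gene sequence above is laid out in blocks of ten bases separated by spaces, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch of one way to do that; the function name `gc_fraction` is hypothetical, and the example uses only the first line of the gene sequence, so its result is not expected to equal the 53% given for the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    # Remove the spaces and line breaks used for display, then normalize case.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence, copied with its display spacing.
first_line = "ATGACTGCAC CATCCCAGGT ATTAAAGATC CGCCGCCCAG ACGACTGGCA CCTTCACCTC"
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```

Passing the full gene sequence, with its spacing and line breaks left in place, would give the value to compare against the GC content field.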