Gene Information |
Locus tag | EcSMS35_2067 |
Symbol | pyrC |
ID | 6147117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2084687 |
End bp | 2085733 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616943 |
Product | dihydroorotase |
Protein accession | YP_001744119 |
Protein GI | 170680153 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0418] Dihydroorotase |
TIGRFAM ID | [TIGR00856] dihydroorotase, homodimeric type |
|
|
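The coordinate and length fields in the Gene Information block are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2084687
END_BP = 2085733


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1047 348
```

Both results match the Gene Length (1047 bp) and Protein Length (348 aa) fields.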
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.863228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.103848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCAC CATCCCAGGT ATTAAAGATC CGCCGCCCAG ACGACTGGCA CCTTCACCTC CGCGATGGCG ACATGTTAAA AACTGTCGTG CCGTATACCA GCGAAATTTA TGGACGGGCT ATCGTAATGC CCAATCTGGC TCCGCCCGTG ACCACCGTTG AGGCTGCCGT GGCGTATCGC CAGCGCATTC TTGACGCCGT ACCTGCCGGG CACGATTTCA CCCCGCTGAT GACCTGTTAT TTAACAGATT CGCTGGATCC TAATGAGCTG GAGCGCGGAT TTAACGAAGG CGTGTTCACC GCTGCAAAAC TTTATCCGGC AAACGCAACC ACTAACTCCA GCCACGGCGT TACGTCAGTT GACGCAATCA TGCCGGTACT TGAGCGCATG GAAAAAATCG GTATGCCGCT ACTGGTGCAT GGTGAAGTGA CACATGCAGA TATCGACATT TTTGATCGTG AAGCGCGCTT TATAGAAAGC GTGATGGAAC CGCTACGCCA GCGTCTGACT GCGCTGAAAG TCGTTTTTGA GCACATCACC ACCAAAGATG CTGCCGACTA TGTCCGTGAC GGAAATGAAC GGCTGGCTGC CACCATCACT CCGCAGCATC TGATGTTTAA CCGCAACCAT ATGCTGGTTG GTGGCGTGCG TCCGCACCTG TATTGTCTAC CCATCCTCAA ACGCAATATC CACCAACAGG CATTGCGTGA ACTGGTCGCC AGCGGTTTTA ATCGTGTATT CCTCGGAACG GATTCTGCGC CACATGCACG TCATCGCAAA GAGAGCAGCT GCGGCTGCGC GGGCTGTTTC AACGCCCCAA CCGCGCTGGG CAGTTACGCT ACCGTCTTTG AAGAGATGAA TGCTTTGCAG CACTTTGAAG CATTCTGTTC TGTAAACGGC CCGCAGTTCT ATGGCTTGCC GGTCAACGAC ACATTCATCG AACTGGTACG TGAAGAGCAA CAGGTTGCTG AAAGCATCGC ACTGACTGAT GACACCCTGG TGCCATTCCT CGCTGGGGAA ACGGTACGCT GGTCCGTTAA ACAATAA
|
Protein sequence | MTAPSQVLKI RRPDDWHLHL RDGDMLKTVV PYTSEIYGRA IVMPNLAPPV TTVEAAVAYR QRILDAVPAG HDFTPLMTCY LTDSLDPNEL ERGFNEGVFT AAKLYPANAT TNSSHGVTSV DAIMPVLERM EKIGMPLLVH GEVTHADIDI FDREARFIES VMEPLRQRLT ALKVVFEHIT TKDAADYVRD GNERLAATIT PQHLMFNRNH MLVGGVRPHL YCLPILKRNI HQQALRELVA SGFNRVFLGT DSAPHARHRK ESSCGCAGCF NAPTALGSYA TVFEEMNALQ HFEAFCSVNG PQFYGLPVND TFIELVREEQ QVAESIALTD DTLVPFLAGE TVRWSVKQ
|
| |