Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_1040 |
Symbol | pyrC |
ID | 4438611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | - |
Start bp | 965169 |
End bp | 966437 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 639676687 |
Product | dihydroorotase |
Protein accession | YP_820441 |
Protein GI | 116627822 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACTGA TTAAAAATGG TCGTGTTGTT GACCCTAAAT CTGGTTTGGA CATGCAAGCC GATGTTCTTG TGGACGGAAA AAAAGTCGTT AAAATTGCTG AAAATATCGA TGCGGGAGAT GCCCAAGTTA TCGATGCGAC TGGTCTTGTG GTTGCTCCTG GTTTGGTGGA TATCCATGTT CACTTCCGTG AGCCAGGTCA AACCCATAAG GAAGACATTC ATACGGGTGC CTTGGCAGCG GCTGCAGGTG GTTTTACAAC AGTTGTGATG ATGGCTAATA CGAATCCAAC GATTTCAGAC AAGGAAACTT TGAAAGAGGT CTTGACTTCA GCAGCTAAGG AAAATATCCA TGTTAAGTCT GTTGCGACTA TTACAAAGAA CTTTGATGGT GAAAATATTA CTGATTTCAA GGGTTTGCTT GAAGCAGGTG CTGTTGGATT CTCAGATGAC GGTATTCCAT TGACCAATGC TGGGATTGTC AAAAAAGCCA TGGAGTTAGC TAAAGAGAAT AATACCTTTA TCAGTCTTCA CGAGGAGGAT CCTGATCTTA ATGGTGTTCT CGGTTTCAAT GAAAATATTG CTAAAAAAGA ATTTCATATT TGTGGGGCAA CTGGCGTAGC TGAGTACAGC ATGATTGCGC GTGATGTCAT GGTTGCTTAT GATACACAAG CACATGTTCA TATTCAACAC TTGTCAAAAG CTGAATCTGT AAAAGTCGTT GAGTTTGCTC AAAAACTTGG AGCACAAGTC ACTGCTGAAG TAGCGCCGCA GCACTTCTCA AAAACTGAAG ACCTCTTACT CTCAAAAGGC GCTAATGCCA AGATGAACCC ACCACTTCGT TTGGAATCAG ACCGTCAAGC CGTTATCGAA GGTTTGAAAT CTGGAGTAAT CTCAGTCATT GCTACGGACC ACGCGCCACA CCACGCAGAT GAAAAGAATG TGGCTGATGT GACTAAAGCA CCATCAGGGA TGACTGGTCT GGAAACCTCT CTATCTCTTG GTTTAACTTA TTTAGTTGAA GCAGGACATT TAAGTTTGAC AGAATTATTG AAATTAATGA CAAGCAACCC ATCTGATCTT TATGGTTTCG ATGCCGGTTA TTTGGCTGAA AATGGACCAG CAGACCTTGT TATCTTTGCA GATAAGGAAA AACGTCAGGT TACAGCAGAC TTTAAGTCTA AAGCAGCCAA TTCACCATTT GTAGGCGAAG AGCTTACTGG TAGTGTTAAA TACACGATCT GTGATGGTGA GATTGTTTAT CAAGTCTAG
|
Protein sequence | MLLIKNGRVV DPKSGLDMQA DVLVDGKKVV KIAENIDAGD AQVIDATGLV VAPGLVDIHV HFREPGQTHK EDIHTGALAA AAGGFTTVVM MANTNPTISD KETLKEVLTS AAKENIHVKS VATITKNFDG ENITDFKGLL EAGAVGFSDD GIPLTNAGIV KKAMELAKEN NTFISLHEED PDLNGVLGFN ENIAKKEFHI CGATGVAEYS MIARDVMVAY DTQAHVHIQH LSKAESVKVV EFAQKLGAQV TAEVAPQHFS KTEDLLLSKG ANAKMNPPLR LESDRQAVIE GLKSGVISVI ATDHAPHHAD EKNVADVTKA PSGMTGLETS LSLGLTYLVE AGHLSLTELL KLMTSNPSDL YGFDAGYLAE NGPADLVIFA DKEKRQVTAD FKSKAANSPF VGEELTGSVK YTICDGEIVY QV
|
| |