Gene STER_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_1040 
SymbolpyrC 
ID4438611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp965169 
End bp966437 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content42% 
IMG OID639676687 
Productdihydroorotase 
Protein accessionYP_820441 
Protein GI116627822 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACTGA TTAAAAATGG TCGTGTTGTT GACCCTAAAT CTGGTTTGGA CATGCAAGCC 
GATGTTCTTG TGGACGGAAA AAAAGTCGTT AAAATTGCTG AAAATATCGA TGCGGGAGAT
GCCCAAGTTA TCGATGCGAC TGGTCTTGTG GTTGCTCCTG GTTTGGTGGA TATCCATGTT
CACTTCCGTG AGCCAGGTCA AACCCATAAG GAAGACATTC ATACGGGTGC CTTGGCAGCG
GCTGCAGGTG GTTTTACAAC AGTTGTGATG ATGGCTAATA CGAATCCAAC GATTTCAGAC
AAGGAAACTT TGAAAGAGGT CTTGACTTCA GCAGCTAAGG AAAATATCCA TGTTAAGTCT
GTTGCGACTA TTACAAAGAA CTTTGATGGT GAAAATATTA CTGATTTCAA GGGTTTGCTT
GAAGCAGGTG CTGTTGGATT CTCAGATGAC GGTATTCCAT TGACCAATGC TGGGATTGTC
AAAAAAGCCA TGGAGTTAGC TAAAGAGAAT AATACCTTTA TCAGTCTTCA CGAGGAGGAT
CCTGATCTTA ATGGTGTTCT CGGTTTCAAT GAAAATATTG CTAAAAAAGA ATTTCATATT
TGTGGGGCAA CTGGCGTAGC TGAGTACAGC ATGATTGCGC GTGATGTCAT GGTTGCTTAT
GATACACAAG CACATGTTCA TATTCAACAC TTGTCAAAAG CTGAATCTGT AAAAGTCGTT
GAGTTTGCTC AAAAACTTGG AGCACAAGTC ACTGCTGAAG TAGCGCCGCA GCACTTCTCA
AAAACTGAAG ACCTCTTACT CTCAAAAGGC GCTAATGCCA AGATGAACCC ACCACTTCGT
TTGGAATCAG ACCGTCAAGC CGTTATCGAA GGTTTGAAAT CTGGAGTAAT CTCAGTCATT
GCTACGGACC ACGCGCCACA CCACGCAGAT GAAAAGAATG TGGCTGATGT GACTAAAGCA
CCATCAGGGA TGACTGGTCT GGAAACCTCT CTATCTCTTG GTTTAACTTA TTTAGTTGAA
GCAGGACATT TAAGTTTGAC AGAATTATTG AAATTAATGA CAAGCAACCC ATCTGATCTT
TATGGTTTCG ATGCCGGTTA TTTGGCTGAA AATGGACCAG CAGACCTTGT TATCTTTGCA
GATAAGGAAA AACGTCAGGT TACAGCAGAC TTTAAGTCTA AAGCAGCCAA TTCACCATTT
GTAGGCGAAG AGCTTACTGG TAGTGTTAAA TACACGATCT GTGATGGTGA GATTGTTTAT
CAAGTCTAG
 
Protein sequence
MLLIKNGRVV DPKSGLDMQA DVLVDGKKVV KIAENIDAGD AQVIDATGLV VAPGLVDIHV 
HFREPGQTHK EDIHTGALAA AAGGFTTVVM MANTNPTISD KETLKEVLTS AAKENIHVKS
VATITKNFDG ENITDFKGLL EAGAVGFSDD GIPLTNAGIV KKAMELAKEN NTFISLHEED
PDLNGVLGFN ENIAKKEFHI CGATGVAEYS MIARDVMVAY DTQAHVHIQH LSKAESVKVV
EFAQKLGAQV TAEVAPQHFS KTEDLLLSKG ANAKMNPPLR LESDRQAVIE GLKSGVISVI
ATDHAPHHAD EKNVADVTKA PSGMTGLETS LSLGLTYLVE AGHLSLTELL KLMTSNPSDL
YGFDAGYLAE NGPADLVIFA DKEKRQVTAD FKSKAANSPF VGEELTGSVK YTICDGEIVY
QV