Gene SAG1045 details


Gene Information

Locus tag: SAG1045
Symbol: pyrC
ID: 1013849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1054737
End bp: 1056029
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 38%
IMG OID: 637316228
Product: dihydroorotase
Protein accession: NP_688055
Protein GI: 22537204
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
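
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Coordinates copied from the record (assumed 1-based and inclusive).
start_bp = 1054737
end_bp = 1056029


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1293
print(protein_length(length_bp))  # 430
```

Both values agree with the Gene Length and Protein Length fields listed in the record.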


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.197397
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATATCA TTAAAAACGG ATTGATTATC GACCCTCAAA GTGGTTTTAA TCAAGTATCA 
GACATGCTGA TTGATCAAGG AAAAATTAAA CAAATATCGA AAGAAATTGA TATTAAAGGC
ATTCCAATCA TTGATGCAAG CAATAAAATT GTAGCTCCAG GATTAGTCGA TATTCATGTA
CATTTTCGTG AGCCTGGACA GACACATAAA GAAAATATCC ATACAGGGGC CCTATCAGCT
GCAGTAGGTG GTTTTACGAC AGTTCTTATG ATGGCCAATA CTAACCCAAC GATTTCTAGC
CCAGAAATTG TAAAACAAGT TAAGGAGAGT GCAGCAAAAG AAGCAATTAA GATTGAAACA
GTTGCTACAA TTACTAAATC TCTGAATGGT AAAGATTTAG TTAATTTCGA AGAACTATTA
GAGGCTGGAG TTGCTGGATT CTCAGACGAT GGTATTCCTT TGACAGATAC TAAGGTTTTG
CAAGAAGCTA TGAACTTAGC TAGAAAACAT GATGTTGTCC TATCGTTACA TGAAGAAGAT
CCATCGTTAA ATGGTGTCCT TGGTATCAAC GAACATATTG CTCAGAAAAT TTATCATGTC
TGTGGAGCTA GTGGTCTTGC TGAATATTCA ATGATTGCGC GAGATGCTAT GATTGCTTAC
CAAACACAGG CGAAAGTCCA TATCCAGCAT TTGTCAAGTT CAGAATCAGT AGAAGTAGTT
GATTTTGCGC AGAAGCTTGG TGCTAACTTA ACAGCAGAAG TAACCCCCCA ACATTTTTCA
AAGACAGAGA ACTTACTTTT AACTAAGGGT GCTAATGCAA AATTAAACCC ACCACTGCGA
CTTGAAAAGG ATAGACAAGC CCTCATTGAT GGACTTAAAA GTGGTGTTAT TTCAATTATT
GCAAGTGACC ACGCTCCCCA TCATATTATG GAGAAAGCAG CAGATAATAT TAGTCAAGCG
CCCTCTGGTA TGACTGGCTT AGAGACATCA TTAGCCTTAG GGATTACTTA TTTAGTATCT
ACTAAAGAAC TCTCAATGAT TGACTTTTTG GCAAAAATGA CATGTAATCC TGCACAGTTA
TATGGATTTG ATGCAGGTTA TCTCAGAGAA GGTGGACCTG CAGATATCGT TATTTTTGAT
CAAGCTGAAG AGAGAATAAT AAAGGCAGAA TTTGCCTCCA AATCCAGCAA CTCTCCTTTT
ATCGGTGATA AATTGAAGGG AGTTATTCAC TACACAATTT GCAATGGTGA AATAGTCTAT
CAAAAAGATA GCCACGCAGC GACATTGGTA TAG
 
Protein sequence
MYIIKNGLII DPQSGFNQVS DMLIDQGKIK QISKEIDIKG IPIIDASNKI VAPGLVDIHV 
HFREPGQTHK ENIHTGALSA AVGGFTTVLM MANTNPTISS PEIVKQVKES AAKEAIKIET
VATITKSLNG KDLVNFEELL EAGVAGFSDD GIPLTDTKVL QEAMNLARKH DVVLSLHEED
PSLNGVLGIN EHIAQKIYHV CGASGLAEYS MIARDAMIAY QTQAKVHIQH LSSSESVEVV
DFAQKLGANL TAEVTPQHFS KTENLLLTKG ANAKLNPPLR LEKDRQALID GLKSGVISII
ASDHAPHHIM EKAADNISQA PSGMTGLETS LALGITYLVS TKELSMIDFL AKMTCNPAQL
YGFDAGYLRE GGPADIVIFD QAEERIIKAE FASKSSNSPF IGDKLKGVIH YTICNGEIVY
QKDSHAATLV
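
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup and of a GC-percentage calculation that could be compared against the GC content field. It is not part of the record, and the helper names `clean_sequence` and `gc_percent` are hypothetical. Only the first and last lines of the gene sequence are used here as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTATATCA TTAAAAACGG ATTGATTATC GACCCTCAAA GTGGTTTTAA TCAAGTATCA "
last_line = "CAAAAAGATA GCCACGCAGC GACATTGGTA TAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)
print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 33 TAG
```

Passing the full cleaned gene sequence to `gc_percent` would give a value to compare with the rounded figure in the record.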