Gene SbBS512_E2265 details

Gene Information

Locus tag: SbBS512_E2265
Symbol: pyrC
ID: 6271911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2060577
End bp: 2061623
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 52%
IMG OID: 641726282
Product: dihydroorotase
Protein accession: YP_001880766
Protein GI: 187730925
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0418] Dihydroorotase
TIGRFAM ID: [TIGR00856] dihydroorotase, homodimeric type
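
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2060577
END_BP = 2061623
GENE_LENGTH_BP = 1047
PROTEIN_LENGTH_AA = 348


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the listed gene length (1047 bp) matches the coordinate span, and 349 codons minus the stop codon give the listed 348 residues.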


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCAC CATCCCAGGT ATTAAAGATC CGCCGCCCAG ACGACTGGCA CCTTCACCTC 
CGCGATGGCG ACATGTTAAA AACTGTCGTG CCGTATACCA GCGAAATTTA TGGACGGGCT
ATTGTAATGC CCAATCTGGC TCCGCCCGTG ACCACTGTTG AGGCTGCCGT GGCGTATCGC
CAGCGTATTC TTGATGCCGT ACCTGCCGGG CACAATTTCA CCCCATTGAT GACCTGTTAT
TTAACAGATT CGCTGGATCC TAATGAGCTG GAGCGCGGAT TTAACGAAGG CGTGTTCACC
GCTGCAAAAC TTTATCCAGC AAACGCAACC ACTAACTCCA GCCACGGCGT GACGTCAATT
GACGCAATCA TGCCGGTACT TGAGCGCATG GAAAAAATCG GTATGCCGCT ACTGGTGCAT
GGTGAAGTGA CACATGCAGA TATCGACATT TTTGATCGTG AAGCGCACTT TATAGAAAGC
GTGATGGAAC CTCTGCGCCA GCGCCTGACT GCGCTGAAAG TCGTTTTTGA GCACATCACC
ACCAAAGATG CTGCCGACTA TGTCCATGAC GGAAATGAAC GGCTGGCTGC CACCATCACT
CCGCAGCATC TGATGTTTAA CCGCAACCAT ATGCTGGTTG GTGGCGTGCG TCCGCACCTG
TATTGTTTAC CCATCCTCAA ACGCAATATT CACCAACAGG CATTGCGTGA ACTGGTCGCC
AGCGGTTTTA ATCGAGTATT CCTCGGTACG GATTCTGCGC CACATGCACG TCATCGCAAA
GAGAGCAGTT GTGGCTGCGC GGGCTGCTTC AACGCCCCAA CCGCGCTGGG CAGTTACGCT
ACCGTCTTTG AAGAGATGAA TGCTCTGCAG CACTTTGAAG CATTCTGTTC TGTAAACGGC
CCGCAGTTCT ATGGCTTGCC GGTCAACGAT ACATTCATCG AACTGGTACG TGAAGAGCAA
CAGGTTGCTG AAAGCATCGC ACTGACTGAT GACACCCTGG TGCCATTCCT CGCCGGGGAA
ACGGTACGCT GGTCCGTTAA ACAATAA
 
Protein sequence
MTAPSQVLKI RRPDDWHLHL RDGDMLKTVV PYTSEIYGRA IVMPNLAPPV TTVEAAVAYR 
QRILDAVPAG HNFTPLMTCY LTDSLDPNEL ERGFNEGVFT AAKLYPANAT TNSSHGVTSI
DAIMPVLERM EKIGMPLLVH GEVTHADIDI FDREAHFIES VMEPLRQRLT ALKVVFEHIT
TKDAADYVHD GNERLAATIT PQHLMFNRNH MLVGGVRPHL YCLPILKRNI HQQALRELVA
SGFNRVFLGT DSAPHARHRK ESSCGCAGCF NAPTALGSYA TVFEEMNALQ HFEAFCSVNG
PQFYGLPVND TFIELVREEQ QVAESIALTD DTLVPFLAGE TVRWSVKQ
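
The sequences above are printed in groups of ten characters, so the spaces and line breaks need to be removed before any computation. The sketch below is an illustrative assumption rather than anything the record provides: it joins a grouped block into one string and computes the fraction of G and C bases. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Last printed line of the gene sequence, copied from the record above.
last_line = clean_sequence("ACGGTACGCT GGTCCGTTAA ACAATAA")
print(len(last_line))              # 27
print(last_line.endswith("TAA"))   # True
```

Passing the full gene sequence block through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content listed in the record.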