Gene SbBS512_E2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2372 
SymbolpyrD 
ID6268867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2163515 
End bp2164525 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content47% 
IMG OID641726376 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001880858 
Protein GI187732395 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000164605 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACTACC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCAGAGCG CGCTCATGAG 
TTTACTTTTC AGCAATTACG CCGTATTACA GGAACGCCGT TTGAAGCACT GGTGCGGCAG
AAAGTGCCTG CGAAACCTGT TAACTGCATG GGCCTGACGT TTAAAAATCC GCTTGGTCTG
GCAGCCGGTC TTGATAAAGA CGGGGAGTGC ATTGACGCGT TAGGCGCGAT GGGATTTGGA
TCGATCGAGA TCGGTACCGT CACGCCACGT CCACAGCCAG GTAATGACAA GCCGCGTCTC
TTTCGTCTGG TAGATGCCGA AGGTTTGATC AACCGTATGG GCTTTAATAA TCTTGGCGTT
GATAACCTCG TAGAGAACGT AAAAAAGGCC CATTATGACG GCGTCCTGGG TATTAACATC
GGCAAAAATA AAGATACGCC AGTGGAGCAG GGCAAAGATG ACTATCTGAT TTGTATGGAA
AAAATCTATG CCTATGCGGG ATATATCGCC ATCAATATTT CATCGCCGAA TACCCCAGGA
TTACGCACGC TGCAATATGG TGAAGCGCTG GATGATCTCT TAACCGCGAT TAAAAATAAG
CAAAATGATT TGCAAGCGAT GCACCATAAA TATGTGCCGA TCGCAGTGAA GATCGCGCCG
GATCTTTCTG AAGAAGAATT GATCCAGGTT GCCGATAGTT TAGTTCGCCA TAATATTGAT
GGCGTTATTG CAACCAATAC CACACTCGAT CGTTCTCTTG TTCAGGGAAT GAAAAATTGC
GATCAAACCG GTGGCTTAAG TGGTCGTCCG CTTCAGTTAA AAAGCACCGA AATTATTCGC
CGCTTGTCAC TGGAATTAAA CGGTCGCTTA CCGATCATCG GTGTTGGCGG CATCGACTCG
GTTATCGCTG CGCGTGAAAA GATTGCTGCG GGTGCCTCAC TGGTGCAAAT TTATTCTGGT
TTTATTTTTA AAGGTCCGCC GCTGATTAAA GAAATCGTTA CCCATATCTA A
 
Protein sequence
MYYPFVRKAL FQLDPERAHE FTFQQLRRIT GTPFEALVRQ KVPAKPVNCM GLTFKNPLGL 
AAGLDKDGEC IDALGAMGFG SIEIGTVTPR PQPGNDKPRL FRLVDAEGLI NRMGFNNLGV
DNLVENVKKA HYDGVLGINI GKNKDTPVEQ GKDDYLICME KIYAYAGYIA INISSPNTPG
LRTLQYGEAL DDLLTAIKNK QNDLQAMHHK YVPIAVKIAP DLSEEELIQV ADSLVRHNID
GVIATNTTLD RSLVQGMKNC DQTGGLSGRP LQLKSTEIIR RLSLELNGRL PIIGVGGIDS
VIAAREKIAA GASLVQIYSG FIFKGPPLIK EIVTHI