Gene A9601_02171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02171 
SymbolpyrD 
ID4716901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp202905 
End bp204074 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content30% 
IMG OID640077916 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001008612 
Protein GI123967754 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAC AGAAGGGTGG ATTTAAAAAT CTTTATAAAA ACTTGATTAC CCCTGTATTA 
CAAAAAGACT CTGGAATTGA TGCAGAATAC TTAACAAATT TATCTCTTAG TCTCCTATCA
TTCAGTTCAA GAAAATATAA TTGGCCTATA GTATCTTCTA TCTTAAAAAA TCTAAATGAA
GAATTTTCTG TAGTTGATAA AAGGTTAACT CAGAAGATAT GTGGAATAAA TTTTTGTAAT
CCAATTGGTT TAGCTGCGGG TTTTGACAAA AATGGAAATG CCGCAAATAT ATGGAAAGAT
TTTGGTTTTG GATTTGCTGA GCTTGGAACA GTAACTAAAT TTGCTCAGGA TGGAAATCCC
AAACCAAGGT TATTTAGATT GGCAGAAGAA GAAGCAGCAT TAAATAGAAT GGGTTTCAAT
AATAATGGTG CTGAAAATCT AGTTAAAAAC TTTGTCGAAC AAGGTATTGA GTTTAAAAAA
AACAGGGAGA ATATTTGTTT AGGGATAAAT TTCGGGAAGT CAAAAATTAC AGGTTTATCT
CAAGCAAAAG ATGATTATTT AACTTCTCTA AAATTATTAA TTCCATATTG TGATTACGCA
GCAATAAACG TTAGTTCTCC AAATACTGAA GGACTAAGAA AATTACAAGA TCCAATTCTT
CTAAAAGAAC TTCTTAGAGA AATTAAAAAC TTACCTAATT GTCCACCATT ATTTGTAAAA
ATTGCACCAG ATTTAGGCCT TAAAGATATT GAAGATATTT GCAAATTAAT AATCGAGGAG
AACATCGATG GAATAATTGC TACAAATACC AGCATTGATA GATTAGGTCT TGAAAATAGA
AAAATCAAGC AAACTGGATT ATTACTCTCT CAAGAGAATG GAGGATTAAG TGGAAAGCCG
CTCCAAAAAA AAGCAAATCA AATAATAAAA CATATACATA ATATTGATAA AAAGATTATT
TTAATTGGGG TTGGTGGAAT AGATAGTCCT GAGTCAGCTT GGGAAAGAAT TTGTTCTGGA
GCATCATTAA TTCAACTTTA TACAGGATGG ATATATAAGG GTCCACAATT AGTACCAGAT
ATACTTGAGG GAATTATAAA GCAACTCAAT AACCATCAAT TATCTAGTAT AAAAGATGCA
ATTGGATCAG ATTTAAAATG GATTGAATAA
 
Protein sequence
MNEQKGGFKN LYKNLITPVL QKDSGIDAEY LTNLSLSLLS FSSRKYNWPI VSSILKNLNE 
EFSVVDKRLT QKICGINFCN PIGLAAGFDK NGNAANIWKD FGFGFAELGT VTKFAQDGNP
KPRLFRLAEE EAALNRMGFN NNGAENLVKN FVEQGIEFKK NRENICLGIN FGKSKITGLS
QAKDDYLTSL KLLIPYCDYA AINVSSPNTE GLRKLQDPIL LKELLREIKN LPNCPPLFVK
IAPDLGLKDI EDICKLIIEE NIDGIIATNT SIDRLGLENR KIKQTGLLLS QENGGLSGKP
LQKKANQIIK HIHNIDKKII LIGVGGIDSP ESAWERICSG ASLIQLYTGW IYKGPQLVPD
ILEGIIKQLN NHQLSSIKDA IGSDLKWIE