Gene NATL1_02751 details


Gene Information

Locus tag: NATL1_02751
Symbol: pyrD
ID: 4779941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 255182
End bp: 256339
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 36%
IMG OID: 640083540
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_001014104
Protein GI: 124024988
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
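
The Start bp, End bp, Gene Length and Protein Length fields above are arithmetically related. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(255182, 256339)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Both values agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields.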


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCAACAA CTAAAAAAAA CGACTTATAC AACATTCTCC TCGGTCAGCT TCTATCTCAA 
GACGAGGGTA TTGATGCAGA AATACTTACT AATTCAGCCC TTAATGCAAT TAAATTCGCA
TCATTAAATA GAAACTTTCC ATTAATTTCT AATATCCTTT TAAAAGCATC AAATGATTTT
CAAAGAAACA ATTCAAGCTT AAATCAAATT GTTTTTGGCT CTCACTTTAA AAATCCTGTG
GGACTTGCTG CTGGATTTGA CAAGAATGGC GTAGGAGCAG GTCTTTGGAA TTATTTTGGA
TTTGGTTTCG CCGAATTGGG AACTATTACT TGGCATGCCC AAGAAGGCAA TCCAAAGCCT
AGACTTTTCA GAATTGCAAA AGAGAAAGCT GCGCTGAATC GAATGGGATT CAATAACCAA
GGAGCAGAAA ATTTTTTGAA AACAATCGAA AAACAGAAAA TCCTTGCACC AGGGAATAGA
CCTTGTGTCC TAGGAATAAA TTTAGGCAAG TCAAAAATCA CTCCACTCGA TGAAGCCCAT
ATAGACTATT CTTTATCTCT AAAACTACTG GCTCCTTTAT CAGACTATGC AGTAATTAAT
GTTAGTTCAC CTAATACCCC AGGCCTTCGT TCATTACAAG GAACAAAACA AATAAAAAAA
TTAATAATCA CGCTTAAAGA TTTACCCAAT TGTCCTCCTT TGCTTGTAAA AATTGCCCCA
GATCTTTCCA ATGAAGCAAT TGATGAAATT GCAAGAGTTG CGATGGAAAA TGGCATCGAT
GGAATTATTG CAATCAATAC AAGCTTAGAT AGATTTGATT TAAAAAATCT GAAAATCAAA
ACTGGAAATA CTCTAGGACA AGAAAATGGA GGATTAAGTG GTCTACCCTT ACAAAAAAGA
GGACTAGAAG TTATTCGGAG ACTAAGAAGA AGTACTGATA ATGATTTACC TCTGATTGGT
GTGGGTGGAA TTCATTCAGC AAGAGCGGCA TGGGAAAGAA TTACAGCTGG TGCCTCACTG
GTTCAGATTT ATACTGGGTG GATATTTGAG GGACCAAATT TAGTTCCAGA CATACTAGAT
GGATTAATCC AGCAAATGGA AAAACATGGA TTCCGAAATA TTAAAGAGGC CATAGGTTCC
GAAGAACCAT GGAAGTAA
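
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The following is a rough sketch of how the blocks could be cleaned and a GC percentage computed for comparison with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "TTGCCAACAA CTAAAAAAAA CGACTTATAC AACATTCTCC TCGGTCAGCT TCTATCTCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 20-line block to clean_sequence should give a string whose length can be compared with the Gene Length field.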
 
Protein sequence
MPTTKKNDLY NILLGQLLSQ DEGIDAEILT NSALNAIKFA SLNRNFPLIS NILLKASNDF 
QRNNSSLNQI VFGSHFKNPV GLAAGFDKNG VGAGLWNYFG FGFAELGTIT WHAQEGNPKP
RLFRIAKEKA ALNRMGFNNQ GAENFLKTIE KQKILAPGNR PCVLGINLGK SKITPLDEAH
IDYSLSLKLL APLSDYAVIN VSSPNTPGLR SLQGTKQIKK LIITLKDLPN CPPLLVKIAP
DLSNEAIDEI ARVAMENGID GIIAINTSLD RFDLKNLKIK TGNTLGQENG GLSGLPLQKR
GLEVIRRLRR STDNDLPLIG VGGIHSARAA WERITAGASL VQIYTGWIFE GPNLVPDILD
GLIQQMEKHG FRNIKEAIGS EEPWK