Gene Pnec_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_1080 
Symbol 
ID6183263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp937003 
End bp938037 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content48% 
IMG OID641671691 
ProductDihydroorotate oxidase 
Protein accessionYP_001797868 
Protein GI171463755 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.106041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.333003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATC GTTACTCCCT CCTGCGCCCT TGGCTTTTTT GCATAGACCC TGAAAAAGCC 
CACAACCTTA CCCTAAGTAA TTTAGATCGC GCACAGCGTT GGGGATTTTT GGAACGCTTG
ATTACCAAAC CGATTAACGA TCCTCAAGTA TTGTGTGGGA TTGAGTTTTC CAACCCTGTT
GGTCTAGCCG CTGGATTAGA CAAAGATGGC AAGTATATCG ATGCACTGGC TGCATTAGGA
TTTGGATTTT TAGAAATCGG CACCGTTACA CCCCGACCAC AACCTGGCAA TCCCAAGCCA
CGAATGTTTC GACTCCCGGA AGCACAAGCC ATCATTAATC GTATGGGCTT CAATAACGAT
GGTGTTGAGG CCTGCGTAGC AAGAGTACGC TGTTCAAAAT TTTGGCAAAA CGGCGGCGTT
CTTGGGATGA ATATTGGCAA AAATGCCAGC ACACCAATTG AAGAGGCGTC GCGCGATTAC
ATCTTGGCTA TGGAAGCTGT TTACGAAATT GCTACTTACA TTACCATCAA TATCTCTTCC
CCTAATACTC AAAATCTACG CGCACTCCAG GGCGAAGAAA TGCTCCGCGA ATTACTCGGC
AGCTTAGGTG AAGCCAGAAA ACATTTATGC GATCGTCATG GCGTACGAAA ACCACTATTC
CTGAAAATTG CACCAAACTT AGATCAGGGC GATATCAATC TCATTGCCGA CCTCCTACTT
GAGTTTGGCA TCGATGCAGT TATTGCCACC AACACAACTA TCTCCCGCGA TGCAGTCAAG
GGAATGGAAT TTGGCGAAGA AGCTGGCGGC CTATCTGGCG CACCTGTTCG CAATGCCTCG
AATATCGTCA TCAAAGCTTT GAAAGCAAGG CTTGGCAATC AACTACCGAT CATCGGCGTT
GGCGGCATCA TGTCTGGAGT TGATGCACGA GAAAAGATCA TGGCTGGTGC TAGCCTGGTC
CAACTCTATA GCGGCCTGAT CTATCGCGGC CCAGACTTGG TCTACAAGTG CGCTACCGTC
CTAAGGCAAC CCTAA
 
Protein sequence
MIDRYSLLRP WLFCIDPEKA HNLTLSNLDR AQRWGFLERL ITKPINDPQV LCGIEFSNPV 
GLAAGLDKDG KYIDALAALG FGFLEIGTVT PRPQPGNPKP RMFRLPEAQA IINRMGFNND
GVEACVARVR CSKFWQNGGV LGMNIGKNAS TPIEEASRDY ILAMEAVYEI ATYITINISS
PNTQNLRALQ GEEMLRELLG SLGEARKHLC DRHGVRKPLF LKIAPNLDQG DINLIADLLL
EFGIDAVIAT NTTISRDAVK GMEFGEEAGG LSGAPVRNAS NIVIKALKAR LGNQLPIIGV
GGIMSGVDAR EKIMAGASLV QLYSGLIYRG PDLVYKCATV LRQP