Gene Msed_1961 details


Gene Information

Locus tag: Msed_1961
Symbol: pyrC
ID: 5103348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1900141
End bp: 1901292
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 51%
IMG OID: 640507849
Product: dihydroorotase
Protein accession: YP_001192025
Protein GI: 146304709
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
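
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based, inclusive coordinates and that the gene length includes the stop codon. Neither assumption is stated in the record itself, and the variable names are made up for the example.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 1900141
end_bp = 1901292
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("codons - 1 matches Protein Length:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree with one another; a mismatch would suggest a different coordinate convention or a partial CDS.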


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.608866
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTGGATTA AGGGAAAGCT TTTCCTAGGA GAGATAGTGG AGGGTTGCGT GGGATTTGAT 
AGGAGGATAA GGGAGACCAG AAGGGAATGT AAGCCTGACC TGAAGCTACC TGAGGACTCC
CTCATATTTC CTGCTGGAGT TGACATGCAC GTTCACCTTA GGGGACTTCA GCTTTCTTAC
AAGGAAACCG TGGCCTCTGC CACCTCTGAG GCAGTGTACG GTGGCATAGG GGTTGTGGTA
GATATGCCCA ACACCTCTCC GGTGATCAAC AGCGAGGAAA CTATCAAGCT AAGGCTCGCT
GAACTGGCGA ATCATTCGAG GTGCGATTAC GGTATTTACT CGGGGGTAAC CAAGGAGAAC
GTGGATAACA TGCCCATAGC AGGGTACAAG GTATTCCCTG AGGATCTCGA GAGGGAAGAG
GTGGAAAGGG TATTCTCTTC ACCCAAGCTC AAGATTCTTC ATCCCGAAAT CCCAATGTCA
CTACGTCCAG GTAGGGGTAA CAGGCGTCTT TGGCAAGAGA TAGCCTCCCT CTTCCTCATC
AGGGGAAAGT TTCACATAAC ACATGTGAGT AACCTTGAGA CCCTGAGGAT TGCAAGGAGC
TTGGGCTACA CAACAGACCT AACCATGCAT CACCTTCTCG TTGACGGGGA GAGGAATTGC
CTTTCGAAGG TTAACCCACC CATCAGGGAT ATCACAGAGA GAAGGAAATT GCTCTCAGCC
CTATTCGAAG CAGATGCAGT CGCAAGCGAT CATGCTCCAC ACTCGAGCTG GGAAAAGGGT
TTACCATTTG AGGTATGTCC GCCTGGTATC CCGGCAATGT CCTTCACTCT CCCCTTCATT
TACACCCTCG CGTTTAGGGG AGTGCTTCCC ATCTCAAGGG CCGTGGAGTT AACGGCAACT
GGGCCAAGCA AAATCTTGGG GATCAAGGCC GGTGAGATAA GGGAGGGTTA CCTGGCCAAC
TTCGTCATCC TTAGAAAGGA TAGGTGGAGA TACTCCACCA GGTATAGTAA GGCCATACAC
ACTCCGCTGG ACGGTTTCGC CCTAGACGCA ACCGTGTATG GAACAATTGT AGAAGGAAAG
GTAGCTTATC TAGAGGGACA TTCCTATCCT GTGAGGGGAT CCAATGTATT CGACGAGACT
GGCAGGAGTT GA
 
Protein sequence
MWIKGKLFLG EIVEGCVGFD RRIRETRREC KPDLKLPEDS LIFPAGVDMH VHLRGLQLSY 
KETVASATSE AVYGGIGVVV DMPNTSPVIN SEETIKLRLA ELANHSRCDY GIYSGVTKEN
VDNMPIAGYK VFPEDLEREE VERVFSSPKL KILHPEIPMS LRPGRGNRRL WQEIASLFLI
RGKFHITHVS NLETLRIARS LGYTTDLTMH HLLVDGERNC LSKVNPPIRD ITERRKLLSA
LFEADAVASD HAPHSSWEKG LPFEVCPPGI PAMSFTLPFI YTLAFRGVLP ISRAVELTAT
GPSKILGIKA GEIREGYLAN FVILRKDRWR YSTRYSKAIH TPLDGFALDA TVYGTIVEGK
VAYLEGHSYP VRGSNVFDET GRS
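
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up, using only the Python standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling, and how the record's GC content value was computed or rounded is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and uppercasing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First and last lines of the gene sequence block, as printed in the record.
first_line = "TTGTGGATTA AGGGAAAGCT TTTCCTAGGA GAGATAGTGG AGGGTTGCGT GGGATTTGAT "
last_line = "GGCAGGAGTT GA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))   # number of bases on the first line
print(tail[-3:])   # final codon of the gene sequence
print(round(gc_fraction(head), 2))
```

Passing the full gene sequence block through `clean_sequence` and then `len` or `gc_fraction` gives values that can be compared against the Gene Length and GC content fields.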