Gene GWCH70_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1045 
SymbolpyrC 
ID7979186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1095178 
End bp1096461 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content51% 
IMG OID644797998 
Productdihydroorotase 
Protein accessionYP_002949171 
Protein GI239826547 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATTA TTTTGAAAAA TGGCAAGTCG TTCAATAAAG ATGGTGTGAT CGAACGGACG 
GAACTAAAAA TCGAAAATGG ATTTATTACC GCCATCGGCT CCAAGCTTCA CAGTGAAGAA
GCAGACGAAG TTATCGATGT ACAAGGGAAG TTGATATCAG CCGGATTTAT CGATTTGCAT
GTTCACCTGC GCGAACCGGG CGGCGAAGCG AAAGAAACGA TTGCCACCGG AACGCTGGCA
GCAGCAAAAG GTGGTTTTAC CACAGTGGCG GCAATGCCGA ATACGCGACC AGTGCCGGAT
ACGAAAGAAC AAATGGAATG GCTTTGCAAG CGGATCCGCG AAACGGCTTA TGTCCATGTG
CTTCCATATG CGGCCATTAC GGTCGGCCAG CAAGGAACAG AGCTGACCGA CTTCGCCGCA
TTAAAAGAAG CGGGTGCGTT CGCGTTTACC GATGACGGGG TAGGCGTGCA GTCTGCCGGC
ATGATGTATG AAGCGATGAA GCGGGCTGCT GCACTAGATA TGGCGATTGT CGCCCATTGC
GAAGATAACA CTCTGGCGAA TCGCGGTGTG GTGCATGATG GCGAATTTGC GCACCGCTAC
GGGCTATATG GAATTCCATC CGTATGCGAA TCGGTACATA TCGCGCGTGA TGTGCTATTA
GCGGAAGCAA CGGGATGTCA CTACCATGTG TGCCATATTA GCACGAAAGA ATCGGTCCGC
GTTGTCCGCG ATGCAAAACG GGCAGGAATT CGCGTCACCG CGGAAGTGAC GCCGCATCAT
CTTCTTTTAT GCGATGAAGA TATTCCAGGC CCTGACGCGA ATTATAAGAT GAATCCGCCG
CTTCGCAGCA AAGAAGACCG CGAGGCGTTA ATCGAGGGGC TGCTTGATGG CACGATCGAC
TTTATCGCAA CCGACCATGC CCCGCATACG GAAGCGGAAA AACAAAAAGG AATCAATGCC
GCCCCGTTTG GCATTGTCGG TTTGGAAACG GCGTTTCCGC TCCTTTATAC CCACTTGGTC
GAAACAAACA TATTGACACT GAAGCAGCTG ATTGATTTGC TGACGGTGAA GCCGGCTGAA
TGCTTCGGCT TGCCGCTTGG AAAGCTTGCT GTCGGCGAGC GGGCGGATAT TACGATTATA
GATTTAGAGA CCGAAGAAGC AATTGATCCA CAGACGTTTG TATCCAGAGG GAAAAATACT
CCATTTGCCG GTTGGAAATG TAAGGGTTGG CCGGTGATGA CGTTTGTCGG CGGAAAACTA
GTTTGGCAGA AAGGAAGAGA ATAA
 
Protein sequence
MAIILKNGKS FNKDGVIERT ELKIENGFIT AIGSKLHSEE ADEVIDVQGK LISAGFIDLH 
VHLREPGGEA KETIATGTLA AAKGGFTTVA AMPNTRPVPD TKEQMEWLCK RIRETAYVHV
LPYAAITVGQ QGTELTDFAA LKEAGAFAFT DDGVGVQSAG MMYEAMKRAA ALDMAIVAHC
EDNTLANRGV VHDGEFAHRY GLYGIPSVCE SVHIARDVLL AEATGCHYHV CHISTKESVR
VVRDAKRAGI RVTAEVTPHH LLLCDEDIPG PDANYKMNPP LRSKEDREAL IEGLLDGTID
FIATDHAPHT EAEKQKGINA APFGIVGLET AFPLLYTHLV ETNILTLKQL IDLLTVKPAE
CFGLPLGKLA VGERADITII DLETEEAIDP QTFVSRGKNT PFAGWKCKGW PVMTFVGGKL
VWQKGRE