Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_1817 |
Symbol | pyrC |
ID | 5877932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 1827095 |
End bp | 1828390 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641542170 |
Product | dihydroorotase |
Protein accession | YP_001663435 |
Protein GI | 167040450 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0249027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGATGA TTATAAAAAA CGGTACGGTA ATAGATGGAT TTGGAAGTGA AGCCACAGCG GATATATTGA TAGACTATGG TATCATAAAA GCGATTGATA AAAATATACA AGTCTCAGAC GGGATAGTGA TTGATGCGAC AGGAAAATAT GTACTTCCGG GCTTTGTTGA TATGCACACC CATTTAAGGC AGCCGGGATT TGAAGAGAAA GAGACTATTA AAACAGGGAC AGAGGCGGCA GCAACAGGAG GGTATACAAC TGTTGCCTGC ATGCCAAATA CAAATCCTCC TATAGACAAT GAAATAGTAG TAGAATATGT AAAAAGCATT GCACAAAGAG AAGGAGTTGT AAAAGTACTG CCTATAGGAG CCATGACAAA GGGAATGAAA GGCGAAGAGA TAACCGAAAT GGCAAAACTT AAAAAAGCAG GGGTTGTTGC CTTATCTGAT GATGGTTTTC CAATAATGAG CGCAGGGCTT ATGAAGAGGA TAATGACATA CGGAAAAATG TATGACCTTC TTATGATAAC TCACTGTGAA GACAAAGCCT TAAGTGGAGA AGGTGTAATG AATAGCGGAG TAATTTCAAC AATGATAGGA TTAAAAGGCA TACCAAGAGA AGCAGAAGAA GTCATGCTTG CAAGAAATAT TATACTTGCA AAGTCAACTG GTGTAAGGCT TCACATCGCA CATATCTCAA CAAAGGGAAG TGTCGAACTT ATAAGAGAAG CGAAAGAAAA GGGAGTGAAA ATAACTGCTG AAGTGACTCC TCACAATCTT ACCTTGACAG ATGAAGCAGT TTACAATTAC GATACAAACA CAAAAGCTTA TCCACCCTTA AGGACAAGAG AGGATATAGA AGCATTAATA GAAGGATTGA AAGATGGCAC AATAGATGCA ATCGCGACAG ACCATGCCCC TCACACTAAG GATGACAAAA AAGTACCTTA CGACATGGCT GCTTTTGGAA TATCGGGCTT AGAGACAGCC TTCTCTGTGA TAAATACTTT CCTTGTACAG ACAGGTAAAA TAACCATAAA AGAGTTAGTC AATTACATGA GTATAAATCC TGCGAAAATA TTAGGTATAT CCAGTGGGAT AAAGGTAGGG TCGATAGCGG ATATTGTAAT TGTAGACCCT TATGAAGAGT ATGTAGTTGA CAAAGATAAA TTTAAATCAA AAGGGAAAAA CACACCTTTC CATGGTATGA GGCTAAAAGG AGTTGTGGAT TGTACGATAG TGGAAGGAGA AATAAAATAC AAAAAGGATA GAAAAACAGA AAAAGTTGAG GTATAA
|
Protein sequence | MRMIIKNGTV IDGFGSEATA DILIDYGIIK AIDKNIQVSD GIVIDATGKY VLPGFVDMHT HLRQPGFEEK ETIKTGTEAA ATGGYTTVAC MPNTNPPIDN EIVVEYVKSI AQREGVVKVL PIGAMTKGMK GEEITEMAKL KKAGVVALSD DGFPIMSAGL MKRIMTYGKM YDLLMITHCE DKALSGEGVM NSGVISTMIG LKGIPREAEE VMLARNIILA KSTGVRLHIA HISTKGSVEL IREAKEKGVK ITAEVTPHNL TLTDEAVYNY DTNTKAYPPL RTREDIEALI EGLKDGTIDA IATDHAPHTK DDKKVPYDMA AFGISGLETA FSVINTFLVQ TGKITIKELV NYMSINPAKI LGISSGIKVG SIADIVIVDP YEEYVVDKDK FKSKGKNTPF HGMRLKGVVD CTIVEGEIKY KKDRKTEKVE V
|
| |