Gene Information |
Locus tag | STER_0969 |
Symbol | |
ID | 4438240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 894550 |
End bp | 895497 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 639676628 |
Product | dihydroorotate dehydrogenase 1B |
Protein accession | YP_820382 |
Protein GI | 116627763 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0916684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAG AAAATCGTTT AGCTATTAGT CTTCCAGGGC TCGATTTGAA AAATCCCATT ATCCCAGCTT CTGGTTGCTT TGGTTTTGGT CAAGAGTATT CCAAGTACTA TGATTTGGAC AAACTAGGGT CGATCATGAT TAAGGCAACA ACTGCCAACC CACGTTTTGG GAATCCAACA CCCCGTGTAG CAGAGACACC ATCAGGTATG CTTAATGCCA TTGGACTGCA AAATCCGGGT GTTGATGCTG TCTTGTCTGA AAAACTCCCA TGGTTACAGG AACATTATCC TGAGTTGCCA ATCATTGCGA ATGTGGCTGG ATTTTCTAAC GAAGAATATG CAGAAGTCTC TCACAAGATT TCCAAAGCTA GTAACGTCAA GGCAATCGAG CTTAACATCT CCTGTCCAAA CGTGGACCAT GGCAATAACG GTCTTCTCAT AGGACAAGTA CCAGAACTTG CCTATGCAGC CGTAAAAGCC AGTGTTTCTC ACTCTGATGT GCCCGTCTAT GTCAAACTGA CACCAAGCGT GGCTGACATT ACAAGTGTTG CCAAGGCAGT CGAAGATGCC GGTGCGACAG GTTTCACTAT GATTAACACC TTGGTTGGTA CACGCTATGA TTTGGCGACT CGCAAACCAA TCATTGCCAA TGGTCAAGGT GGTATGTCAG GACCAGCTGT CTTCCCAGTA GCCCTTAAAC TCATCCGCCA AGTTGCTCTA GCGTCAGACC TCCCAATCAT CGGTATGGGT GGCGTTGACA GTGCCGAAGC GGCTATCGAA ATGTTCATCG CTGGTGCCTC AGCCATCGGT GTCGGAACAG CAAACTTCGC AGATCCCTAT GCCTGCCCTA AAATCATTGA TCGTCTCCCT GAAGTCATGG ACAAGTATGG CATCACAACA CTAGAAGACT TACGTAAGGA GGTTCGAACA GACTTGTTGG GGAAATAA
|
Protein sequence | MKSENRLAIS LPGLDLKNPI IPASGCFGFG QEYSKYYDLD KLGSIMIKAT TANPRFGNPT PRVAETPSGM LNAIGLQNPG VDAVLSEKLP WLQEHYPELP IIANVAGFSN EEYAEVSHKI SKASNVKAIE LNISCPNVDH GNNGLLIGQV PELAYAAVKA SVSHSDVPVY VKLTPSVADI TSVAKAVEDA GATGFTMINT LVGTRYDLAT RKPIIANGQG GMSGPAVFPV ALKLIRQVAL ASDLPIIGMG GVDSAEAAIE MFIAGASAIG VGTANFADPY ACPKIIDRLP EVMDKYGITT LEDLRKEVRT DLLGK
|
| |
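The coordinate, length, and sequence fields in this record can be cross-checked against one another. The following is a minimal sketch, not part of the source database: the helper names (`span_length`, `expected_protein_length`, `translate`) are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the codon lookup string is the standard amino-acid assignment used by translation table 11. Only the first and last 18 bases of the gene sequence are translated here.

```python
# Values copied from the record above.
START_BP = 894550
END_BP = 895497
GENE_LENGTH_BP = 948
PROTEIN_LENGTH_AA = 315

# Amino acids for translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G). '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Codon count minus the terminal stop codon."""
    return gene_length // 3 - 1


def translate(dna):
    """Translate DNA codon by codon; spaces are ignored, trailing bases dropped.

    Raises ValueError on any character other than A, C, G, or T.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First and last 18 bases of the gene sequence, copied from the record.
first_codons = "ATGAAATCAG AAAATCGT"
last_codons = "GACTTGTTGG GGAAATAA"

print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
print(translate(first_codons))
print(translate(last_codons))
```

The two translated fragments should agree with the start (`MKSENR`) and the end (`DLLGK`, followed by a stop) of the protein sequence listed above.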