Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2085 |
Symbol | |
ID | 7408794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2207942 |
End bp | 2209087 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643716452 |
Product | dihydroorotate dehydrogenase family protein |
Protein accession | YP_002573935 |
Protein GI | 222530053 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00414883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAACC TATCAACGAC GTATGCAAAG CTGAATTTAA GAACACCTGT AATTGTTGCA TCTGCTGGCA TTACTGGAAC TGTGGAGAGG CTTCAAAGAT GCGAAGAAAA CGGTGCTGGG GCAGTTGTGA CAAAAAGTCT TTTTCAAAAG GAAATATGCA GAATTGCACC CACTCCACGG TTTAAAATAG TCAAGCATGA AAACACGTTT ACGCTTTACT CATATGAACA GGCAAGCGAA TTTAACCCTC AAGAGTATGC TGAATTTATA TTCAAAGCAA AACAAAAGCT AAGCATTCCA GTTATTGCGA GTATAAACTG CTACACAGAT GATGCATGGC TTGAGTATAG CAAGCTTATG GAGCAGGCAG GGGCTGATGC GATAGAGCTA AACCTTTCAT GTCCTCACGG TGTGCATATA ATGTCTGGTA TGGATGTAAT TGAAGAGATG GTCAACACAA CAAAACTTGT CAAAAGCAAT GTTAAGATAC CAGTGATACC CAAAATGACT CCTCAATCTA CAAATCCGGG ATCTGATGCC TTAAGACTCG ACAGTGCAGG AGCAGACGGG CTTGTAATGT TCAATAGATT TACAGGGCTT GACATTGATA TTGAGAAAGA AGCACCCATT TTGCACGGCG GTTATGCAGG GCATGGTGGT CCGTGGGCAA TTATGTATGG TTTGAGGTGG ATAAGCGCTG TATCGCCAAA AGTGAAATGT AGTATCAGTG CAAGCGGCGG TGCCATGAAT GGAGAAGATG TTGTCAAATA CATATTGGCA GGTGCGTCGG CTGTTCAAGT TTGCACAACT GTTATTTTGA ATGGCTATGG GGTTATAAAA AAGATAAACA AGTATTTAGA AGAGTACATG GAGAGAAAAG GTTACAACAC AATTGATGAT TTTAAAGGAA AGGTGTGCAG TAGAATTCTT GACATGGACT CTGTTGACAG AACGCACTGG GCTGTTGCAA GGATTGACAA AGAAAAATGC ACATCTTGTG GCAAGTGCTT CACAGTTTGC ATATATGATG CAATTGAAAA GGATGATGGA AAGTTTAAAG TAAATCAAAA CTGCGATGGC TGCGGACTTT GTGCAGAACT GTGCCCAGCC AAGGCAATCT TAATGGTAAG AAGAGGTGAA GTTTAA
|
Protein sequence | MPNLSTTYAK LNLRTPVIVA SAGITGTVER LQRCEENGAG AVVTKSLFQK EICRIAPTPR FKIVKHENTF TLYSYEQASE FNPQEYAEFI FKAKQKLSIP VIASINCYTD DAWLEYSKLM EQAGADAIEL NLSCPHGVHI MSGMDVIEEM VNTTKLVKSN VKIPVIPKMT PQSTNPGSDA LRLDSAGADG LVMFNRFTGL DIDIEKEAPI LHGGYAGHGG PWAIMYGLRW ISAVSPKVKC SISASGGAMN GEDVVKYILA GASAVQVCTT VILNGYGVIK KINKYLEEYM ERKGYNTIDD FKGKVCSRIL DMDSVDRTHW AVARIDKEKC TSCGKCFTVC IYDAIEKDDG KFKVNQNCDG CGLCAELCPA KAILMVRRGE V
|
| |