Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_2097 |
Symbol | |
ID | 4058194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 2207310 |
End bp | 2208440 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641231136 |
Product | hypothetical protein |
Protein accession | YP_605560 |
Protein GI | 94986196 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0332172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTGGC TGCGCGACCA GGCTCTGGCC CCCATCGTGG AGAAAGTGGA AGCGGGCGAG CGCCTGAGCT TTGACGAGGG GATGCGGCTC TACCACACCC GCGACCTGAA CGCGCTGATG CGCCTGGCGA ACCAGACCAA GATGCGGCTG CACGGAGACA AGGTGTACTT CGTCCACTCC ATGCGGCTTG AATTCACCAA TATCTGCTAC GTGGGCTGCA CCTTCTGCGC CTTTGCCGCG CGCAAGGGCG AGGACCGCGC CTGGGACTAT TCCCCAGAAG AAGTGGTCGA GCAGGTGCGG CGCCGTTACC TTCCCGGCAT CACCGAACTC CACATGAGTA GTGGGCACCA TCCCAACCAC CCGTGGGCCT ATTACCCCGA GATGGTGCGG CGGCTGCGGG CAGCTTTCCC CGACCTGCAG GTCAAGGCCT TCACGGCGGC GGAGATCGAA CACCTGTCGC GAATCAGCAA GATGCCTACC CTGGATGTCC TGCGTGAGCT TCAGGCGGCG GGCCTCGCGG CAATGCCAGG TGGCGGGGCG GAAATCTTCG CGGACCGGGT GCGGCGCCAG GTGGCGAAGA ATAAGGTGAA GGCGGAAAAG TGGCTCCAGA TTCACCGTGA GGCGCATTCG CTGGGCATGC GGACAAACGC CACCATGCTC TACGGCCATA TCGAGACGCT GGAAGAACGG CTGGACCACC TGCACCGGCT GCGGGAGTTG CAAGACGAGA CGGGCGGCTT TCACGCCTTC ATCCCCCTTG CCTTTCAGCC GCTCGGGAAC ACGCTCGCGC AGAACCTCGG CAAGACCGAG TTCACGACCG GGCTCGACGA TCTGCGAAAT CTCGCAGTGG CCCGCGTGTA CCTCGACAAC TTCCCGCACA TCAAGGGCTA CTGGGTGATG ATCGGCTCGG AGCTGACGCA GGTCAGCCTG GACTGGGGCG TCAGCGATAT CGACGGCACC ATTCAGGAAG AACACATTGC GCACGCCGCA GGGGCGACCT CCCCGATGGC GCTCTCGCAG GCGGGCATGG TGAGGATGAT CCAGCAAGCC GGGCGCGTGC CGGTGCTGCG CGACGCCTAC TATCACGAGC TGGAGGTCTT CCCCCGGCTG GGCGCGGAGG CGGCGGACTA G
|
Protein sequence | MKWLRDQALA PIVEKVEAGE RLSFDEGMRL YHTRDLNALM RLANQTKMRL HGDKVYFVHS MRLEFTNICY VGCTFCAFAA RKGEDRAWDY SPEEVVEQVR RRYLPGITEL HMSSGHHPNH PWAYYPEMVR RLRAAFPDLQ VKAFTAAEIE HLSRISKMPT LDVLRELQAA GLAAMPGGGA EIFADRVRRQ VAKNKVKAEK WLQIHREAHS LGMRTNATML YGHIETLEER LDHLHRLREL QDETGGFHAF IPLAFQPLGN TLAQNLGKTE FTTGLDDLRN LAVARVYLDN FPHIKGYWVM IGSELTQVSL DWGVSDIDGT IQEEHIAHAA GATSPMALSQ AGMVRMIQQA GRVPVLRDAY YHELEVFPRL GAEAAD
|
| |