Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_0162 |
Symbol | |
ID | 4058408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | - |
Start bp | 150443 |
End bp | 152131 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641229159 |
Product | hypothetical protein |
Protein accession | YP_603634 |
Protein GI | 94984270 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.183423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.57312 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGT TGTACTTCGT GCTGGCGCTC CTGGTGGGGT TAGCAGGCGG GTTTTTCGTC GGACAGGCGC GCGGGCGGCA ACAAAGGGCC ACCCTTGATG ACCAGCTCCA GCGGGAAGCG CGGGCCGAGG CGGAACGCAT CCGGACACAG GCGGACGCCG AGGCCCGGCA GCTGCGCGAG CAGGCTGAGC AACGCCTGCA AGACGCAGCG CGGCGCCTGC AAGAAGCCGA CGACCGGGAA CGCCAAGTCA CCCTTCAACT GGAAGCGCAA AGGGAGCAGC TTCAGGCCGT TCGCGCCCAG ATCGAGGCCG AGCGGGCACG GGCCGCCCAG GACGCCGCGC GCGAACGCGA GACACTCAGC GCTGACCGGC AGGAAACCCG GCGCGAACGT GAGGAACTCA AGCGCGAGAT CGAGCGCCTC AACCGCCGGG CCGAGCAGCT CGACGCGCGG GGTGACAAGC TCGACGCCCT CGAGGAACGT CTAGAAGGAC AACTCCACGC ACTGGCCCAG CAGGAGGCTG AACTGGCGGA GCGCAGCCGC CAGGTGGACC TCAAGCTCTA CGAGGTTGCA GGCCTCACCC CCGAAGCTGC GCGCGAACAG ATCCTCCGGC AGCTTGACGC CGAACTGGAG GAGGAAAAAG CCATCCGGGT CAAGGCGATG ACCGAGCGGG CAACAGCAGA GGCCAGGCGT ACCGCCCGCA ACGTGATCGC ACAGGCCATT CAGCGCAGTG CCAGCGAGAC CAGCAGCCAG ATGAGCGTGT CGGTAGTGCC CATTCCCAAT GACGCCATGA AGGGCCGTCT GATTGGGCGC GAGGGGCGCA ATATCCGCGC GTTTGAGGCG CTGACCGGCG TGGACCTGAT CATCGACGAC ACGCCCGAGG CGGTCATCTT GTCGAGCTTC AACCCGGTGC GGCGTGAGGT GGCCCGCCAC GTGCTGGAAG CGCTGGTGGC CGATGGGCGC ATCCACCCCA CCCGCATTGA GGAGATGGTT CACAAGGCCC AGGATGAGAT GAAGAGCTTC ATCCACGCCC AGGGCGAGGA GGCGGCCATC GAGTCAGGCG TGGTGGGCCT CAAGCCGGGG CTGGTGCAGT TGCTCGGAAG GATGTACTTC CGCTCCAGCT ATGGCCAGAA CGTGCTGAAG CACTCCGTGC AGGTCGCGCA CCTCACCGGC ATCATGGCCG ATGAGCTGGG GCTGGACGCG GCTCTCGCCC GCCGCGCTGG GCTGATGCAC GACATCGGCA AGAGCATCGA CCGCGAGATC GAGGGCACCC ACGTCGAGAT CGGCATCAAC CTCGCCAAAC GCTTCGGGGA GCCGCCGGAA GTGATCGATG CCATCGCGCA CCACCACGAC CCCGAGAACG GCGAGACGCT GTACTCGGTG TTGGTGGCCG CCGCCGACGC GATCAGCGCC GCCCGGCCCG GAGCCCGCCG CGAGGAACTC GAAGCCTATG TGCGGCGCCT GGAACAGCTC GAACAGATTG CCATTGCCTT TCCCGGTGTG CAGCAGGCCT ACGCGATCCA GGCGGGCCGC GAGGTGCGCG TGCTGGTGCA ACCCGAGAAG GTCACCGACG CCCAAGCCAC CCTGCTCGCC CGTGAGATCG CCGGACGCAT CGAGCAGGAC ATGGAGTACC CCGGCCAGGT GCAGGTCACA GTGGTGCGCG AGAGCCGCGC CGTGGAGGTC GCCCGGTAA
|
Protein sequence | MNMLYFVLAL LVGLAGGFFV GQARGRQQRA TLDDQLQREA RAEAERIRTQ ADAEARQLRE QAEQRLQDAA RRLQEADDRE RQVTLQLEAQ REQLQAVRAQ IEAERARAAQ DAARERETLS ADRQETRRER EELKREIERL NRRAEQLDAR GDKLDALEER LEGQLHALAQ QEAELAERSR QVDLKLYEVA GLTPEAAREQ ILRQLDAELE EEKAIRVKAM TERATAEARR TARNVIAQAI QRSASETSSQ MSVSVVPIPN DAMKGRLIGR EGRNIRAFEA LTGVDLIIDD TPEAVILSSF NPVRREVARH VLEALVADGR IHPTRIEEMV HKAQDEMKSF IHAQGEEAAI ESGVVGLKPG LVQLLGRMYF RSSYGQNVLK HSVQVAHLTG IMADELGLDA ALARRAGLMH DIGKSIDREI EGTHVEIGIN LAKRFGEPPE VIDAIAHHHD PENGETLYSV LVAAADAISA ARPGARREEL EAYVRRLEQL EQIAIAFPGV QQAYAIQAGR EVRVLVQPEK VTDAQATLLA REIAGRIEQD MEYPGQVQVT VVRESRAVEV AR
|
| |