Gene Information |
Locus tag | Dret_1007 |
Symbol | |
ID | 8418829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 1180590 |
End bp | 1181699 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 645037576 |
Product | Radical SAM domain protein |
Protein accession | YP_003197873 |
Protein GI | 258405131 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
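The length fields in the record above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed Gene Length) and that the last codon is a stop codon; the variable names are illustrative only:

```python
# Values copied from the Gene Information record above.
start_bp = 1180590
end_bp = 1181699

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon per three bases; the final codon is assumed to be the stop codon
# and does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1110 369
```

Both results agree with the Gene Length (1110 bp) and Protein Length (369 aa) fields.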
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0379726 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGGCCT CCCCGCCGCA TGCCATAACC GCACGTGACT TACGAACCAT TGGAAATGCC ATGTCTGAAC GAATGACGCC CGAGGCGGCG CAACGCCTGT GGGATTCGCA CGATCTTTTT GAACTGGGGC GTATGGCCCA TGACCGCCGT TGGGCCCTCC ATCCAGACCC GACTGTGACG TATATCGTCG ACAGAAATAT CAACTACACC AATATATGTG TGTCTGGGTG TAAGTTCTGT GCCTTTTTTC GCCCCCCTGG CCATAGCCAG GGATTTGTCC TCAGTCTCGC GCAACTCGAG CAGAAAGTCC AAGAGACCGT GGATGTCGGC GGCTACCAGA TTCTGTTGCA GGGGGGCATG CACCCCGACT TGCCGCTGGC CTTTTACCAA GACATGCTGG GGTTTCTGAA GCAACGCTTT CCCCAGGTGG CAGTCCACGG CTTTTCTCCG CCGGAAATCT GGTTCCTGGC CGAAAATGAA GGCCGCTCTC TAACTGAGAT TGTCGCTGAA CTCAAGCAGG CCGGGCTTGA TTCCATTCCC GGGGGCGGCG CGGAGATCCT GACCGACCGC ATGCGCAACG AGGTTTCCCC CAATAAGTGT TCGGCGGCGC AATGGCTGGC GGTTATGGAA GAGGCGCACA ATCAGGGACT GCAGACCACC GCAACCATGA TGTTCGGACA GGGGGAGCGG TTTGACGAGC GGTTGGAACA TCTGGAAGCG CTTCGGGCGC TGCAGGACCG GACCCATGGT TTCACCGCGT TTATCCCGTG GACCTTCCAG CCCCGCAATA CCCAGATCCA CCGCACCGAG ACCTCCTCTC ACGAGTATCT CAAATTCCTG GCCCTGGCCC GTTTGTACCT GGATAATATT CCCAATATCC AGGCCTCATG GGTCACGCAG GGACCGCTTA TAGGGCAATT GGCCCTGTTT TGGGGCGCCA ATGATTTTGG ATCGACCATG ATTGAAGAAA ATGTGGTTGC TGCGGCTGGG GTCCATTTTC GTCTGCCGGA AACAACGATC CGGAATATTG TTGAACGAGC CGGGTTTGTC CCGCGCCGGC GGCGCATGGA TTACACCCTG TTAACCGAGG ACATCCCCGC TGAAAGGTAG
|
Protein sequence | MRASPPHAIT ARDLRTIGNA MSERMTPEAA QRLWDSHDLF ELGRMAHDRR WALHPDPTVT YIVDRNINYT NICVSGCKFC AFFRPPGHSQ GFVLSLAQLE QKVQETVDVG GYQILLQGGM HPDLPLAFYQ DMLGFLKQRF PQVAVHGFSP PEIWFLAENE GRSLTEIVAE LKQAGLDSIP GGGAEILTDR MRNEVSPNKC SAAQWLAVME EAHNQGLQTT ATMMFGQGER FDERLEHLEA LRALQDRTHG FTAFIPWTFQ PRNTQIHRTE TSSHEYLKFL ALARLYLDNI PNIQASWVTQ GPLIGQLALF WGANDFGSTM IEENVVAAAG VHFRLPETTI RNIVERAGFV PRRRRMDYTL LTEDIPAER
|
| |
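The gene sequence begins with GTG while the protein begins with M. A minimal sketch of how that can be reproduced, assuming the standard codon assignments (which translation table 11 shares with the standard code) and treating ATG, GTG and TTG as initiator codons read as methionine in the first position. The `translate` helper and the `START_CODONS` set are illustrative, and the set is a deliberate subset rather than the full list of table 11 initiators:

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: common bacterial initiator codons only (subset of table 11).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon in the first position is read as methionine.
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("GTGCGGGCCT CCCCGCCGCA TGCCATAACC"))  # MRASPPHAIT
```

The output matches the first block of the listed protein sequence. For real work, a maintained library implementation of the genetic code tables is a safer choice than a hand-written table.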