Gene Dret_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1007 
Symbol 
ID8418829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1180590 
End bp1181699 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content57% 
IMG OID645037576 
ProductRadical SAM domain protein 
Protein accessionYP_003197873 
Protein GI258405131 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0379726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGCCT CCCCGCCGCA TGCCATAACC GCACGTGACT TACGAACCAT TGGAAATGCC 
ATGTCTGAAC GAATGACGCC CGAGGCGGCG CAACGCCTGT GGGATTCGCA CGATCTTTTT
GAACTGGGGC GTATGGCCCA TGACCGCCGT TGGGCCCTCC ATCCAGACCC GACTGTGACG
TATATCGTCG ACAGAAATAT CAACTACACC AATATATGTG TGTCTGGGTG TAAGTTCTGT
GCCTTTTTTC GCCCCCCTGG CCATAGCCAG GGATTTGTCC TCAGTCTCGC GCAACTCGAG
CAGAAAGTCC AAGAGACCGT GGATGTCGGC GGCTACCAGA TTCTGTTGCA GGGGGGCATG
CACCCCGACT TGCCGCTGGC CTTTTACCAA GACATGCTGG GGTTTCTGAA GCAACGCTTT
CCCCAGGTGG CAGTCCACGG CTTTTCTCCG CCGGAAATCT GGTTCCTGGC CGAAAATGAA
GGCCGCTCTC TAACTGAGAT TGTCGCTGAA CTCAAGCAGG CCGGGCTTGA TTCCATTCCC
GGGGGCGGCG CGGAGATCCT GACCGACCGC ATGCGCAACG AGGTTTCCCC CAATAAGTGT
TCGGCGGCGC AATGGCTGGC GGTTATGGAA GAGGCGCACA ATCAGGGACT GCAGACCACC
GCAACCATGA TGTTCGGACA GGGGGAGCGG TTTGACGAGC GGTTGGAACA TCTGGAAGCG
CTTCGGGCGC TGCAGGACCG GACCCATGGT TTCACCGCGT TTATCCCGTG GACCTTCCAG
CCCCGCAATA CCCAGATCCA CCGCACCGAG ACCTCCTCTC ACGAGTATCT CAAATTCCTG
GCCCTGGCCC GTTTGTACCT GGATAATATT CCCAATATCC AGGCCTCATG GGTCACGCAG
GGACCGCTTA TAGGGCAATT GGCCCTGTTT TGGGGCGCCA ATGATTTTGG ATCGACCATG
ATTGAAGAAA ATGTGGTTGC TGCGGCTGGG GTCCATTTTC GTCTGCCGGA AACAACGATC
CGGAATATTG TTGAACGAGC CGGGTTTGTC CCGCGCCGGC GGCGCATGGA TTACACCCTG
TTAACCGAGG ACATCCCCGC TGAAAGGTAG
 
Protein sequence
MRASPPHAIT ARDLRTIGNA MSERMTPEAA QRLWDSHDLF ELGRMAHDRR WALHPDPTVT 
YIVDRNINYT NICVSGCKFC AFFRPPGHSQ GFVLSLAQLE QKVQETVDVG GYQILLQGGM
HPDLPLAFYQ DMLGFLKQRF PQVAVHGFSP PEIWFLAENE GRSLTEIVAE LKQAGLDSIP
GGGAEILTDR MRNEVSPNKC SAAQWLAVME EAHNQGLQTT ATMMFGQGER FDERLEHLEA
LRALQDRTHG FTAFIPWTFQ PRNTQIHRTE TSSHEYLKFL ALARLYLDNI PNIQASWVTQ
GPLIGQLALF WGANDFGSTM IEENVVAAAG VHFRLPETTI RNIVERAGFV PRRRRMDYTL
LTEDIPAER