Gene Dret_1935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1935 
Symbol 
ID8419780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2216869 
End bp2217996 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content60% 
IMG OID645038523 
ProductRadical SAM domain protein 
Protein accessionYP_003198797 
Protein GI258406055 
COG category[L] Replication, recombination and repair 
COG ID[COG1533] DNA repair photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.779003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG CGATTTCGCC TCCGGCGCAG TTGCCCTCCT GGTTGCAAAC GATCAGCGAG 
GTGGTGGTCG ATCCCTCGGT GCGCACCACC CCGCTGGCCG CCAGGGTGCG TGAGCGACTG
CCCCATGTGC GTTGGACGGT GCTTGACGAG GACGGTGCTG TGCCGGCGTG GGGACAGGAG
GAACAGGTCG TCTATCTCAA GCACTACAAG GGCCGCTTCC TGCGTTTTTG CCCCGGCACT
AAGGCCTACC GGTGTTGCGG GTACCGGATC GTGCATATCG GAGAGAATTG TCCCCTGTCG
TGTTCCTACT GTATTTTGCA GGCGTATTTT CAGGACCGGA TGCTCAAGGT CTGGGCCAAC
CAGGAGGACC TGTACACCGA ACTCGACCGG GCCTTTGCCG CTAATCCGGA CAAGCGGTAC
CGGTTGGGCA CTGGGGAGTT CACGGACTCG CTGGCCCTGG AACCGCTCAC CGGGTACAGC
CGTGATCTGG TTGATTTTCT GGGGCGCTAT CCCAATGTCT GCCTGGAACT CAAAAGCAAA
GTCATCGATC TGAGTTGGAT GACGGCGGTC CGAGATCCGC GCCGGGTCCT GCCGGCGTGG
TCGCTCAACG CCCCGGCGAT CCACGAAGAG CAGGAACACA GGACGTCGAC ACTCACGGAG
CGGCTGGAGG CAGCGCGCGA ATGCGTCCGC CACGGCTTTC GGGTCTGTTT GCATTTTGAT
CCGATCATCC GTTATCCGGG ATGGGAAACA GGGTATGACC GTATCGTGGA TATGATTTTC
GAGTATCTCC GCCCTGCGGA TATTGCCTAC ATGAGTTTGG GGTCATTCCG CTGTATGCCG
GAGCTGAAAT CGATCATCGA TCGTCGCCAC CCCCAGGCGC GGTATATCTA CGACGAATTT
ATCACCGGAG CGGACGATAA ATTGCGCCTG CTCAGACCTC TTCGCGTCGA ACAATTCCGG
CGCATTGTCA GCCGGCTCCA GAAATGGGGG ATGCGGGAGC AGCTCTATTT CTGCATGGAG
TCTGATACCG TCTGGCAGCA GGTCTTCGGG TATACCCCTC GGCAACTCGG GGGGCTGCAA
AACCATCTGC TGCGCCGGAG TTTTGGTGAG GAGTCGGATG CCCTGTAA
 
Protein sequence
MSAAISPPAQ LPSWLQTISE VVVDPSVRTT PLAARVRERL PHVRWTVLDE DGAVPAWGQE 
EQVVYLKHYK GRFLRFCPGT KAYRCCGYRI VHIGENCPLS CSYCILQAYF QDRMLKVWAN
QEDLYTELDR AFAANPDKRY RLGTGEFTDS LALEPLTGYS RDLVDFLGRY PNVCLELKSK
VIDLSWMTAV RDPRRVLPAW SLNAPAIHEE QEHRTSTLTE RLEAARECVR HGFRVCLHFD
PIIRYPGWET GYDRIVDMIF EYLRPADIAY MSLGSFRCMP ELKSIIDRRH PQARYIYDEF
ITGADDKLRL LRPLRVEQFR RIVSRLQKWG MREQLYFCME SDTVWQQVFG YTPRQLGGLQ
NHLLRRSFGE ESDAL