Gene Dret_1149 details


Gene Information

Locus tag: Dret_1149
Symbol:
ID: 8418977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1345408
End bp: 1348899
Gene Length: 3492 bp
Protein Length: 1163 aa
Translation table: 11
GC content: 63%
IMG OID: 645037724
Product: SMC domain protein
Protein accession: YP_003198015
Protein GI: 258405273
COG category: [S] Function unknown
COG ID: [COG4717] Uncharacterized conserved protein
TIGRFAM ID:
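
The length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes the terminal stop codon. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of the record:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1345408, 1348899)
print(length)                     # 3492
print(protein_length_aa(length))  # 1163
```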


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.166006
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGATCG CCCGTTTCAC GCTCGACGCC TTTGGTCCGT TTACCGACAC CAGTCTGGTT 
TTCGACCCCC AGCCCCTGCA CCTGATCTAC GGTCCGAACG AGGCGGGCAA GAGCAGCACC
CTGCGCGCCC TGACCGCTTG GCTCTACGGC TTTCCCGAAC GGACCACCGA CAATTTTCTG
CACGCCAATC CCCGGCTGTG TGTTTCCGGC ACCCTGGAGA ACGGGCGTGG CGACACATTG
ACCTTTTCCC GGCGTAAAAA ACGCAAAGGC AGTGTCCTCG ACGCCGGGGG CAATGCCGTT
GATCCTCAGG TGGTAGCGGC CTGGTTACAG GGCTTGGACC AGGAGACCTT CCAGGCCCTG
TTCGGTTTGG ATCATCCCGG CCTGGTCCAG GGAGGGACCA CGATTTTGCA GGAGCAGGGC
AGTACGGGCA CGACCCTGTT CGCCGCCGGC AGTGGGATCG CTTCCCTGCG CCATGTCCAG
AAAGGGTTGC AGGAGGAATA CGAGGCCGTG TTCAAACCCA GTGGGAGCAA GCCGGACCTC
AATGCCGCCC TGCACCGCTA TAGTGAGTTG AAAAAAGAAT CCAACCAGCT TGGCCTGGCC
AGCACGGCCT GGAAACACAA GGAACGCGCT TTGCGCGAGG CGCAAAAAGG TCTCGATGCG
CTCAAGGCAG AGCGCGCCGT TCAGCAGACC GAACGCCGCC GTCTGGAACG GATCCAGCAG
GCGATGGCCC CTTTGGCCCG CCTGCGCAAA GCAGAAGCCG GGCTGAGCGA ACTCGGTCCG
GTCCCCGATT TGCCCGACGA TTTCCCCAAA CGGCGACAAG AGAGCCAGGA CCAGTTGCGG
GAGGCCCAGC AGTCCCTGGA CAATGCCCAA AAACGGCTGG CCAATGATCA GCACAAACTC
GACGGTATCT CCCTGAATAA CGAGCTTTTG GATCAGGCCG CGACCATCGC TGATCTGCAC
CAGCGTCTCG GCGCCTACCG CAAAGGCCAG GCCGACTTGC GCTCCCTGGA GGACGATGAA
CTCGAACACC GCGCCGCGGC CCGGGATGTA TTGCGGGGGC AGCGCCCGGA TCTCGACCCA
GCTGACAGTG ATACGGTGCA GCAGCTCTTG CAGCACCGGC CACGCGTTCA GGCTCTGAGC
GAGCGCAAGC CTCTGGTGGA CAAGGAACTC TCCGACAGCC GCACTCTCTG GGAGAGCAAA
CAGGAGGAAC TGGCCCAGGT CCGGGACGCT TCGAAAACTT TGCCCCATGT TCAAGATACC
ACGCTGCTGC ACCAGGCTGC GGTCCATGCC GGACGCCTCG GCGATATCGA TACCGAGATC
GCGACAAAGC GCAGCGGGTG CGATCGCCTC CACCAGGAAT GCGCCACGAT GTTCGCTCAA
CTTGGCAAGT GGCATGGCAC GCCTGAAGAG GCTCTGAGGC TGACGTTGCC GCAGGAGGAG
AAGGTCCAAC TCCTGAAACA GGAATGGGAG GACACCGAAC GGCGCTCCAC AGAGCTAGAG
ACGCAGCGTC AAGAACTCTC AGAGACCTTT GCCGATCTGG ATGAACAACT GCAGGCCCAG
AAGGAGGCCG GGGCAGTGCC ATCGGAAGAA GATCTGGATG CCAGGCGGCG GGAGCGGGAT
GCGGCCTGGT CTTTGTTGCG CCGGCAGTGG GTGGACGGCG AAGACGTCTC CAATGCGGTT
ACCGAGATCG TCTCCGGGGG CAACCTGATC CAGCATTTTG AGCGTACACT CCACGAGGCG
GACTCACTTG CCGACCGTTT GCGGCGTGAA AGCGAACGGG TCCATACCCA TGCCCGGCTC
GTAGCCCAGT ACGACAGGGC CCGACAGCAT TGGGCGGCAT TACAGGAGGC GCAAGAGCGA
CTGGAGGCTG CTTGGCAGGA TCTGCAGCAG CGGTGGCGCA CCACCTGGGA CGGTACCGGC
ATTGAGCCAG ACGACCCCCG CCCCATGCTC AGCTGGCTGC ACCGTTTTCA AAAAGGCGTC
GACCAGGCCC GTCGCCTTTA CGAGGACCGC TTTGCGCTCC AGGAACTCGA AGGCAAGCGA
GAGGCGGCGC GAGTGAACCT GGTTCGCGAG CTCCAGGCTC TGGAGGAAGC GGTGCCGCAC
GGGGAGGAGC TCACGCCGGT GCGCGAGCGG GCCGACGCCG TTCTCAAGGC GCGGGAGGAG
TTGGCCCGCA AGCAGCAGCA GCTCCAGGAT GAACAGGTCC GGCTGGAACG GGAGACGGCC
ACAGCCCAGC ATCGGTTGAC CGCAGCCGAG GAAGCCAGGA GTGCCTGGCA GGCGAGTTGG
GGGCAGGCCA TGCAGGAACT TGGGCTCCCC GAGGATGCGA CGCCGGAGAC AGTCCACAAT
TTCTTTAGCG ATCTCGAGAC CTATCTCAGT CGGGCGAATG CGGCCGACGG GCTGCGCAAG
CGCATCGAGG GTATCCGGCG CGACAGCCAG GAATTGGAGC GGGATGTGGC CAGTCTCGTG
GAGCGCACTG CACCGGATCT TGCTGCTATC CCAGTGGATC AAGCGGTGGA GCGGTTAAAC
GGTCAACTGC AGCGCGAACG CGACCTGGCT ACAACTGTGG CCCATTACCA GGAACGGATC
GCCTCTACCC AGGAAGAAAT CGAATCGGCC CGGATCGCGC GCGACGCGGC CAGCCGGGAA
TTGGCCGCGC TTTGCGAACT GGCTGGCTGC ACCGAGGCCG AAGAACTGCC CGGAATCGAA
AAACGGTGGC AAGAACACCG GACCCTGGAG CAGAACAAGG GAGCGGCCCT GGACGAACTT
GGCGAGATTG CCGCTGCAAC GGATGTGGAG GAGCTGATCG CTGAGGTTGC GGCTGAAGAC
CCGGACGAGT TGCCGGCCCG GATCCAATCC TGCGATGAGG AATTGGACCG TTTGGACGGG
GAAATTGAGC ACCAGAGCGC CCAGGCCGGC GAATTGCGTC GCGAATTCCG GGAAATGGAC
GGCCGGGATG AAGCGGCGCG GAAGGCCGAA GAAGCCCAGG CGACTCTGGC CACGATCCGT
CGTCTGGCCG AGCGCTATGC CCAACTCCGT TTGGCGTCCA CGGTCCTGGA CGAGGCCATC
GAGCGCTACC GCGCTGAAAA CCAGGATCCT GTCCTGACCT TGGCCAGCGG GTATTTCCAG
GAGGTAACCC TCGGCTCCTT CGATGGTCTG CGCACCGATC TCGACGACAA GGGCAACCAG
GTCATTGTCG GTCTGCGCGG CGAAGAGCGC GTCCCGGTCG CGGGCATGAG TTCCGGAACC
CGCGACCAGC TGTATCTCGC CCTGCGTCTG GCGTCCTTGG AGCACCGGCT GCACAGCAGC
GAGCCCATGC CGTTTATCGT CGACGACATT CTTGTCAATT TCGACGAGCA GCGCACCCGG
GCCGCGTTGC AGGCCTTGGC CCGCCTGGGG GCAAAGAACC AGGTCCTGGT GTTTTCCCAC
CACGACCAGG TCGCTACCGC AGTCCGCGAA CTCGGCCTGG GCGCGGTGTA TGAATTGCAG
GCTTCAGCCT GA
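
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The following is a small sketch of how a figure such as the GC content could be recomputed from the listing. The helper names are hypothetical, and the record does not say how its own percentage was derived or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGAGGATCG CCCGTTTCAC GCTCGACGCC TTTGGTCCGT TTACCGACAC CAGTCTGGTT"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this line only, not of the whole gene
```

Passing the full listing through `clean_sequence` first gives the value for the whole gene.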
 
Protein sequence
MRIARFTLDA FGPFTDTSLV FDPQPLHLIY GPNEAGKSST LRALTAWLYG FPERTTDNFL 
HANPRLCVSG TLENGRGDTL TFSRRKKRKG SVLDAGGNAV DPQVVAAWLQ GLDQETFQAL
FGLDHPGLVQ GGTTILQEQG STGTTLFAAG SGIASLRHVQ KGLQEEYEAV FKPSGSKPDL
NAALHRYSEL KKESNQLGLA STAWKHKERA LREAQKGLDA LKAERAVQQT ERRRLERIQQ
AMAPLARLRK AEAGLSELGP VPDLPDDFPK RRQESQDQLR EAQQSLDNAQ KRLANDQHKL
DGISLNNELL DQAATIADLH QRLGAYRKGQ ADLRSLEDDE LEHRAAARDV LRGQRPDLDP
ADSDTVQQLL QHRPRVQALS ERKPLVDKEL SDSRTLWESK QEELAQVRDA SKTLPHVQDT
TLLHQAAVHA GRLGDIDTEI ATKRSGCDRL HQECATMFAQ LGKWHGTPEE ALRLTLPQEE
KVQLLKQEWE DTERRSTELE TQRQELSETF ADLDEQLQAQ KEAGAVPSEE DLDARRRERD
AAWSLLRRQW VDGEDVSNAV TEIVSGGNLI QHFERTLHEA DSLADRLRRE SERVHTHARL
VAQYDRARQH WAALQEAQER LEAAWQDLQQ RWRTTWDGTG IEPDDPRPML SWLHRFQKGV
DQARRLYEDR FALQELEGKR EAARVNLVRE LQALEEAVPH GEELTPVRER ADAVLKAREE
LARKQQQLQD EQVRLERETA TAQHRLTAAE EARSAWQASW GQAMQELGLP EDATPETVHN
FFSDLETYLS RANAADGLRK RIEGIRRDSQ ELERDVASLV ERTAPDLAAI PVDQAVERLN
GQLQRERDLA TTVAHYQERI ASTQEEIESA RIARDAASRE LAALCELAGC TEAEELPGIE
KRWQEHRTLE QNKGAALDEL GEIAAATDVE ELIAEVAAED PDELPARIQS CDEELDRLDG
EIEHQSAQAG ELRREFREMD GRDEAARKAE EAQATLATIR RLAERYAQLR LASTVLDEAI
ERYRAENQDP VLTLASGYFQ EVTLGSFDGL RTDLDDKGNQ VIVGLRGEER VPVAGMSSGT
RDQLYLALRL ASLEHRLHSS EPMPFIVDDI LVNFDEQRTR AALQALARLG AKNQVLVFSH
HDQVATAVRE LGLGAVYELQ ASA
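
The protein sequence corresponds to the gene sequence translated with table 11, the bacterial, archaeal and plant plastid code. Table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it allows. This gene starts with ATG, so a plain codon lookup is enough. The sketch below is one way to reproduce the translation; the `translate` helper is illustrative and is not the tool that generated this page.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its last 12 bases.
print(translate("ATGAGGATCG CCCGTTTCAC GCTCGACGCC"))  # MRIARFTLDA
print(translate("GCTTCAGCCT GA"))                     # ASA
```

Both outputs match the start and end of the protein sequence listed above.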