Gene Dret_1434 details


Gene Information

Locus tag: Dret_1434
Symbol:
ID: 8419263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1667300
End bp: 1668508
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 61%
IMG OID: 645038009
Product: domain of unknown function DUF1745
Protein accession: YP_003198299
Protein GI: 258405557
COG category: [S] Function unknown
COG ID: [COG3287] Uncharacterized conserved protein
TIGRFAM ID:
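
The length fields above are consistent with the listed coordinates. The sketch below shows the arithmetic in Python, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the last codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1667300, 1668508)
print(length)                  # 1209
print(protein_length(length))  # 402
```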


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.318192
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.236861
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATTG CCACGGGATG GTCGACGCAG AAAATTGCCG GGCGAGCGGC ACGTGAAGCC 
ATGGCCGGAA TAGCGGCTGC AGGGGATCGC CATCCTGATT TCGTTCTCTG TGAGTTGACG
GAAGACTATG ATGTCGAAGA GGTCCTGGCT CTATTGACCC AAACGTGGCC GGAGACGCCC
GTGCACCTGG CCACAACCTG CCGCGGTGTC TTGTTGCGCC GGGGGTGGAC GAGCAGTGCG
GGACGGGTGC TTGGATTGTG GAGTGTTTTC GATGCCCAGG GGGCTTTCGG CACCGGCGGC
GTGCCTCTTG AGGATCGTCC GAGTGCGGCC GCAGCTCTGG CGACCCGGCA GGCCCTGGAT
CAGGCCGGAC GAGCCGGGGA GTTGCCCTCG TTGATCTGGA TCAGCACCGC GCCGGGGTGC
GAGGAACAGG TCCTGGCCGG TATCGAATCC GAAGTGGGCA CCAATGTTCC CATTCTGGGC
GGGAGCACTG CTGACAACGA TGTCCTGGGG CGCTGGTCCC AGGGGACCAA AGCCGGAACG
ATGTCCAACG GCGTGGTCGT CTCGGTGTTT TTTCCCTCGG TGGAGATCGG CTATTCCTAC
CACAATGGGT ATCTGCCCCA GGCCCAGTCC GGTACCGCCA CTGAAGCCGA GGGGCGCCTG
ATTCGCTCCA TTGACGGCCG CCCAGCTGCA GAAGTGTATA ACGAGTGGAC CCAGGGTTTG
ATCGGGCCGA CTTTGTCCGA AGGCGGCAAT ATCTTTGACA AGACGACGTT TTGGCCTCTG
GGACGGGTGC GGGGATGGCT GAACAACATC CCGCTCTATG TTTTGGCTCA TCCTGAGCGG
GCCGAGCCAG ACGGGGCTTT GCGACTGTTT GCCGATGTTG AACAAGGCGA AACAGTGGTT
TTGATGTCCG GGACGCGCAA TGGTCTCATC CGGCGGGCGG GGCGGGTGGC CGAAAGCGCG
CTGGACAGCC TGGACGTGCT GCCTTCTCAG ATTTCTGGTG CGTTGGTGAT CTTTTGCGCC
GGTTCGATGG TTGCCATCGA GGAGCATATC GATGAAGTGG CGCAATCCAT TCACCAGGTC
CTGGGCGATG TCCCCTATCT TGGCTGTTTT ACATTCGGTG AGCAGGGGCG GCTCGTGGGC
GGGGGCAACC ACCACGGCAA TTTGATGATT TCCGTGGTCG TCTTCAGTGA TCAGGAGGCG
GTTTTTTGA
 
Protein sequence
MQIATGWSTQ KIAGRAAREA MAGIAAAGDR HPDFVLCELT EDYDVEEVLA LLTQTWPETP 
VHLATTCRGV LLRRGWTSSA GRVLGLWSVF DAQGAFGTGG VPLEDRPSAA AALATRQALD
QAGRAGELPS LIWISTAPGC EEQVLAGIES EVGTNVPILG GSTADNDVLG RWSQGTKAGT
MSNGVVVSVF FPSVEIGYSY HNGYLPQAQS GTATEAEGRL IRSIDGRPAA EVYNEWTQGL
IGPTLSEGGN IFDKTTFWPL GRVRGWLNNI PLYVLAHPER AEPDGALRLF ADVEQGETVV
LMSGTRNGLI RRAGRVAESA LDSLDVLPSQ ISGALVIFCA GSMVAIEEHI DEVAQSIHQV
LGDVPYLGCF TFGEQGRLVG GGNHHGNLMI SVVVFSDQEA VF
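
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following Python shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so an alternative start codon such as GTG or TTG would need special handling that this sketch omits; the gene here starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases, 20 codons).
first_line = clean(
    "ATGCAGATTG CCACGGGATG GTCGACGCAG AAAATTGCCG GGCGAGCGGC ACGTGAAGCC"
)
print(translate(first_line))  # MQIATGWSTQKIAGRAAREA
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` allows the GC content and protein sequence fields to be checked against the record.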