Gene Dret_1512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1512 
Symbol 
ID8419341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1756071 
End bp1757528 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content57% 
IMG OID645038086 
Productprotein of unknown function UPF0061 
Protein accessionYP_003198376 
Protein GI258405634 
COG category[S] Function unknown 
COG ID[COG0397] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.385999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.55294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCAAT TTGATACGAG CTACGCACGT CTTCCACACC CCTTCTATAC ACGGGTGAGT 
CCGGCCCAGG TCCCCAAACC GGAACTCATC ACCTTCAACA GCGATTTGGC CAGACAATTA
GGGGCCCAGG ACACAAACCA ATCAGACACG CAATTGGCGG AGATCTTCAG CGGCCAACGC
CTGCTCCCTG GCTCCGAGCC CATTGCCATG GCCTATGCCG GCCATCAATT CGGCAACTTC
GTGCCCCAAC TCGGAGACGG CCGGGCGCTC TTGCTGGGAG AAGTGGTGGG TCCTGCCGGG
CACCGGTTTG ACATTCAGCT CAAAGGCTCC GGGAGAACCC CTTTTTCCAG AAACGGCGAT
GGCAAGGCCC CCCTTGGTCC CGTCCTGCGG GAATACATTG TCAGCGAGGC CCTGCACCAT
CTGGGCATGC CCACCACTCG TTCCCTGGCC GCGGTCAAGA CCGGGGAAAC GGTACGCAGG
GAAACCCCGC TTCCCGGAGC TATCCTGACC CGGGTCGCCT CAAGCCATAT CCGGATCGGC
ACCTTCGAAT ACTTTGCTTT CAAACAGGAC GTGGACAATC TCAAGCGGTT GACCGAGTAC
TCCATCCAAC GGCATTATCC CGAAATTCAA AACGATCCTC AGGCCGATCT GCACTTTTTC
CGCAACGTGG GACAGACCCA GGTCGAGTTG GTGACCGGCT GGATGGCGCT GGGCTTCATT
CACGGGGTGA TGAACACGGA CAACATGTCC ATCGCCGGAG AATCCATCGA CTTCGGGCCC
TGCGCCTTCA TGGACCAGTT CCGGTTCGAT CAGGTCTTCA GCTCCATCGA CCAATTCGGC
AGGTACAATT ACTCCAACCA GGCCCAGATC ACGCTCTGGA ACCTCTCCAG ATTCGGCAAT
TGCCTGCTTC TTCTGCACCA GGACATATCA GAAGACGAAC TCCGGAAATG GAACCAGGAA
CTGGAAGCCC TTCAGGACCT TTTCGAACAC AGGCATACAA TAAAGATGCT GGAAAAATTG
GGGATATTTG ACTACGAACG CCCCAACGAC CGCGAGCTTA TCCAAAAGTG GCTCCGCTAT
CTGGAAGCCG AGCATCTCGA TTACACCTTG AGCTTTCGCG CCCTGGCGCC CCTGGTGGAC
CAGGAGGCAG GCTGGGGGGC TTTCAAAAAG ACCGAGGCCT TCCAGGAGTT TTACAACCTC
TGGCAAGAGC GCCTTCACCA GCAGGACCTG GAAGTCACGG CCATACAGGA ACGAATGAAC
CAGGTAAATC CCGTGTTTAT TCCCAGAAAC CACAAGATCG AAGAGGTTAT TGAAGCTGGC
CGGAGGGGTG ACTTCAGCCC CTTTCATGAG ATGAACGAGG TACTGCAAAC CCCATTTCAC
GAACAAGAGG GACGCAGCAG CTACCGAGCA GCTCCCAGAC CGGAGGAGAT GGTCCAGGCT
ACGTTTTGCG GGACCTGA
 
Protein sequence
MFQFDTSYAR LPHPFYTRVS PAQVPKPELI TFNSDLARQL GAQDTNQSDT QLAEIFSGQR 
LLPGSEPIAM AYAGHQFGNF VPQLGDGRAL LLGEVVGPAG HRFDIQLKGS GRTPFSRNGD
GKAPLGPVLR EYIVSEALHH LGMPTTRSLA AVKTGETVRR ETPLPGAILT RVASSHIRIG
TFEYFAFKQD VDNLKRLTEY SIQRHYPEIQ NDPQADLHFF RNVGQTQVEL VTGWMALGFI
HGVMNTDNMS IAGESIDFGP CAFMDQFRFD QVFSSIDQFG RYNYSNQAQI TLWNLSRFGN
CLLLLHQDIS EDELRKWNQE LEALQDLFEH RHTIKMLEKL GIFDYERPND RELIQKWLRY
LEAEHLDYTL SFRALAPLVD QEAGWGAFKK TEAFQEFYNL WQERLHQQDL EVTAIQERMN
QVNPVFIPRN HKIEEVIEAG RRGDFSPFHE MNEVLQTPFH EQEGRSSYRA APRPEEMVQA
TFCGT