Gene Dret_0536 details


Gene Information

Locus tag: Dret_0536
Symbol:
ID: 8418344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 644879
End bp: 645916
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 61%
IMG OID: 645037100
Product: ApbE family lipoprotein
Protein accession: YP_003197411
Protein GI: 258404669
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
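
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon. The following is a minimal sketch of that arithmetic; the function names are hypothetical and are not part of the database record.

```python
# Hypothetical helpers; the inclusive-coordinate convention is an assumption.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information fields above.
length = gene_length(644879, 645916)
print(length, "bp")                   # 1038 bp
print(protein_length(length), "aa")   # 345 aa
```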


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAGA CTGCAAGCAA CGACGTCCAA TCCACACGCC GCTCTTGTCT GAAATTCCTT 
GGTCTCGGCG CCTTGGGGAT CGCCACACCT ATGCTCGCCG CGCGCCCCGG ACAAGCGGCC
CAGGATCTGC CTCAGGCGCA AAGCACTCGT CCCTTGATGG GGACCATGGT CACGGTCACG
GTTCTGGATT CCTCCCGTGA CAAGGCCCAT GAAGCCGCGC AAAGCGCCCT GGACACCATG
CAGGCCCTAA CCCCGGTGCT GGACCGCCAC GCACCCGGGA CACCTTTGAG TGAACTCAAT
GCCCAGGGCC GGTTGAAGGA TGTGCCGCCG CAACTCGCCG CCGTATTGTA CCAGGCTGGC
TATTTCTATA CTGTCAGCCA GCGGGCCTTT GATGCGAGTA TCCTGCCCCT GTTGACCCTG
ACCAAAGAGA CCTTTGCGAA CCACGGTGAG GCCCCCTCGG AGACGGCCGT ACGCCGGGTC
CGCGACCGTA TCGGGTTCGA TAAGGTCCAC ATTGGCCCCG ACGGCGTGCA ACTCCAGGAC
GGCATGCAGC TGACCCTGGA CGGTATCGCC AAAGGGCATA TCATCGATCA GGCGGCCAAT
ACCTTACAAC AAATGGGGAT CCGCTACGCC CTGATCAACG CCGGAGGCGA TATCCGGGCC
CTGGCTGGCA AAGGCCCCGG TGCAGCGTGG CGGGTCGGGA TCAAAGATCC ACGAGGCCGC
AAACCGTTTA TCCAGACCAT GGCTCTTAAT AATGGGGCTC TGGCGACGTC CGGCAACTAC
GAACACTATT TCGACCGCAA TAAGGCCCAC CACCATATTA TTGACGCCAA TTCCGGCCAT
TCCCCGCGGG CAGTCAGCAG TGTCAGCGTG GTCGCGCCTA CAGTGGCAGA AGCCGACGCC
CTGTCCACCA CCCTGTTTGT CAAACCGCGT CCAGAGGCCG TCGCCTTTGC CAATTCGCTT
CCGAATACCG AGGCCCTGCT TCTCGACGAC GAACTGCGTG AAACGCACAC GCAGGGATGG
CTGGGGGAGT GGGCCTAA
 
Protein sequence
MSKTASNDVQ STRRSCLKFL GLGALGIATP MLAARPGQAA QDLPQAQSTR PLMGTMVTVT 
VLDSSRDKAH EAAQSALDTM QALTPVLDRH APGTPLSELN AQGRLKDVPP QLAAVLYQAG
YFYTVSQRAF DASILPLLTL TKETFANHGE APSETAVRRV RDRIGFDKVH IGPDGVQLQD
GMQLTLDGIA KGHIIDQAAN TLQQMGIRYA LINAGGDIRA LAGKGPGAAW RVGIKDPRGR
KPFIQTMALN NGALATSGNY EHYFDRNKAH HHIIDANSGH SPRAVSSVSV VAPTVAEADA
LSTTLFVKPR PEAVAFANSL PNTEALLLDD ELRETHTQGW LGEWA
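
The gene and protein sequences above are related through translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The sketch below shows one way to strip the 10-base block spacing, compute GC content, and translate the reading frame. The function names are hypothetical, and the code assumes the sequence contains only A, C, G and T.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon; a stop codon is written as '*'."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))


# First line of the gene sequence above, with its original spacing.
first_line = "ATGTCCAAGA CTGCAAGCAA CGACGTCCAA TCCACACGCC GCTCTTGTCT GAAATTCCTT"
print(translate(first_line))  # MSKTASNDVQSTRRSCLKFL
```

Applying translate to the full gene sequence gives the protein followed by a trailing '*' for the TAA stop codon, and gc_content on the same input can be compared with the GC content field.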