Gene Dret_0343 details

Gene Information

Locus tag: Dret_0343
Symbol:
ID: 8418148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 427906
End bp: 428847
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 57%
IMG OID: 645036909
Product: Substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_003197223
Protein GI: 258404481
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID: [TIGR03414] choline ABC transporter, periplasmic binding protein
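
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the gene record above.
START_BP = 427906
END_BP = 428847
GENE_LENGTH_BP = 942
PROTEIN_LENGTH_AA = 313


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 942
    print(coding_length_aa(GENE_LENGTH_BP))  # 313
```

Under those assumptions, both values agree with the gene length (942 bp) and protein length (313 aa) listed in the record.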


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGC GGCTTTCTAT CGGACGGATT CTTTTCACTG TGCTGGTCTT CTCTCTTCTT 
GCCCTGCCTG CCATGGCCTC CGGGCCGGTC AAATTCGGTG TCCCTTCCTG GCCGGGGGTA
ACGGTCAAAT CCGAGGTCGC CTCACAACTC ATCCGGGCCA TGGGCTATGA GGTCGAGCAG
ACGGTGGCCT CGCCCTCGAT CATTTTTAAG GCGATGACCC TGGGCGAGTT AAACGCCTAT
CTCGGTGGGT GGTCGCCGGT AGAAGATCCC ATGATTGATC CTCTGGTGGA AAAAGGGGAG
ATCATTCGTG TCGGGGCCAA CATTGAAGAA GCTGTGACCG CCCTGGTCGT TCCCTCGTTT
GTTGCCGAGG CCGGAGTGAC TTCCATTGAA GATCTGGCGG CGCACAAAGA CAAGTTTGAG
AGTACGATTT ACGGCATTGA ATCCGGGTCC GGGGCCAACA ACGATATCCA GGAAGCCATC
GATGCCAATG CCGCCGGGCT CGGCGATTGG GAACTGGCCG CCTCGTCCAC AGCTTCCATG
CTGGCCCAGG TGCAGAGCCT GAGTGAGAAC AAACGGTGGG CCGTTTTCTG GGGTTGGGAG
CCGCATTGGA TGAACGCGGT CATGGATCTG CATTACCTCC AATCCGAAAC CCCGGCGACG
GAAAAAATCG GGGCTTCGGT CAGCGTAGTC TACACCATCA CCTCGAACGA CCTCCCTGAA
GCCAATCCCC AAGCCTACGC GTTTCTGGAA CAGCTCAAGG TGCCTTCCGA TGTCCAGAGC
CAGTGGATCT ACGAGTATCG CCAACAAGAC AAAGAACCTG AAGATCTGGC TCCACAATGG
ATCAAGGCCA ATCTTGATGG GCTGGTGGGG CAGTGGCTGG AAGGTGTCCG TGCTGCCAAC
GGTGAGCCCG CCCTGAAAGT TGTCCGCGCG GCGTTCAAGT AA
 
Protein sequence
MSKRLSIGRI LFTVLVFSLL ALPAMASGPV KFGVPSWPGV TVKSEVASQL IRAMGYEVEQ 
TVASPSIIFK AMTLGELNAY LGGWSPVEDP MIDPLVEKGE IIRVGANIEE AVTALVVPSF
VAEAGVTSIE DLAAHKDKFE STIYGIESGS GANNDIQEAI DANAAGLGDW ELAASSTASM
LAQVQSLSEN KRWAVFWGWE PHWMNAVMDL HYLQSETPAT EKIGASVSVV YTITSNDLPE
ANPQAYAFLE QLKVPSDVQS QWIYEYRQQD KEPEDLAPQW IKANLDGLVG QWLEGVRAAN
GEPALKVVRA AFK
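
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation, and the GC content by counting bases. The following is a minimal sketch, not the method the database itself uses. It assumes the sequence is pasted in the blocked layout shown above (whitespace is stripped) and contains only A, C, G and T. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation; table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG. All names (`CODON_TABLE`, `clean`, `translate`, `gc_percent`) are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGAGCAAGC GGCTTTCTAT CGGACGGATT"))  # MSKRLSIGRI
    # Final bases of the gene sequence, ending in the TAA stop codon.
    print(translate("GCGTTCAAGT AA"))                     # AFK
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole record to be checked the same way.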