Gene Dret_1852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1852 
Symbol 
ID8419693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2123525 
End bp2124451 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content54% 
IMG OID645038436 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_003198714 
Protein GI258405972 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.94668e-11 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA CGTTTCTTCT CATTTCCCTG TTGTGCCTTA CCCTTCTCTT TCCCGCCGCC 
GCTTTCGCGC AAAAAGACAC CATCCGTCTC GGCGTACCCC CCTGGCCTGG GGTCACTGTC
AAAACCGAAG TCGCCACCCA AATCCTTGAA GCCATGGGGT ATGAAACCCA ACAGTTGGAA
ATCGGCCCGC CCATTATCTA CAAAGGGCTG ACCACCGGCG AAATCGACGC CTACCTGGCC
GCTTGGCTGC CGCAGCAAAC GGACATGTTC GAGCCGCTCA AGGAAAAAGG CGCTATCGAT
GTCATCAATA TCAATCTTGA CGACGCCATG ACCGGTTTTG CCGTTCCGAC CTATGTCTGG
GAAGCCGGTA TCCACTCCGT TGCCGATCTG GCCCCCAACG CCGACAAATT CGACTCCACG
TTGCACACCA TCGAAGTCGG CAGCGGCATG CACACCACGA CAGAGGAAAT GGTGAAAAAC
GATGTGGCCA GCCTTGGCGA CTGGGAACTC GCCAGCAGCA CCACCCCGGC CATGCTCACC
GAAGTGAATG AAAAGACCAA GAGCAAGGAA TGGGTTGTTT TCCACGCCTG GAAACCGCAT
TGGATGACTA TCAAGATCGA TATGAAATTT CTTGAGGGCG TCCCTGGTTC CGAGGATCTC
ATCAGTGAGA GTGTCGTCTA CAACGTGGCC AGCCCAGACT TTCAAGAGCG TTTCCCCCAA
GCTCGCAAGT TCTTGGAAAA GTTCTACGTT TCTGGAGACA CCCAGAGTGC CTGGATCCAC
TCTTTCAGCT ATGAGAAAAA AGATCCTGAA GATGTCGCCC GCGAGTGGAT CGCCAATAAT
ATGGAAACAG TGAGCCAATG GCTGGACGGG GTAGAAACCA CCGACGGCCG GCCGGCCATC
GACGCAGTCA AGAACGCCGT CAAATAA
 
Protein sequence
MKKTFLLISL LCLTLLFPAA AFAQKDTIRL GVPPWPGVTV KTEVATQILE AMGYETQQLE 
IGPPIIYKGL TTGEIDAYLA AWLPQQTDMF EPLKEKGAID VININLDDAM TGFAVPTYVW
EAGIHSVADL APNADKFDST LHTIEVGSGM HTTTEEMVKN DVASLGDWEL ASSTTPAMLT
EVNEKTKSKE WVVFHAWKPH WMTIKIDMKF LEGVPGSEDL ISESVVYNVA SPDFQERFPQ
ARKFLEKFYV SGDTQSAWIH SFSYEKKDPE DVAREWIANN METVSQWLDG VETTDGRPAI
DAVKNAVK