Gene Dret_1055 details

Gene Information

Locus tag: Dret_1055
Symbol:
ID: 8418880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1246362
End bp: 1247924
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 55%
IMG OID: 645037627
Product: choline/carnitine/betaine transporter
Protein accession: YP_003197921
Protein GI: 258405179
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
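
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are inclusive and that the final codon of the coding sequence is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1246362
end_bp = 1247924

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence should contain a whole number of codons.
assert gene_length_bp % 3 == 0
codons = gene_length_bp // 3

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.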


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.144567
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0392371
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGG ATCTGCGAAC AACGTCACGG CGCACGACGG CGACGTGGCA GAATCTGGAA 
AACAAGATAC TGAAATACGA TATACATCCA TGGGTGTTTT TCGGCGGCGG GGCGGTTATC
ATCCTCGGCG TGGCTTTGAC CTTGATTGCA GGCGAGACGG CCTCGACCTT ATTCAGTTCC
GTGCAGACCT GGATTGCCAC ATATACCGGT TTCTTTTTCG TGCTGGTCAT GAACGTGGTC
CTGGTCTTCT GCTTTTTCCT CTTATTCACC AAAATGGCGT CGATGCGCAT CGGCGGAGAG
GACGCTGAGC CGGAATTCTC CACCATGGGG TGGTTCGCCA TGCTTTTTAG TGCTGGAATG
GGTATCGGGA TCCTTTTCTA CGGGGTGGCC GAACCCATGT TCCACTATGT GGCCAATCCC
CTCTCAGAAC CCGGTTCTCC TGAAGCGGCG CGCATGGCCA TGGAATTGAC CTTCCTGCAT
TGGGGCCTGC ACCCTTGGGG TATTTATGCC CTTGTCGGCC TCGGCCTCGC CTTCTTCGGC
TTTTCCGAAG GACTACCGCT TTCCATCCGT AATATTTTCT ATCCCCTGCT TGGCGACAAA
ATTTACGGCC CCATCGGTAA TTTGATCGAT GTCTTGGCCA CGGTGGCAAC GCTGTATGGG
GTGGCGACTT CCCTGGGGCT CGGGGTCCAA CAGGTCAATG CCGGACTGGC CCATTTGTTT
GGCATTCCGC AAAATCCCTG GGTTCAATGC GGCTTGATTG CCTTGATCAC CGCCATTGCG
ACCTGGTCCG TCGTTCGCGG CCTGGACGCG GGCATCAAAT TTTTGAGTGA ATTGAACATG
GCCGCCGCCG GGTTGCTGAT GCTCTTTGTC CTCTTGTTGG GGCCGACCCA ATTTATCCTT
AACGGTATTT TGGAGAATAT TGGGAATTAT ATTCAGGATT TTGCGCATCT TGCCACCTGG
AACGAGACCT ACACCAACGG CGAATGGCAA AACGGCTGGA CGGTTTTTTA CTGGGGCTGG
TGGATCGCCT GGTCTCCGTT TGTGGGCATG TTCATCGCCC GGGTTTCCTA TGGCCGGACC
ATCCGGGAAT ACCTGCTCGG CGTTCTGCTT GTTCCCGTCG CTGTGACCTT TGTCTGGATG
ACCGTGTTCG GCAACAGTGC TTTGTTCATC GAGCATTTCG GGGCTGGGGG ACTGGCCAAG
GCGGTACAGG AGAACATTCC TGTCTCCTTG TTTGTCTTTT TGGAACATTT TCCCTTGTCT
ATGCTGACCT CCCTTTTGGC AGTCGTTGTC GTCATCACGT TTTTTGTGAC CTCCTCTGAC
TCCGGGTCCA TGGTCATTGA CATCATCACC GCCGGAGGTA ACCCGGATCC GCCGACTCCG
CAGCGCCTGT TCTGGGCCGT TTTGGAAGGC GTTGTCGCTG CGGTCCTGTT GCTCGGCGGC
GGCCTGGTCG CCCTGCAGAC AGCCGCCATC ACAACCGGGT TGCCGTTTGC GGTGGTCATA
TTGATGATGT GCTGGGCCGT GTATCGCGGT CTGCATGACC ATTGGATGCG CTACTACGAC
TAA
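
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before it can be measured. The sketch below shows one way to strip that layout and compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input string is a toy example rather than the gene itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same block layout (not the real gene sequence).
toy = clean_sequence("ATGC GGCC\n")
print(len(toy), gc_percent(toy))
```

To apply this to the record, paste the full gene sequence block into `clean_sequence` and pass the result to `gc_percent`. The cleaned length can also be compared against the Gene Length field.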
 
Protein sequence
MSEDLRTTSR RTTATWQNLE NKILKYDIHP WVFFGGGAVI ILGVALTLIA GETASTLFSS 
VQTWIATYTG FFFVLVMNVV LVFCFFLLFT KMASMRIGGE DAEPEFSTMG WFAMLFSAGM
GIGILFYGVA EPMFHYVANP LSEPGSPEAA RMAMELTFLH WGLHPWGIYA LVGLGLAFFG
FSEGLPLSIR NIFYPLLGDK IYGPIGNLID VLATVATLYG VATSLGLGVQ QVNAGLAHLF
GIPQNPWVQC GLIALITAIA TWSVVRGLDA GIKFLSELNM AAAGLLMLFV LLLGPTQFIL
NGILENIGNY IQDFAHLATW NETYTNGEWQ NGWTVFYWGW WIAWSPFVGM FIARVSYGRT
IREYLLGVLL VPVAVTFVWM TVFGNSALFI EHFGAGGLAK AVQENIPVSL FVFLEHFPLS
MLTSLLAVVV VITFFVTSSD SGSMVIDIIT AGGNPDPPTP QRLFWAVLEG VVAAVLLLGG
GLVALQTAAI TTGLPFAVVI LMMCWAVYRG LHDHWMRYYD