Gene Dret_1742 details


Gene Information

Locus tag: Dret_1742
Symbol:
ID: 8419582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2005333
End bp: 2006448
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 61%
IMG OID: 645038325
Product: glycosyl transferase group 1
Protein accession: YP_003198604
Protein GI: 258405862
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
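
The coordinates and lengths listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2005333
end_bp = 2006448
gene_length_bp = 1116
protein_length_aa = 371

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codons = gene_length_bp // 3
expected_aa = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("expected protein length (aa):", expected_aa)
```

Under these assumptions the computed span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.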


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.711443
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.704345
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTTT ACGAAGTTAT CAATGTCCGC TGGTTCAATG CCACCGCCTG GTATGCTATC 
ACCCAAGCCC GGCTGCTTAC GGCCCACGGC CATGAGGTCA TGGTCGTCTG TCTGCCGGAC
TCGCCAGCCC ATCTGAAGGC CCTGGAATAC GGGCTTCCTG TCGCCACCCT GGACCTGAAC
ACCACCTCGC CGCTGGGCAT AGTCCGGCTG TACGTCCGCA TGCGCAGGCT GCTCCGGGAA
TTTCCCCCGG AAATAGTCAA TTGCCACCGC GGGGAGGCCT TTGTCCTTTG GGGACTCCTG
AAATTGCAAA GCAGGGGGCT TTTTCGGCTC GTGCGCACCC GGGGCGACCA GCGTCCCCCC
AAAAACAACC GCGTCAACCG CTGGCTGCAC CGGAGTCTGG CAGACGCCGT AGTTTGCACC
AACTCGGCCA TGGCCCGCCA TTTCCGGGAT ATCCTTGGAC TGCCTGCCTC CCACCTCTGG
CTCATTTTCG GTGGTGTGGA CCGGGACCGG TTCAAATACG ATCCTGAAGG ACGACACACT
GTGCGACAAC GCTACGGCTT TGGCCCCCAG CACAAGGTGG TTGGTCTCTT GGGACGATTC
GACCGCGTCA AGGGACAGTG GGAATTGCTC CAGGCCGTAT CCCGTCTCTA CCATGACGGG
ATGAGCGACC TCCGGGTCTT GCTCATCGGT TTTACCACCG CCACTTCCCA GGCGGAAGTG
GAAGGATGGA TTCAGGACCT GGGGCTGACC GAAGTGGTCC ATATCACGGG ATTGACTGAA
GACGTCCCCG CCTGTCTCTC GGCCCTGGAT CTCGGTATTG TCAATTCCTT GTGGTCGGAG
ACCATTGCCC GCGCCGCCCT GGAGACCATG TCCTGTTCCG TCCCGCTTAT TGGAACCACC
GTCGGAGTCC TGCCCGACCT GCTCTCCCAG GAAGCCCTGG TTCCGCCAGG CGACAGCGGA
GCCCTGGCCG ACCGCATCCG GGACGTTTTC AGCGATCAAT CCCTTGTGCA GCGGCTGCAC
AAGCAGCAGG AAGAGACCAT CCGCGATCTC TCCACCGACC ATTTTGTTCA ACACACCCAG
AGTCTGTACA CCCGGCTCCT GAGTCCCGAC CGATGA
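
The GC content reported in the table (61%) is the share of G and C bases in the gene sequence. The sketch below shows one way to compute it, assuming the sequence is pasted as space-separated blocks of ten bases as displayed here. The function name `gc_percent` is hypothetical, and the sample uses only the first line of the sequence, so its result is not expected to equal the full-gene value.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (60 bases), used as sample input.
first_line = (
    "ATGCGCGTTT ACGAAGTTAT CAATGTCCGC TGGTTCAATG CCACCGCCTG GTATGCTATC"
)
print(round(gc_percent(first_line), 1))
```

To check the tabulated figure, the full 1116-base sequence would be passed in place of `first_line`.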
 
Protein sequence
MRVYEVINVR WFNATAWYAI TQARLLTAHG HEVMVVCLPD SPAHLKALEY GLPVATLDLN 
TTSPLGIVRL YVRMRRLLRE FPPEIVNCHR GEAFVLWGLL KLQSRGLFRL VRTRGDQRPP
KNNRVNRWLH RSLADAVVCT NSAMARHFRD ILGLPASHLW LIFGGVDRDR FKYDPEGRHT
VRQRYGFGPQ HKVVGLLGRF DRVKGQWELL QAVSRLYHDG MSDLRVLLIG FTTATSQAEV
EGWIQDLGLT EVVHITGLTE DVPACLSALD LGIVNSLWSE TIARAALETM SCSVPLIGTT
VGVLPDLLSQ EALVPPGDSG ALADRIRDVF SDQSLVQRLH KQQEETIRDL STDHFVQHTQ
SLYTRLLSPD R
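
The protein sequence follows from the gene sequence by codon translation. The sketch below illustrates this with the first 30 bases, assuming the standard codon assignments, which NCBI translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGCGCGTTT ACGAAGTTAT CAATGTCCGC"))  # MRVYEVINVR
```

The output matches the first block of the protein sequence shown above. Passing the full gene sequence should, under the same assumptions, reproduce the whole 371-residue protein.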