Gene Dret_0291 details


Gene Information

Locus tag: Dret_0291
Symbol:
ID: 8418095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 360297
End bp: 361427
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 60%
IMG OID: 645036856
Product: glycosyl transferase group 1
Protein accession: YP_003197171
Protein GI: 258404429
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
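
The coordinate and length fields above are related by simple arithmetic, which the following sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(360297, 361427)
print(length_bp)                  # 1131
print(protein_length(length_bp))  # 376
```

Both results agree with the Gene Length (1131 bp) and Protein Length (376 aa) fields.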


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCCG GTACATCGCC TCAACGTATC GCTCTCATGC TCCCCAGACT GAGCCGCTAC 
GGCGGGGCGG AACGTTTTGC CTCGCGCTTA GCCAACCATT TGGGTCAAAC CGGGTTTGAC
GTGGACTTTA TCTGCGCCCG GCAGGAATCA GAAGCGCCCC AAGGCGTGAC CCCCAGGGTG
GTTGGTCGCA AGGGGCTCTG CCGCAGCGGC AAGATCCTCT GGTATGCCAT GGCGGCAGAA
CGGCAGCGCC GGGCAGGGAA CTACGACCTC ACCTTGAGCA TGGGCAAGAC CTGGAACCAG
GATGTCTTAC GTTTGAGCGG CGGGCCGCTG CCCGTATTCT GGCGTCTGTC CAAACAAGCC
TACGACCCGG GTATGGCTCG GACGTGGAAA ATGCTGCGCC GGAAAACTGC TCCGGCCAAC
CGGCTGATCA ATTGCATCGA GCGCCGCCAG ATGCGGACGA CGTCCCATTT TGTGGCTGTC
TCCGACAAAC TCGTCGATTG GGTCCAGGAG GCCTACCCTT CGTTTGACAC AAGCCGCATC
CAGGTTATCT ATAATCAGCC AGATCTGACC GCTTTCGAAC CGTATCCCCG CGCCAAGCAG
AGGGCTGAAC GGCAACAACG CGGGCTTGCG CCAGACATGA TCTATATCGG CACTGCTGGG
ACAAATTTCG CACTCAAAGG GGTCGGCTGC CTCATTGCCG CCCTGGCGCA ATTGCCGGAT
TCCCACCATC TTCTCGTGGC TGGCGATCGC AATCCGGACC GGTATCGCAA ACAGGCCCAG
CGTCTCGGGG TCGCGCACCG GGTAACCTTC CTCGGCCGAG TAGAAGACAT GACCGGATTT
TACAACTGCC TGGACGCCTT TGCCCTGCCG ACCTTTTACG ACGCCTGCTC CAATGCAGTC
CTGGAAGCCT TACGCTGCGG CATACCGACC CTGTCGAGCT CTGCAAATGG CAGCAGTGTT
TTCCTGGATC CGGAAAACAC CATCAAAGAT CCCCACGATA CACAGAACTT AGCTCGAACC
TTGCGACGCC TCTGCGCTGA GCCCCGCCGG AACGCGTTTG CCTGGCCCAA TCATATCCGT
GCCGGCCTGG AAGCCTACAC CGAACTGATC GAGACCGCAC TATGCCGATA A
 
Protein sequence
MLSGTSPQRI ALMLPRLSRY GGAERFASRL ANHLGQTGFD VDFICARQES EAPQGVTPRV 
VGRKGLCRSG KILWYAMAAE RQRRAGNYDL TLSMGKTWNQ DVLRLSGGPL PVFWRLSKQA
YDPGMARTWK MLRRKTAPAN RLINCIERRQ MRTTSHFVAV SDKLVDWVQE AYPSFDTSRI
QVIYNQPDLT AFEPYPRAKQ RAERQQRGLA PDMIYIGTAG TNFALKGVGC LIAALAQLPD
SHHLLVAGDR NPDRYRKQAQ RLGVAHRVTF LGRVEDMTGF YNCLDAFALP TFYDACSNAV
LEALRCGIPT LSSSANGSSV FLDPENTIKD PHDTQNLART LRRLCAEPRR NAFAWPNHIR
AGLEAYTELI ETALCR
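
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that layout, compute GC content, and translate a DNA string so the two sequences can be compared. The function names are hypothetical. The codon table is built from the standard genetic code, whose amino acid assignments translation table 11 shares; table 11 differs in its permitted start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCTTTCCG GTACATCGCC TCAACGTATC"
print(translate(first_blocks))  # MLSGTSPQRI
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the GC content field.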