Gene Dret_0384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0384 
Symbol 
ID8418189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp471336 
End bp472514 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content48% 
IMG OID645036949 
Productglycosyl transferase group 1 
Protein accessionYP_003197263 
Protein GI258404521 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCT CTATTGTTTT TACATTTGTA GGCTACTACC TACCCGGCTA TAAATCTGGC 
GGCCCAGTGC GGACCATTGC CAACATGGTA GAACATTTGG GTGATGATTT TGATTTTCGG
ATCGTCACCC GGGATCGGGA TGCGTTGGAT ACTGAACCGT ACCCAGATGT AAAGATCGAT
TCCTGGAACA CAGTGGGTAA GGCGCAAGTA TTTTATGCTT CCCAGGAAAC CTTGGGATTG
TGCGGGGTAG CTCGTTTGTT GCGAGAAACC CCGCATGACG TGTTGTATCT GAACAGTTTT
TTTTCTTTTG GGTTCACAGC GCTGCCGCTT TTGGCAAGGC GATTGGGGCT GGCGCCGAAG
AAGCCTTGCG TTATCGCACC CCGCGGGGAG TTTTCAGCAG GGGCCCTGGC CTTGAAGGCA
TGGAAAAAAA AGGCATATTT ACGACTGACC CGGGCTGTGG GGATATCTTC CGGACTTGCC
TGGCAGGCTT CAAGCCAGCA CGAGGTCGAT GATATCCGTT ATGCCATAGG GAATACCGCC
AATAATATTT TTATTGCGCC TAATCTTCCT TCTCCTTTGC AAAATGAACT CCAGATAACA
TCAGAAAACA ATGCCATTGC TGATGATTCT TATTTACGGA TCATTTTTTT GTCACGCGTA
TCGCCGAAGA AGAACCTGGA TTTTGCGCTA AAGGTCCTCG GCCAGGTTAC TGTGCCGGTG
GTGTTCGATA TTTACGGGGT AATAGATGAC AATGCTTATT GGGACAAGTG CAGCCAGCTT
ATAAACACTT TGCCGGAACA GATAAAGGTT TTCTATCATG GGGTTGTTAA CCATAAAGAC
GTGCATGAAC TGCTCTCCGG GTATGATCTT TTTTTTCTGC CTACCCATGG GGAAAACTAT
GGCCATGCGA TTTACGAGAC CCTTGCAGCC GGTGTGCCGC CATTGATTAG CGATCAGACC
CCATGGCGTG ACCTAGATGA AAAAGGCGCT GGCTTTGTCC GGCGACTTGC TGATTTTGAT
GAGTTTGTTT CTGTTATCAA TGCGTATGCG TACAAAACAG ACGATGAGCG AACTTTTTTT
CAAAAAAGTG CCCATGACTA TGCTTTGCAG GTAGCCTCCG GGTCAGAAGT GCTGGAGCAG
AACCGAAAAC TTTTTCAACT GGCAGCCTGT GGAGAATAG
 
Protein sequence
MSFSIVFTFV GYYLPGYKSG GPVRTIANMV EHLGDDFDFR IVTRDRDALD TEPYPDVKID 
SWNTVGKAQV FYASQETLGL CGVARLLRET PHDVLYLNSF FSFGFTALPL LARRLGLAPK
KPCVIAPRGE FSAGALALKA WKKKAYLRLT RAVGISSGLA WQASSQHEVD DIRYAIGNTA
NNIFIAPNLP SPLQNELQIT SENNAIADDS YLRIIFLSRV SPKKNLDFAL KVLGQVTVPV
VFDIYGVIDD NAYWDKCSQL INTLPEQIKV FYHGVVNHKD VHELLSGYDL FFLPTHGENY
GHAIYETLAA GVPPLISDQT PWRDLDEKGA GFVRRLADFD EFVSVINAYA YKTDDERTFF
QKSAHDYALQ VASGSEVLEQ NRKLFQLAAC GE