Gene Dole_0524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0524 
Symbol 
ID5693346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp590753 
End bp591793 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content56% 
IMG OID641263108 
Productglycosyl transferase family protein 
Protein accessionYP_001528411 
Protein GI158520541 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGCC CAATGGATGA AGTCTTCCAT CAGGACATCG TCTGCTCGGT CTGTATCGCC 
AACTATAACG GCGCCAACGT GCTTGCGGCA TGCCTGGATT CTGTTTTTCA ACAGGCATTT
CCTTATCCGT TTGAGGTGAT CGTGCATGAT GACGCATCCA CAGACGGATC GGCCGGCATT
GTTGCGGAAA AATATCCGAC GGTGCGACTG CTTCAAAGCC GGACCAACGT GGGTTTTTGT
GTCAGCAACA ACCGCATGGC GGCTGTTGCA AGGGGCCGGT TTATCCTGCT GCTGAACAAT
GATGCCGAGT TGCACCGGGA CGCCTTTGCC ACGCTTTATG ATGTTGCGGT CAGGCAAAAC
GTGTATGGCA TTCTCGGCCT CCCCCAATAC AGCATGGCGA CCGGTGAACT GATCGACCGG
GGCAGTCTGC TGGACATTTT CTGTAACCCG GTTCCCAACC TGAACCCGTC CCGGCGCGAC
GTGGGCATGG TGATCGGCGC CTGTCTCTGG CTGCCCCGGC ACCTCTGGCA GGAGCTGGGC
GGTTTTCCGG AATGGTTTGA GAGCCTTGCC GAAGACATGT ACCTCTGCTG TTACGCCCGG
GTCAAGGGAT ATCCGGTCAT CGCCCTTGCG GCGTCCGGGT TCAATCATTG GGTCGGCGAG
AGTTTTGGCG GCGGCAAAGT GGTCGGTCGC ACTTTGCAGA CCACCTATCG CAGGCGCACC
CGGAGCGAGC GCAACAAAAC GTATGTCATG CTGCTGTGCT ATCCCGCCCC CCTTGCCCAG
GTGTTGGTTC CGCTTCATTT GTTGCTGCTG GCGGTAGAGG GACTGCTGCT TTCAGCCTTC
AAAAAAGACG CCCGCATCTG GAAAGAAATT TATTGGCCCT GTTTATTGGC CCTCTGGCGT
CGCCGCCACA TGCTGATGCG CCTGAGGTGC GAGATACAGG CAACACGGCG AGCCTCCTTA
AAGGCTTTTT ATTCGACGCA CACCTTCTGG CCTCATAAAC TGACCATGCT GATCAAATAT
GGTCTGCCGA TATTGAAATA A
 
Protein sequence
MSGPMDEVFH QDIVCSVCIA NYNGANVLAA CLDSVFQQAF PYPFEVIVHD DASTDGSAGI 
VAEKYPTVRL LQSRTNVGFC VSNNRMAAVA RGRFILLLNN DAELHRDAFA TLYDVAVRQN
VYGILGLPQY SMATGELIDR GSLLDIFCNP VPNLNPSRRD VGMVIGACLW LPRHLWQELG
GFPEWFESLA EDMYLCCYAR VKGYPVIALA ASGFNHWVGE SFGGGKVVGR TLQTTYRRRT
RSERNKTYVM LLCYPAPLAQ VLVPLHLLLL AVEGLLLSAF KKDARIWKEI YWPCLLALWR
RRHMLMRLRC EIQATRRASL KAFYSTHTFW PHKLTMLIKY GLPILK