Gene Dole_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2052 
Symbol 
ID5694895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2501690 
End bp2502928 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content59% 
IMG OID641264653 
Productglycosyl transferase group 1 
Protein accessionYP_001529933 
Protein GI158522063 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000028188 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTCCA GTCTGAAGAT TCTCCACCTG ATCAGCCAGC GGCCCGATGC CACGGGCAGC 
GGTGTCTATG TCCAGGCCAT GCTGCGTCAC GCCGCACAAA AAGGGCATTG CAACCACCTG
GTGGCCGGCA TTCAGGCGAA TAATATACCC CAGGCCCCGG TTGTCAATCA ATGTGAATGC
GCTTTTGTCT GTTTCGAAGG GCCGGACACA CCCCTCCCCA TTGTGGGCAT GAGCGATGTG
ATGCCTTATA AAAGCCGGCG GTTTTGCGAC CTGTCCGATA ACGCGGTGGA TGAATATGAA
ACCTGCTTTG CCGAAAAACT GCGCCATGCC GTCAAACGTT TCGCTCCGGA CCTGATCCAC
AGCCATCACC TGTGGCTGGT TACCTCCCTT GCCAGACGCA TGTTTCCCGG CCTGCCCATG
GTCACCACCT GTCACGGCAC CGATCTGCGC CAGTTTCAGA ACTGTCCTCA CCTGCAGGCG
CGAGTTCTGG AGGGATGCGC CGGGCTGGAC GCGGTCATGG CGCTGAGCCG GGCGCAAAAG
ACTGAGATTG CCTCCCTGTA CGGGCTGTCC GAAGAAAAGA TTCACGTGGT GGGCGCCGGG
TATGACGAGG CCCTGTTTTA TCTTCAGGCC AAGCCCGTCC CGCATCCGGT GCAGGTGGTA
TATGCGGGCA AACTGTGCAA CGCCAAGGGC ACGCCCTGGC TGTTGAAAGC ATTATCCGCC
ATCCATACCG TGCCGTGGCA GCTTCACCTG GTGGGCGGCG GTGCCGGCGA GGAGGCCGAT
CAGTGCTGGA AAATGGCCGG CGACCTGGGA GACCGGGTGT GCGTTTACGG TGCCGTGGAC
CAGTCCACGC TTGCGGCTTT GATGCGACAA AGCCATATTT TTGTGCTGCC CTCATTTTTT
GAAGGCCTGC CCCTTGTGCT GCTGGAGGCC CTGGCCTGCG GATGCCGTGT TGTTGCCACC
GACCTGCCCG GCGTGGCCGA GGTGCTGGAC GGCATGGATG CCGATTATAT CTCCCGGGTC
CATCCGCCGG GATTGCACAC GGTAGACAAA CCCTTTACCC AGGATCTGGA CCGGTTTGTC
AAAGACCTCG CAAACGTTCT TACCACACAG ATGGCCGCAG CGGTCCAACA GCCCGATATT
GATCTTTCCC TTATTCAGGA CCGGCTTTCC GGTTTTACCT GGGGCCGGGT TTTTGAACGG
GTGGAACGGG TCTACAGGTC GGTTTGTCGT CTATCATAA
 
Protein sequence
MVSSLKILHL ISQRPDATGS GVYVQAMLRH AAQKGHCNHL VAGIQANNIP QAPVVNQCEC 
AFVCFEGPDT PLPIVGMSDV MPYKSRRFCD LSDNAVDEYE TCFAEKLRHA VKRFAPDLIH
SHHLWLVTSL ARRMFPGLPM VTTCHGTDLR QFQNCPHLQA RVLEGCAGLD AVMALSRAQK
TEIASLYGLS EEKIHVVGAG YDEALFYLQA KPVPHPVQVV YAGKLCNAKG TPWLLKALSA
IHTVPWQLHL VGGGAGEEAD QCWKMAGDLG DRVCVYGAVD QSTLAALMRQ SHIFVLPSFF
EGLPLVLLEA LACGCRVVAT DLPGVAEVLD GMDADYISRV HPPGLHTVDK PFTQDLDRFV
KDLANVLTTQ MAAAVQQPDI DLSLIQDRLS GFTWGRVFER VERVYRSVCR LS