Gene Dole_1969 details


Gene Information

Locus tag: Dole_1969
Symbol:
ID: 5694809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2380431
End bp: 2381549
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 48%
IMG OID: 641264567
Product: glycosyl transferase group 1
Protein accession: YP_001529850
Protein GI: 158521980
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
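
The start and end positions, gene length, and protein length listed above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; both assumptions fit the numbers in this record but are not stated in it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2380431, 2381549)
print(length)                  # 1119
print(protein_length(length))  # 372
```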


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAATAG GCATTAACGC GCTCTATCTG CTTCCAGGCA AGGTCGGGGG CTCGGAAACA 
TATATCAGAA ACCTTGTGAA GCACCTCGAT CTTCTCGGCG GGGATAATAC CTATTTTATT
TTTATAAACA AGGAAAGCAT CGGTGTATTT CCGGAAACGT CGCGCATAAA GACAGTGCTT
TGCCCCATCC ACGCTGCCAG TCGTCCGATC CGTATACTTT GGGAACAATG CATCCTGCCT
TTTCAAATTA AAAAATATAA CATTGATGCC GTACTTTCGG CAGGAATGAC GGCTCCTTTC
TTCTGTCCGG CGCGGTCGAT TCTCACCATC TTAGATCTTC AGCATATCAA TCAGCCGCAG
AATTTTTCAC GCTTTCATCT GTTTTTCCTG AGGTCGATAA TCTACTTGAG CGCAAAAACC
GCAGACGGCA TAATGACCAT TTCCGAGCAT GTAAAACAGG ACATCGTGAA GTTTTACAAA
ATACGTCCGG AAAAGATCGC CGTGGGCTAT CTCGCTGTAC AGCATAATAT TTTTACGCCC
GCAGCCGGGA AGGACGATTT GGCTATAAGA GCAAAGTATG GTCTCCCTGA GCGTTATATT
CTTTATGCCG CAGCTTTATT GCCCCACAAG AACCATGAGC GGCTTTTAAC GGCATTTAAA
GCGGTTAAAG ACAAGATCCC CGGTAAAAAG CTTGTGTTTA CCGGTGCGTG GAACCAGGGA
TACGATAAGG TCGCAAACAC AATCTCTGCG TTGGACCTGA AAAAAGATGT CATCATGCTT
GGCTGGCTTC CTTTCGAAGA GATATCCGCG GTCTTTCGCG GGGCGGAGCT GTTCGTCTAT
CCCACCCTTC ATGAAGGGTT CGGTCTCCCT ATTCTGGAAG CCATGGCGAG CGGCGTGCCG
GTTGTCTGCT CAAAAATCGA ACCGCTCATC GAGGTTTCGG GGGATGCGTC GATGTTTGTG
GATCCTTTGG ACCCTGCCGA CATTGCGAAC GGGATTTTGT CCGTCCTTAC CCAAAACCAC
CTTCGGGAAC ACCTTGTGGA AAAAGGTGCA AAACGCGCAC GTCAATTCAC GTGGGAAGCA
ACAGCGCAAA CAACGCTTGT GTTTCTTAAT CGGGCATAG
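
A GC percentage such as the one reported in this record can be computed directly from a sequence in this layout. The following is a minimal sketch, assuming GC content means the share of G and C bases among all bases, with the spacing between 10-base groups ignored; `gc_content` is a hypothetical helper name, and the example runs on the first row only, so its result need not match the value reported for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied with its original grouping.
first_row = "GTGAAAATAG GCATTAACGC GCTCTATCTG CTTCCAGGCA AGGTCGGGGG CTCGGAAACA"
print(round(gc_content(first_row), 1))
```

Passing the full 1119 bp sequence instead of `first_row` gives the figure for the whole gene.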
 
Protein sequence
MKIGINALYL LPGKVGGSET YIRNLVKHLD LLGGDNTYFI FINKESIGVF PETSRIKTVL 
CPIHAASRPI RILWEQCILP FQIKKYNIDA VLSAGMTAPF FCPARSILTI LDLQHINQPQ
NFSRFHLFFL RSIIYLSAKT ADGIMTISEH VKQDIVKFYK IRPEKIAVGY LAVQHNIFTP
AAGKDDLAIR AKYGLPERYI LYAAALLPHK NHERLLTAFK AVKDKIPGKK LVFTGAWNQG
YDKVANTISA LDLKKDVIML GWLPFEEISA VFRGAELFVY PTLHEGFGLP ILEAMASGVP
VVCSKIEPLI EVSGDASMFV DPLDPADIAN GILSVLTQNH LREHLVEKGA KRARQFTWEA
TAQTTLVFLN RA
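
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal illustration, not the database's own method: it uses the standard codon-to-amino-acid assignments in TCAG order, which table 11 shares, and reads an alternative start codon such as GTG as methionine when it is the first codon. The names `translate`, `BASES`, `AMINO`, and `START_CODONS` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons that table 11 accepts as translation starts.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # a start codon is read as methionine
        else:
            idx = (16 * BASES.index(codon[0])
                   + 4 * BASES.index(codon[1])
                   + BASES.index(codon[2]))
            aa = AMINO[idx]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGAAAATAG GCATTAACGC GCTCTATCTG"))  # MKIGINALYL
```

The printed residues match the first ten residues of the protein sequence above, including the GTG start codon being read as M.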