Gene Dole_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2010 
Symbol 
ID5694850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2435170 
End bp2436864 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content60% 
IMG OID641264608 
Productpeptide synthase 
Protein accessionYP_001529891 
Protein GI158522021 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACCAG AAAACCACAA CCCGGTGATC AATATTGCGG CCCGCATGAC GCAAATGGCC 
CGGCAGCATC CCTACAAAAA AGCGGTGATC GCGCCCCAGG GACGGGACCG GGCCGGACGG
GTCACCTATG CCCACTTCAC CTTTGCCCAG TTGGACGCCG ACTCCAGCCG CCTGGCCTCG
GGCCTGGAAA AGGCCGGTAT TCGCCGGGGC ACCCGCACCA TTCTCATGGT ACGGCCCAGC
CTGGACTTTT TTTCCCTGGT CTTTGCCCTG TTCAAGGCCG GCATCGTGCC GGTGGTGGTG
GACCCGGGCA TGGGGGTAAA GCGCATGGTG AGCTGTTTTG CTGAAACCGA TCCCCAGGCC
TTTATCGGCA TTCCCCTGGC CCATGTGGTG AGAAAAATCT ACCCGAAATT CTTCAAAACC
GTTGAAACAT GGGTCACCGT GGGAAATCGC TGGTTCTGGG GCGGCCACAC CCTGGACCGA
ATCCGCGCAT CGGGCACAGA GGATTATAAA ACAGCCGAAA CCCTGTCAGA TGAAACCGCG
GCCATTCTGT TTACAACGGG CAGCACCGGC CCGGCCAAGG GCGTGGTCTA CACCCACGGC
AATTTTGACG CCCAGATTCA GCATATTCAG GACCATTTCC AAATCGGTTC CGACGAGACC
GACCTGCCCA CATTCCCGCT GTTTGCCCTG TTTGACCCGG CCCTGGGCAT GACCGCCGTG
ATTCCGGACA TGGATCCCAC CAAGCCGGCC TTTGTCAACC CGGAACGCAT TCTCGAGGGC
ATTGCCAACC ACGGGGTGAC CAACATGTTT GCCTCCCCTG CCCTGCTCAA CCGGGTGGGC
GGTTACTGCA AGAAACGCAA CATTGTCCTG CCGTCGCTGC GGCGGGTGGT GTCGGCCGGC
GCCCCGGTTC ACCCGTCCAA CATCGAGCAG TTCGCGTCGG CCCTGACCGA TGAGGCCGAA
GTGCACACGC CCTACGGCGC GTCCGAGGCG GTGCCCATCA TCTCCATCGG CAGCCGGGAG
ATCCTGACCG AGACCAAGCA GATGAGCGAG CAGGGGTTCG GCAACTGCGT GGGCCGGCCC
CTGGAAGGCA TTGAGGTAGA GCTGATCACT ATTTCAGACA GGCCCATTGA GGCGTGGTCC
GACGACCTGC TGGTGGCCCC CGGTGATGTG GGAGAGTTTG TGGTCAAGGC AGACCTGGTC
ACCCGTTCTT ACTACAACCG GCCGAAAGAC ACGGCAGGGG CCAAGATACC CGACGGGGAC
GGTTTCTGGC ACCGCATGGG AGACCTGGCA TGGATGGACA ACCACGGCCG GTTCTGGTTC
TGCGGCAGGA AGAGCCACCG GGTGGAGTGT GCGGACCGGA CCCTGTTCAC CGTCCCCTGC
GAGGCCATCT TCAACAACCA TCCCCATGTG GCCAGAAGCG CCCTGGTGGG TGTGGGCCCG
GCGGGAGGTC AGACACCGGT GATCTGTATC GAGGTGATCA AGGAAAAACG GATTCGAAAA
AAAGAGCTGG CATCTGAACT GTTAGACCTT GCCCGGACCC ATGAACTGAC AAAGTCCATC
AAGACCGTCC TGTTTCACGA CAACTTTCCC GTGGATATCC GGCACAACTC GAAAATCTTC
AGGGAAAAGC TGGCGGTGTG GGCCGCGAAA AAGATAAAAA CAAAACCGCG TCCTTCTCCA
AAAAAGCGGG GATGA
 
Protein sequence
MAPENHNPVI NIAARMTQMA RQHPYKKAVI APQGRDRAGR VTYAHFTFAQ LDADSSRLAS 
GLEKAGIRRG TRTILMVRPS LDFFSLVFAL FKAGIVPVVV DPGMGVKRMV SCFAETDPQA
FIGIPLAHVV RKIYPKFFKT VETWVTVGNR WFWGGHTLDR IRASGTEDYK TAETLSDETA
AILFTTGSTG PAKGVVYTHG NFDAQIQHIQ DHFQIGSDET DLPTFPLFAL FDPALGMTAV
IPDMDPTKPA FVNPERILEG IANHGVTNMF ASPALLNRVG GYCKKRNIVL PSLRRVVSAG
APVHPSNIEQ FASALTDEAE VHTPYGASEA VPIISIGSRE ILTETKQMSE QGFGNCVGRP
LEGIEVELIT ISDRPIEAWS DDLLVAPGDV GEFVVKADLV TRSYYNRPKD TAGAKIPDGD
GFWHRMGDLA WMDNHGRFWF CGRKSHRVEC ADRTLFTVPC EAIFNNHPHV ARSALVGVGP
AGGQTPVICI EVIKEKRIRK KELASELLDL ARTHELTKSI KTVLFHDNFP VDIRHNSKIF
REKLAVWAAK KIKTKPRPSP KKRG