Gene Dole_0949 details

Gene Information

Locus tag: Dole_0949
Symbol:
ID: 5693784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1105709
End bp: 1106953
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 57%
IMG OID: 641263546
Product: PGAP1 family protein
Protein accession: YP_001528836
Protein GI: 158520966
COG category: [R] General function prediction only
COG ID: [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold
TIGRFAM ID:
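
The coordinates and lengths listed in this record are mutually consistent, which a short Python sketch can confirm. The function names are illustrative and not part of the source database. The sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 1105709, End bp 1106953.
length = gene_length_bp(1105709, 1106953)
print(length, protein_length_aa(length))  # prints: 1245 414
```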


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000633482
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGGCCAC GGAACAGATT TTTTCCGTCT GCCAACAACT TTTTTATCGA AAAAAAGGAG 
GAGATCAAGA TGGTAAAACG ATTTTTTGCT TTTGTTCTGG TAAGTTGTCT GATTGCGGCT
CTGCCGGGGG CCGTGTTTGC CCGGGCCGAT AAAACCACCT ATCCGGTGGT ATTTGCCCAC
GGCATGCTGG GGTTTGATGA ACTGGTGGGC ATCGATTATT TCGGCAATGA TTACGGTGTA
TTTGTCGGCG ATCCCTGCGA CGGTTTTCTG GAGACCTCCT GCAACAGCGA AATTGATAAA
AACCAGCAGG CCTTTGCCGC GTCGGTCAAT CCGTTTCAGT CCTCTGAACT AAGGGGTCTG
GAGTTGGCCG ATGCCATAGA GAGCTACATG GCCACCGTAA ACGCGGACTG CGTGAATATC
GTGGGCCACT CCCAGGGCGG CATGGACGCC AGAAAGGCGG CCAAGCTGCT TTACGACCGA
AAGGGCCGGC AGGTGGTCAA GGTGATGGTG TCAATCTCCT CCCCCCATCG CGGATCACCG
GTGGGCAAGG GCGTGCTGGA CCAGGGCCCC GACGGCATGA ACGCCTTTCT CGGCGTGCTG
GTTGATTACC TGGTGGGGCC TGTGCTGGTG GGGGATCTCA GCGACTTTGA AGCCAGCATG
AAGGCATTTG TGTATGACGA TTACGATCCC AACGACGGGG TTGTCACCGG CGCCAAGGCC
TTTAACAACG CTTACGGCAT CAACGACACC CATGTACGGC ACTATGCCTC CATCATCACC
GCCACCCAGG GAAACCTCAA CCCGATTCTT GGCGCCCTGG GCCTTGTGGC GCCTCTTGAC
ATCGACGGCG ACGGCTGGTG CGCCGACTAC ACGGACTGCA ACAACGACGG CGCCGCCGGC
TGCGGAGACG GGGATTTTGA GGACGGCGAC GACGACGGAC TGGTGGGCAT CAACTCCCAG
CAAATGGGCT ACCGCCTGAA GCACAAAAAG AGCTGGCTGT GGGGCACCTA TTTTGACGAA
GATTCGACCA CCGGCTATGT GGGCGACATC GACCGGCCCA GCCAGGTGCA GGCCACATCC
TACAGCAGTG TCATCGATCA GGACCATCTG GATGTACTGG GCCTGGGCGT GATTCCCTAC
CTGATTCCCG ATGACTTTGA TGAGGAAGGC TTCTACGCCG ACCTGATCGA CTACATCGCC
GACAACGAAG GAACCAGCAG CTCCGGCTGG TGGTGGTTCT GGTAG
 
Protein sequence
MGPRNRFFPS ANNFFIEKKE EIKMVKRFFA FVLVSCLIAA LPGAVFARAD KTTYPVVFAH 
GMLGFDELVG IDYFGNDYGV FVGDPCDGFL ETSCNSEIDK NQQAFAASVN PFQSSELRGL
ELADAIESYM ATVNADCVNI VGHSQGGMDA RKAAKLLYDR KGRQVVKVMV SISSPHRGSP
VGKGVLDQGP DGMNAFLGVL VDYLVGPVLV GDLSDFEASM KAFVYDDYDP NDGVVTGAKA
FNNAYGINDT HVRHYASIIT ATQGNLNPIL GALGLVAPLD IDGDGWCADY TDCNNDGAAG
CGDGDFEDGD DDGLVGINSQ QMGYRLKHKK SWLWGTYFDE DSTTGYVGDI DRPSQVQATS
YSSVIDQDHL DVLGLGVIPY LIPDDFDEEG FYADLIDYIA DNEGTSSSGW WWFW
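
The gene and protein sequences above are related through translation table 11, and the record also reports a GC content. The following Python sketch shows one way to recompute both from the space-separated blocks. It is a minimal illustration, not the database's own method: the helper names are hypothetical, the codon table is the standard assignment written in TCAG order, and the start-codon set is an assumption based on the alternative initiation codons that NCBI lists for table 11. Any start codon in first position is read as M, which is why the leading GTG yields methionine.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (table 11 uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("GTGGGGCCAC GGAACAGATT TTTTCCGTCT"))  # prints: MGPRNRFFPS
```

Passing the full gene sequence to `translate` and `gc_content` would let the result be compared against the protein sequence and the GC content reported in the record.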