Gene Dole_0644 details


Gene Information

Locus tag: Dole_0644
Symbol:
ID: 5693474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 763689
End bp: 764855
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 58%
IMG OID: 641263236
Product: hypothetical protein
Protein accession: YP_001528531
Protein GI: 158520661
COG category: [R] General function prediction only
COG ID: [COG1373] Predicted ATPase (AAA+ superfamily)
TIGRFAM ID:
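
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(763689, 764855)
print(length)                     # 1167
print(protein_length_aa(length))  # 388
```

Both values agree with the Gene Length and Protein Length fields listed for this record.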


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATGGTT ATATCCCCAG GCTTATTGAA ACGGATATTC TCCGGTCCCT TGCCCGGTCT 
CCGGCCGTGG CAATCCTCGG CCCCCGGCAA TGCGGGAAAT CCACCACTGC CCGGCAGTTG
ATTGATCCGG CCACATCGAT CTATTTGGAT TTGCAGGACC GGGTGGACCG GAACAAGCTT
TCTGAGCCGG AACTGTTTTT TGAGCAATAC CGGAGCAGAC TGATCTGCCT GGATGAGATT
CAGCTGCTGC CGGAATTTTT TTCCGTGCTG CGCTCGGAGA TCGACAAGGA TCGACGGCCG
GGTCGCTTTT TGATTTTAGG GTCGGCGTCC CGGGACCTGA TCCGGCAGTC AACCGAGTCC
CTGGCCGGGC GGATCGCTTA TCATGACCTG ACGCCCTTTT TGCTGGCGGA AATGGTCGGC
AAATTGTCGT GGGCGGACCT GTGGCTTCGG GGCGGGTTCC CGGAAAGCGC CCTGGCCCAT
GACGAGCAGG CCGGTTTTGA ATGGCGCCTG GATTTTATCC GTACATTCAT GGAGCGCGAT
ATCCCGGCCC TGGGATTTAA CATTCCGGTG CCGGTGATCG AACGGCTGTG GCTGCTTCTG
GCCCACTGCC ACGGCCAGAC CATCAACTAC CAGAAACTGG CCGCATCAGC GGACCTGGCC
GTGCCGACCC TGAAAAAGTA CCTGGCCCTG CTGGAACAGA CCTATATGGT CCGGCTGCTG
CCCCCGTTTG CCGCCAATCT TAAAAAACGG CTGGTCAAGT CGCCCAAGGT GTTTCTGACC
GACAGCGGTA TTCTTCACGC GTTGCTGGAT ATTGAGTCCT ATGATTACCT GCTGGCCAAC
CCAACGGCCG GCGCCTCCTG GGAAGGGTTT GTGATTGAAA ATCTTATTGC CCTGCATCCC
CGCTGGCGGC CGTCGTTCTT ACGCACCTCC AACGGCGCTG AAATCGACCT GGTGCTGGAG
CGGGCCGGGC GATACCATGT TTTTGAATGC AAGCTCTCCA AGGCCCCGCA ACCCTCCCGT
GGCTTTTACG AGCTGGTTGA TGGTCTGCGA CCCGAAACCG CCTGCGTGGT CGCGCCGGTG
GATGAGCCGT TTGAAATAAA AAAAGGGATT TGGGTCTGTT CGCCCCTGCA TTTGATTAAG
GAGGAAAAAA AATCGGGGGT GGGATAA
 
Protein sequence
MHGYIPRLIE TDILRSLARS PAVAILGPRQ CGKSTTARQL IDPATSIYLD LQDRVDRNKL 
SEPELFFEQY RSRLICLDEI QLLPEFFSVL RSEIDKDRRP GRFLILGSAS RDLIRQSTES
LAGRIAYHDL TPFLLAEMVG KLSWADLWLR GGFPESALAH DEQAGFEWRL DFIRTFMERD
IPALGFNIPV PVIERLWLLL AHCHGQTINY QKLAASADLA VPTLKKYLAL LEQTYMVRLL
PPFAANLKKR LVKSPKVFLT DSGILHALLD IESYDYLLAN PTAGASWEGF VIENLIALHP
RWRPSFLRTS NGAEIDLVLE RAGRYHVFEC KLSKAPQPSR GFYELVDGLR PETACVVAPV
DEPFEIKKGI WVCSPLHLIK EEKKSGVG
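
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation. The helper names are hypothetical, and the GC calculation assumes an unambiguous A/C/G/T sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed row of the gene sequence, copied from the record above.
first_row = "ATGCATGGTT ATATCCCCAG GCTTATTGAA ACGGATATTC TCCGGTCCCT TGCCCGGTCT"
row = clean_sequence(first_row)
print(len(row))  # 60
print(row[:3])   # ATG
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared against the GC content field of the record.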