Gene Dole_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1001 
Symbol 
ID5693836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1175585 
End bp1176781 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content56% 
IMG OID641263598 
Productheterodisulfide reductase, putative 
Protein accessionYP_001528888 
Protein GI158521018 
COG category[C] Energy production and conversion 
COG ID[COG1150] Heterodisulfide reductase, subunit C 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.284636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAC GATATCTTGC CGAGCCTGAT CTGGGGTTTA TCAACGAGGT CATCGGGCTC 
GGGGGAAATA CCCTGAAAAA GTGTTTTCAG TGCGCCACCT GCTCGGTGGT GTGCCCGATT
TCACCGGACA ACAAGCCCTT TCCCCGAAAG GAGATGATTG CCGCGTCCTG GGGTCTTAAA
GACAAACTGG TCAAGGATGT GGACATCTGG CTGTGCCACC AGTGCGGCGA CTGCTCCACC
AAGTGCCCGC GCGGCGCAGC CCCCGGTGAT GTTCTGGCCG CGGTGCGGTC CTATGCCATT
GCCGACTATG CCACGCCCAA GGCCGTGGGC AAGGCGGTCA ATGATCCCAA AAAACTGCCG
ATCCTGATCG CGGTGCCCGC CATCCTGTTT GCCGTTCTGG CCTTTATCAC CGTCCAGTTC
GGCGACACCA TGGCCAGTAT TTTCAAGAGC ATCGGCCTTG AGGCCATCGG TTTCCCGTGG
GCCCATCATC ATGAACCGGG TGTTATCGCC CACGCGGATT TTTATTCGAC CTGGTTTGTG
GACATTGTTT TTGTGCCCCT GGCCGCTTTT GTGGTACTGG TCTTTTTTAT GGGCCTGCGC
CGGTTTATCA AAGATATTCA TGAACAGGCG GTGGCCGACG GGAAAACCAC CCAGCGGAAA
CTGGACTATG TCGGCCTGGT CAAGGGCATC ATCGCGGTGG TCCCCACCAT TCTCAAGCAC
AACAAGTTTT CCGAGTGTAC CGAGAACAAG GACCGGTCCA CGGCCCATAT GATGGTGCTC
TACAGTTTTA TCGGGCTTTT CGTGGTGACC AATATTTTCT TTTTTGCCCT CTACTTTCTG
CACGCGCCCG GACCTTACTC CCAGTTGAAC CCGGTCAAGA TCCTGGCCAA CGCAGCCGGT
ATCGCTCTGA TTATCGGAAG CCTTCTGATG ATCAAGAATC GCCTGGCGGC AAAGGATCAG
GCCTCGTCCT ACAAGGACTG GTACCTGCTG GGCCTGGTTC TGGGCCTGGG TCTGTCGGGC
ATGCTTGCCG AGCTGACCCG CCTGGCCGGA GCCGAAGCCC TGACCTATTT CATGTATTAC
ATTCACCTGA TCTTTATTTT TAACCTGTTT GCGTTTCTGC CGTTTTCCAA GCTGGCCCAC
CTGGTCTACC GTACCGTGGC CATGGGTTAC TCCAACTATG CCGGCCGTGA GAAATAG
 
Protein sequence
MSERYLAEPD LGFINEVIGL GGNTLKKCFQ CATCSVVCPI SPDNKPFPRK EMIAASWGLK 
DKLVKDVDIW LCHQCGDCST KCPRGAAPGD VLAAVRSYAI ADYATPKAVG KAVNDPKKLP
ILIAVPAILF AVLAFITVQF GDTMASIFKS IGLEAIGFPW AHHHEPGVIA HADFYSTWFV
DIVFVPLAAF VVLVFFMGLR RFIKDIHEQA VADGKTTQRK LDYVGLVKGI IAVVPTILKH
NKFSECTENK DRSTAHMMVL YSFIGLFVVT NIFFFALYFL HAPGPYSQLN PVKILANAAG
IALIIGSLLM IKNRLAAKDQ ASSYKDWYLL GLVLGLGLSG MLAELTRLAG AEALTYFMYY
IHLIFIFNLF AFLPFSKLAH LVYRTVAMGY SNYAGREK