Gene Dole_2545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2545 
Symbol 
ID5695396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3079772 
End bp3080899 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content61% 
IMG OID641265154 
ProductHEAT repeat-containing PBS lyase 
Protein accessionYP_001530426 
Protein GI158522556 
COG category[C] Energy production and conversion 
COG ID[COG1600] Uncharacterized Fe-S protein 
TIGRFAM ID[TIGR00276] iron-sulfur cluster binding protein, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00348974 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTTAC AACAGGAAGA GATCATCAAA AAAGCGCGGG AACTGGGGTT TGCCGACATC 
GGTTTTACCA CGGCCGACCC TTTTGAGGAG CACCGACGAA TGCTGCTGGA ACGGCAGGAG
GAGTACGGAT GGGCCGAGCA GGTAGGCCTT GACCTGCTGA AAGGCACTGA TCCCGACGCC
ATCCTGCCGG GGGCAAAAAG CATCATCGTT CTGATCGAAA ACTATTTTTC CCACGCCTAT
CCCCGTTCCA TGGAGGGCAT TTTCGGCCGG TGCTACCTGG ATGACGACCG GGTCACAAAA
GACGGCCTGG TGCCGCGGAT AAAGGCCTTC CGCGCCTTTC TGCGGGAGGA CGGCATCAAC
ACCAAGGTGC CCTTTAACCT GCCCCACCGG GTGGCCGCGG CCCGGGCCGG ACTGGGCGAT
TTCGGCAGGA ACTGCCTCTT TTACGCTCAC AATGCCGTGC GCGGCGGCTC CTGGACCCTG
CCCATCGCCG TGGTGGTGGA TCGAGAATTT ACGCCGGGCA CGCCCACCCT GGGCATCGGA
TGCCCGGACT GGTGCAAGAA CGTCTGTATT GCCGCCTGCC CCACCGGCGC GCTCAAGGGC
AGTGGCAGAA TCGATCCCCG CAAATGCATC TCATTTCTCT CCTATTTCGG CGACGGCATC
ACGCCCCTCA AAATGAGGGA ACCCATGGGT ATGTTTGTCT ACGGGTGCGA CCGGTGCCAG
AATGTCTGCC CCCGCAACCA GCCCTGGCTG GCCCAGGCGC TGCCGGTAAA CGAACGGGCC
GCGGCCAAGG CGGAAAACTT CGACCTGCGG GCCCTGCTGC ACATGGACAC CGCTTATTTT
GAAAGCAGTG TATGGCCCCA CATGTTCTAC ATGTCATCCG CCGACATCTG GCGATGGAAG
ATGAACGTGG CCCGGGCCAT GGGCAACAGC CGGGACCAGG GCTTTGTTCC GGACCTGGCA
CGGGCGTTTG AAGAAAACGA GGACCCCCGC GTCAAGGGAA TGGCGGCCTG GGCACTGGGC
CACATCGGCG GCGATCAGGC CAAAACGGCC CTGGAAAAGT TTTCCGAAAC AACACTTGCC
GGTCCCGTGG CCGAAGAGGT TCGTCTGGCA ATGGATGCCT GCGCCTGA
 
Protein sequence
MMLQQEEIIK KARELGFADI GFTTADPFEE HRRMLLERQE EYGWAEQVGL DLLKGTDPDA 
ILPGAKSIIV LIENYFSHAY PRSMEGIFGR CYLDDDRVTK DGLVPRIKAF RAFLREDGIN
TKVPFNLPHR VAAARAGLGD FGRNCLFYAH NAVRGGSWTL PIAVVVDREF TPGTPTLGIG
CPDWCKNVCI AACPTGALKG SGRIDPRKCI SFLSYFGDGI TPLKMREPMG MFVYGCDRCQ
NVCPRNQPWL AQALPVNERA AAKAENFDLR ALLHMDTAYF ESSVWPHMFY MSSADIWRWK
MNVARAMGNS RDQGFVPDLA RAFEENEDPR VKGMAAWALG HIGGDQAKTA LEKFSETTLA
GPVAEEVRLA MDACA