Gene Dole_0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0440 
Symbol 
ID5693260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp508158 
End bp510020 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content60% 
IMG OID641263022 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001528327 
Protein GI158520457 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000968749 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAAAT TACTTGCCAA GGAACCGGGA AAAGAGATGC TGCTCTTGGG CAACGAGGCC 
CTGGCCCGGG GCGCCCTGGA GGCGGGTGTG GGGTTTGCCT CCACCTATCC GGGAACGCCA
TCCTCGGAGC TGTCGCTCAA TTTTTTCCAG ATCTCCCGGG AGACCGACCT CTATTTTGAA
TACAGCACCA ACGAAAAGGT GGCCCTGGAA GTGGCGGCCG CTGCCGCCAA CTGCGGGGTG
CGCAGCATGT GCGTGATGAA GCACGTGGGC GTGAACGTGG CGGCCGACGC CCTGATGACC
CTGGCCTATG TGGGCGTCAA GGCCGGCATG GTGCTGTTGT CCGCCGACGA TCCCCACATG
TTTTCCAGCC AGAACGAGCA GGACAACCGC TATTACGGCA AGCTTTCCGG CCTGCCCGTG
GTCGAGCCCT CCTCCGTGGC CGAGGCAAAA GAGATGGCCG TTTACGCCTT TGAAGTCTCC
GAGGCCCTGG GCGAGCCGGT GATCCTGCGC ACCACCACCC GGGTCAACCA TTCATCGGCC
AAGGTGGCGC TGGGCAGCCT TCCTGAAAAG GTCAAAACCG AGGGCCAATT TGAAAAGGAC
CCGTTTAATT ACGTTACCGT GCCGGCGGTT TCCAGAAAGC TTCATGTCCG GCTGCTGGAA
CGTCTGAAAA AGGCCGCCGA CCTGTCCAAT ACATCGCCTT ACAATATTCG CACCGGCAAG
GGCCGATATG GCATCATTTG TAACGGGGTG AGTTATTTTT ACGTTACCGA CGCGTTAAAG
GCCCTGGGCC GGGAGAGCGA TTTTTCCGTG CTGCGCGTCG GGTTTTCCAA CCCCATGCCC
GACGCCCTGG TCAAGGCGTT TCTGGCCGAC TGCGACAGGG TGCTGGTGGC CGAAGAGGGT
GAGCCCTTCA TGGAAGAGGC GGTCAAGGCC ATGGCCGCCG AGGAAAAGCG GTGCATTGAT
ATCGCGGGCA AGCGGGAAGA CCTGTTTTCC CGGCTCTCCG AGTTTGATCC CCAGCTGGTG
GCCCGCTGCA TTGCCCGCTA TTTTGATATT CCTTACACAC CGCCGACGCC GGCGGATATG
TCCGGCGTGC CCGAGATTCC CCAGAGGCCC CCCAACCTGT GCGCGGGCTG CTCCCACCGG
GCCACCTTTT ACGCGGTAAA AAAAGCCGCG GAAGGGTATG ACACGATCTT TCCCACGGAC
ATCGGGTGCT ACACCCTGGG GTTTCTGCCG CCGCTCTCCA TGGGCGATTT TCTGATCTGC
ATGGGATCTT CCGTGGGCAC GGCCTGCGGG TTTTCCCGGG CATCGAACCA GAAGGTGGTG
GCCTTTATCG GGGATTCCAC CTTTTTCCAT TCCGGCATTC CGGCCCTGAT CAACGGGGTG
TTCAACAACC ACGACTTTAC CCTGGTGATC CTGGACAACG GCACCACCGC CATGACCGGG
CACCAGCCAC ACCCCGGCGT GGACATGGAC GAGCTCAATT TTTCCGGTTT TCAGCGGGTC
TCCATCGAGG CACTGGTCAA AGGCGCCGGC GTTCAGCACG TGTCAGTGAT CCGGCCCTAC
AACCTGAAAA AAAGTATTGA GGCGATTCGG GAGGCCATTG AATTCAAGGG CGTTTCCGTG
GTCATTGCAC GGGAAGAGTG CGTGCTCAAG GCCAAAAGCC TCAAGCGGGG AAGCGCCCGG
GTTTTTTACG TGAGCGACCG GTGCAAAAAC CACCGGGACT GCATCAACAC CCTGGCCTGC
CCGGCTTTTT ACGTGGCGGA CGGCCGGGTG CAAATCAACC CCAATCTGTG CGCCGGGTGC
GCGGTGTGCG TTCAGGTGTG CCCGGAGAAG GCCATTGTGC CGGTAAAACA GGATCAGAAG
TAA
 
Protein sequence
MHKLLAKEPG KEMLLLGNEA LARGALEAGV GFASTYPGTP SSELSLNFFQ ISRETDLYFE 
YSTNEKVALE VAAAAANCGV RSMCVMKHVG VNVAADALMT LAYVGVKAGM VLLSADDPHM
FSSQNEQDNR YYGKLSGLPV VEPSSVAEAK EMAVYAFEVS EALGEPVILR TTTRVNHSSA
KVALGSLPEK VKTEGQFEKD PFNYVTVPAV SRKLHVRLLE RLKKAADLSN TSPYNIRTGK
GRYGIICNGV SYFYVTDALK ALGRESDFSV LRVGFSNPMP DALVKAFLAD CDRVLVAEEG
EPFMEEAVKA MAAEEKRCID IAGKREDLFS RLSEFDPQLV ARCIARYFDI PYTPPTPADM
SGVPEIPQRP PNLCAGCSHR ATFYAVKKAA EGYDTIFPTD IGCYTLGFLP PLSMGDFLIC
MGSSVGTACG FSRASNQKVV AFIGDSTFFH SGIPALINGV FNNHDFTLVI LDNGTTAMTG
HQPHPGVDMD ELNFSGFQRV SIEALVKGAG VQHVSVIRPY NLKKSIEAIR EAIEFKGVSV
VIAREECVLK AKSLKRGSAR VFYVSDRCKN HRDCINTLAC PAFYVADGRV QINPNLCAGC
AVCVQVCPEK AIVPVKQDQK