Gene Dret_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0833 
Symbol 
ID8418651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp985526 
End bp987373 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content59% 
IMG OID645037401 
Productindolepyruvate ferredoxin oxidoreductase, alpha subunit 
Protein accessionYP_003197702 
Protein GI258404960 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.187423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.439959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACATC CTTTGCTTGC CGATGCCCCG GGAACAACGC ACCTCTTGCT GGGTAACGAG 
GCCATAGCCC GCGGCGCCCT CGAGGCCGGT GTTGGTTGTG TGACCTGTTA TCCCGGCACC
CCATCCTCTG AAGTCCCGGA CACTTTGTTC AGGGTCTCTC CAGAGGGGAA CTTCCACTTT
GAATATTCAG TCAACGAAAA AGTGGCCTTG GAAGTCGGCG GAGGGGCTGC CCTGGGAGGC
GTCCCCACTC TGGTGACCAT GAAACATGTC GGGGTCAATG TGGCCGCCGA TCCCTTGATG
ACCCTGGCCT ATATCGGCAC ACCGGGAGGT CTGGTCCTGT TGAGCGCCGA CGACCCGGGC
TGCCACTCCA GCCAGAATGA GCAGGATAAT CGGGCCTACG CCCGGCTGGC CGGGATGCCG
TGCTTTGAAC CGTCAACGGC TCAGGAAGCC AAAGACATGA CCCGTGACGC CCTCCTGCTC
TCGGCCAAAT GGCAGCAGCC TGTTATGCTT CGAACGACCA CCCGGGTGAA CCACCTCCGG
GGCCCTGTGC GCTTCGACGC GTTGCCGCCC GCCAAACGGA CCGGCCAATT TGAAAAAAAT
CCCATGCGCT TTGTGCCCAT TCCAGCTGTG GCTCGGGACA GGCACCCGAA GCTATTGCAC
CAACTCGCCT CCATCGAAGA GGAAATCCAG AATCAAGAGT GGAATACGGT TTCCGGGCAA
GGCCCTGTGG GCATCATCGC CAGCAGCATC TGCCGTGCCT ATGTCCAGGA CGCCTTGCTG
GACATGCCTC AGGCCGACCA GTTCAGCCTG CTGGAACTCA AGGTCAGCTA TCCTCTGCCC
CAGCACCAGC TCCTGGAATT CATTCAGGGA CGTGACAAGG TAGTTGTCGT CGAGGAACTG
GAACCTTTTG TGGAAAGCGC CATTCGGGAA ATGGCCCAAC GCCATCAGCT GGATCTGGAA
ATCATCGGGA AAAGCGAGTT CCTGCCGCGT TGCGGGGAAT TTTCCACCAG GACAGTCGCT
CACGCCCTGG CCCAGGCAGT GGCAGGCACC CCGCCCTCTG CACCGGCCTG CCAAGGCCAG
GAAGGACTTC CCAATCGACC GCCCAACCTG TGCGCCGGGT GTTCCCACCG GGCAACCTAT
TACGCCGTGC GCCAGGTTTT CGGTGACGAG GCTATTTATT CCTCAGATAT CGGCTGCTAC
ACCCTGGGCA TCCTGCCCCC GCTCAAGGCT GCGGACTTTT TGTTCTGCAT GGGATCTTCG
GTTTCCGGAG GGTCCGGCAT GGCCGCGGCC ACGGGGCGGG ACGTTGTCGC TTTCATCGGC
GACTCCACGT TCTTCCACTC CGGCATTACC GGATTGGTCA ATGCGGTCTA TAACGACCAC
GACATCCTGG TTGTGGTCCT CGACAACCGT ACCACAGCCA TGACCGGCCA CCAGCCCCAC
CCAGGGGTTG ACCAGACCGC TCTTGGCGAA AATGCAAACA AAGTGGACAT TGAGCAGATC
GTCCGTGGTT GCGGTGTCAG TCAGATCAAG ACTGTCAAAC CGTTCAACCA CAAGGCCACT
CTTGAGGCAT TGCAGGAACT CAAGGCCATG TCGGGTGTCC GAGTGCTCAT CGCCAAGGAT
CCATGTGCGC TTTTTGCCAA ACGAGTGCTG AAGAAAAAGG CCCCGCAAGT GGCGTATGTA
GCCCAACAGG GTCAGGAAGT GCTCCAGTGC GCCGAACAAG TGGCCTGTCC CGCCTTCACC
ATCTCCGAAG GACAGGTGAC CATCAGCGAA GACCAGTGCA CGGGGTGCAT GCTCTGCGTT
CAGATTTGCC CTGATATCAA AGCTCGGAAA AGGAGCGATA ATGGATAA
 
Protein sequence
MPHPLLADAP GTTHLLLGNE AIARGALEAG VGCVTCYPGT PSSEVPDTLF RVSPEGNFHF 
EYSVNEKVAL EVGGGAALGG VPTLVTMKHV GVNVAADPLM TLAYIGTPGG LVLLSADDPG
CHSSQNEQDN RAYARLAGMP CFEPSTAQEA KDMTRDALLL SAKWQQPVML RTTTRVNHLR
GPVRFDALPP AKRTGQFEKN PMRFVPIPAV ARDRHPKLLH QLASIEEEIQ NQEWNTVSGQ
GPVGIIASSI CRAYVQDALL DMPQADQFSL LELKVSYPLP QHQLLEFIQG RDKVVVVEEL
EPFVESAIRE MAQRHQLDLE IIGKSEFLPR CGEFSTRTVA HALAQAVAGT PPSAPACQGQ
EGLPNRPPNL CAGCSHRATY YAVRQVFGDE AIYSSDIGCY TLGILPPLKA ADFLFCMGSS
VSGGSGMAAA TGRDVVAFIG DSTFFHSGIT GLVNAVYNDH DILVVVLDNR TTAMTGHQPH
PGVDQTALGE NANKVDIEQI VRGCGVSQIK TVKPFNHKAT LEALQELKAM SGVRVLIAKD
PCALFAKRVL KKKAPQVAYV AQQGQEVLQC AEQVACPAFT ISEGQVTISE DQCTGCMLCV
QICPDIKARK RSDNG