Gene Dole_2063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2063 
Symbol 
ID5694906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2514490 
End bp2515647 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content57% 
IMG OID641264664 
Product4Fe-4S ferredoxin iron-sulfur binding domain-containing protein 
Protein accessionYP_001529944 
Protein GI158522074 
COG category[C] Energy production and conversion 
COG ID[COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000008917 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA AAGAAGCAGT TGAACAGAAA TTCAAAAAGG CAGCGGGACT GATCTCAAGC 
GCGGGCATGA TTCCCTTTGC GGTGACCGAT ACGCTTTTAG AGATCGTCAG GTTCTATCTG
GATGAAGCGG ATGCCGACTT TATCAATGCC GCCTTTGACG GTGCCAAATC CCTCTCCATA
GACCAGCTCA AAGAGAAGAC CGGTCTTTCG GAAGAGGAGA TAACAGCCAG AACCGATTCG
CTGGCCAAAA AAGGCATTCT GTTCAACCAG CCCAACAGCC GGGGCGTCAT GGTTTACCGG
TTGTTGCCCC TGATCCTGGT GGGGGCCTTT GAATATACCT TCATGAACAA ATTGCCCGAG
GGAAAAGAGC GGGAGCCGCT TGAAAAGATC GCAAAGCTTT ATCACACCCT GCTGCAGGAG
TTGCGGGATA ACATGCAGCG CGGTTACGAC AACCTGCTGC CGATCTTTGA GCAGCAGCCC
CCGGTGGACC GGACCGTTCC CACTTTTACC ACGGAAACCG GCAACACCAT TCAGATCAAC
AGGGCCATTG GGGCCGAAGA CACCGTGCTG CCGGCCCAGA CCGTTGAAGA AATCATCAAC
AAGTTTGACG ATATCGCCGT GGGCCACTGT TTCTGCCGGA ACTACAACAA GGTGCTGGGC
CATGACTGCG AAATTCATGC ACCCGCCGAG GTCTGCTTTA CCTTCGGCAA GTCCGCCCGC
CACACCGTGG CCCAGGGGTT TGCCCGGCTG GTGTCAAAAG AGGAGGCCCT GGCCATCATG
AAGCAGGCGG AAGAGGCCGG CCTGGTCCAC AAAGCCTTTC ACAACGGCTC GAATATCAGC
AAGGAAGAAA ACAGCATCTG CAACTGCTGC AAGGACTGCT GCGACACCTT TACCCTCTGG
CGCAACGGCG CCACACCCAT GATCAACTCC ACCAACTACC TGTCCGTCAT TGACGAGGAC
ACGTGCACCG GCTGCGGCAT CTGCGTGGAA CGCTGCCCGG TGGATGCCAT TGTGCTGGGC
AGTGAGGGCA CGGCGGTTCG CGAGGAAAAA TACTGCATCG GCTGCGGCAT CTGCGCCCGT
TTCTGCCCCG AAGGGGCCAT CTCCCTTCAG GAGGGCATGA GACGGGTTTA TGTTCCGCCC
CCACGCTTGA GAGCATAG
 
Protein sequence
MSDKEAVEQK FKKAAGLISS AGMIPFAVTD TLLEIVRFYL DEADADFINA AFDGAKSLSI 
DQLKEKTGLS EEEITARTDS LAKKGILFNQ PNSRGVMVYR LLPLILVGAF EYTFMNKLPE
GKEREPLEKI AKLYHTLLQE LRDNMQRGYD NLLPIFEQQP PVDRTVPTFT TETGNTIQIN
RAIGAEDTVL PAQTVEEIIN KFDDIAVGHC FCRNYNKVLG HDCEIHAPAE VCFTFGKSAR
HTVAQGFARL VSKEEALAIM KQAEEAGLVH KAFHNGSNIS KEENSICNCC KDCCDTFTLW
RNGATPMINS TNYLSVIDED TCTGCGICVE RCPVDAIVLG SEGTAVREEK YCIGCGICAR
FCPEGAISLQ EGMRRVYVPP PRLRA