Gene Dole_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1148 
Symbol 
ID5693982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1365638 
End bp1366906 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content55% 
IMG OID641263741 
Producthypothetical protein 
Protein accessionYP_001529031 
Protein GI158521161 
COG category[S] Function unknown 
COG ID[COG4269] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTGGT TTTACATGGA CGGAGACCGG GAGGTCGGCC CGATAAGCAC GGCCGACATG 
CAGCAGCTGA TCAACACAAA ACAGATTACC GGCAAAACCC TGGCCAGGAA ACAGGACATG
GACCGGTGGC ACCCCCTGGC GGAGTTGACC AAAGCCAAGA AGCCGGACAG TCAGGCACCG
CCGCCGGCCG ATAATCCGCC TTCAGCAAAC GAAGCGCCAC CGCCGGCGCC ACCTGTCAGT
GAACCACCCT CTCCTGCCCC CGCCGCTGTG AGCACCACCC CCCAACGCGC CGTGCCCGAC
AACATCCCGT TTCAATTCAA AGGAACAGGC GGAGAGTATT TTAAAATCTG GATTGTCAAC
GTGCTTTTGT CCATTCTCAC CCTGGGTATC TATTCGGCCT GGGCCAAGGT TCGCCGGAAA
CAGTATTTTT ACGGGAACAC TCAGGTGGCG GGCGCGGGGT TCCGCTACCT TGCCGACCCG
GTTAAAATTC TCAAAGGCCG CCTGATCGTT TTTGTCTTCT TTATTCTCTA CTCCACCGCC
GGCGAATTTA TCCCTGTCCT GGGGGGCATC ATGATGCTGG CATTTCTCAT TTTTCTTCCC
TGGCTGGTGG TGCGGTCCCT GGCATTTAAC GCCCGCAACA GTTCACTGCG AAACATCCGT
TTCAATTTCA CCGGCACTTA TGGCCAGGCC GCCAAGGCGT ATCTGCTTTT TCCGATCCTG
AGCGTCCTGA CTCTGGGAAT CCTGTTGCCA TATGCCTTTT TCCGGCAGAA ACAGTTTGTG
GTTGAAAACT CTTCATACGG CACAACCCCG TTTCGTTTTC ATGCCACGGC AAAAGATTAC
TACCGCATCG TGGGATTGTT TATTCTCCAC GCGCTGATTT TCATCGTGGC GGCGGTGGTC
GTCAGCCTGC TGTTTGCCCC CCTTTCAGCA CTGATCATCA TGGTGCTCTA CCTTTACGCC
ATGGCCTATT TCAGCGTCAA GACCACCAAC CTGCTTTACA GCTCCGGCAC ACTGGCAGAC
CACCGGTTTT CAGCGAACCT GGGAATAAAA GACTACGCCC TGATCATCCT CACCAATTCC
CTGGCCACGG TTGCCACCCT GGGGCTTTTT TACCCTTTTG CCGTGGTGCG GGCGCTGCAA
TACAAAATCG ACCACCTGTC CCTTCTGCCG GGCAGCGATC TTGACCGTTT TGTGGCCGCG
GAGATCAAAG AGACCAGTGC GCTGGGAGAA GAGATGTCCG ATTTTATGGA TTTTGATTTC
GGATTATAG
 
Protein sequence
MMWFYMDGDR EVGPISTADM QQLINTKQIT GKTLARKQDM DRWHPLAELT KAKKPDSQAP 
PPADNPPSAN EAPPPAPPVS EPPSPAPAAV STTPQRAVPD NIPFQFKGTG GEYFKIWIVN
VLLSILTLGI YSAWAKVRRK QYFYGNTQVA GAGFRYLADP VKILKGRLIV FVFFILYSTA
GEFIPVLGGI MMLAFLIFLP WLVVRSLAFN ARNSSLRNIR FNFTGTYGQA AKAYLLFPIL
SVLTLGILLP YAFFRQKQFV VENSSYGTTP FRFHATAKDY YRIVGLFILH ALIFIVAAVV
VSLLFAPLSA LIIMVLYLYA MAYFSVKTTN LLYSSGTLAD HRFSANLGIK DYALIILTNS
LATVATLGLF YPFAVVRALQ YKIDHLSLLP GSDLDRFVAA EIKETSALGE EMSDFMDFDF
GL