Gene Dole_1940 details


Gene Information

Locus tag: Dole_1940
Symbol:
ID: 5694780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2343735
End bp: 2345303
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 58%
IMG OID: 641264538
Product: SH3 type 3 domain-containing protein
Protein accession: YP_001529821
Protein GI: 158521951
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
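
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2343735
END_BP = 2345303
GENE_LENGTH_BP = 1569
PROTEIN_LENGTH_AA = 522


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2345303 - 2343735 + 1 gives 1569 bp, and 1569 / 3 - 1 gives 522 residues, in agreement with the listed values.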


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCTGGA AAACTCACAC CATTGTCAGA GGCACAATCG TTATATTCGG TATTGTCCTG 
GCGGCATGGA TGCTGGCTGC GTGTGCCCCG TGGTACCGGT CTTATGACAT CACGTCGGGC
GAGGATTTGA CAAAAATTTC CGCCGTTCCC GAGTTACGAA AAGCTCTGAA GGACAGTAAG
CCGGACGTTC GAATGGCGGC GGCCACGGCC CTTGGACGGA TCGGGCCCGA CGCAAGGGAT
GCGCTCTCCG ATCTTGTGGA TGTACTGGGC GACAACAGGC ACGAGGTGCG CGAGGCATCG
GCAAATGCGA TTTCGTCTAT TATCGGAACC GCACCGGTAT CGGAAGCGGA TAAGAACCTG
ATGGTGCAGG TGCAGGCCAA CCGCCTTGCG TCCGAGGACT GGGCGGCTCG TGTGGACGCC
GCCGACCAGT TGGCCCAAAT GGGCCCGGCT GGCGCGGATG CGGTTCCCGT GTTGATTTCC
ACCCTGTCGG ACGAAACAGA ATGGAGTTAT TACTGGACCC GGCAGTACGA TAAAGTAAGA
CGCGCCGCCG CAAACGCCCT TGGGGAGATG CGTTCCGCGG CAACGGCCGC CAGTCCCGCG
CTGATCAAGG CCTCAAAATA TCAAGATCCC GGGGTTCGCC TGGAGGCGGT CAGGGCCCTG
GGTAAAATCG GCACTCCATC CGACAGCACT GTAGTTAAGG CATTAACGGC CGCGTTAAAG
GACGATGACG CGGGTGTGCG TCGCGAGGCG GCCGACGCAC TGGGGGCCTT TGAGGTGTAT
GCAAACAATA CGGTTCCCAA CCTGGTAGAC GCTCTTTCCG ATCAGGATGT TGATGCGCGA
AGAAAGGCGG CTCAGGTACT GGGCCGCTTC GGTCCCAAGA CAGACGCGGC GGCGGAAGCC
CTTGTGGCCG CGCTGAAGGA TACCGACAAA GCGGTCAGGC AGACGGCGGC CCGCGCAATT
GCCGAATTTG GCATCGACAA CAAGACGGCG GCAGCCACCC CCCTGAGACC GCCGGTTGCC
GCTATGGCCC CTGAAGAGAC TGCAGCAGTG GAGACTACGG CACAACCGGA GACAAAAATC
CGTTCCACTG TGGATCTGCT TAACATTCGG GCCATGCCCA GTGTAAACAG CCGACGTGTA
GGTAAACTGC TGCAAAACGA AATTGCAACA GTGGTTGAGA CCCTGGTGGA TTGGGTCAAA
ATCGAGAAAC CCGACGGCAC CACCGGCTAT GTGTTTAAAG AGTACACAGC GATGGTGCAT
GAGACAGGGG ATGCCTCCAG GGTGCTACAA CCAGAGTCAC AGAAGGCGCA AGCCACTGTC
AACATGCCGA TGGTACCGGT TGTCACCGCA CCGGTTGCCG TTGCTTCGGC TTCGACAGTT
CCCAAAATAC GGCCAATCGT GGATGCCCTT GAGATGCGAT CAGAGCCTTT TGGGAGTGAA
CAGGTCGGCC AACTGCTGCG TAATGAAGCG GCGGAGGTTG TCGAGAGCCG GGCCGGATGG
ATCAAAATAA AAAAAGCCGA CGGCACCACC GGCTATGTGT TTAAAGAATA TACAGAGAGT
GCCCCCTGA
 
Protein sequence
MPWKTHTIVR GTIVIFGIVL AAWMLAACAP WYRSYDITSG EDLTKISAVP ELRKALKDSK 
PDVRMAAATA LGRIGPDARD ALSDLVDVLG DNRHEVREAS ANAISSIIGT APVSEADKNL
MVQVQANRLA SEDWAARVDA ADQLAQMGPA GADAVPVLIS TLSDETEWSY YWTRQYDKVR
RAAANALGEM RSAATAASPA LIKASKYQDP GVRLEAVRAL GKIGTPSDST VVKALTAALK
DDDAGVRREA ADALGAFEVY ANNTVPNLVD ALSDQDVDAR RKAAQVLGRF GPKTDAAAEA
LVAALKDTDK AVRQTAARAI AEFGIDNKTA AATPLRPPVA AMAPEETAAV ETTAQPETKI
RSTVDLLNIR AMPSVNSRRV GKLLQNEIAT VVETLVDWVK IEKPDGTTGY VFKEYTAMVH
ETGDASRVLQ PESQKAQATV NMPMVPVVTA PVAVASASTV PKIRPIVDAL EMRSEPFGSE
QVGQLLRNEA AEVVESRAGW IKIKKADGTT GYVFKEYTES AP
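
The relationship between the two sequences can be illustrated with a minimal Python sketch. It strips the block spacing used in the listing, translates codon by codon, and computes a GC percentage. The codon table is built from the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs mainly in which codons may act as starts). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sketch makes no attempt to handle ambiguous bases.

```python
# Codon table built from the compact NCBI-style encoding (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    dna = clean(dna_text)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna_text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna_text)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence listing above.
first_row = "ATGCCCTGGA AAACTCACAC CATTGTCAGA GGCACAATCG TTATATTCGG TATTGTCCTG"
print(translate(first_row))
```

Translating the first row yields MPWKTHTIVRGTIVIFGIVL, the first 20 residues of the protein sequence, and the final nine bases GCCCCCTGA translate to AP followed by a stop, matching the end of the protein. Passing the complete gene sequence to `gc_percent` would allow the listed GC content to be checked in the same way.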