Gene Dole_2139 details


Gene Information

Locus tag: Dole_2139
Symbol:
ID: 5694982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2595460
End bp: 2598495
Gene Length: 3036 bp
Protein Length: 1011 aa
Translation table: 11
GC content: 59%
IMG OID: 641264740
Product: hypothetical protein
Protein accession: YP_001530020
Protein GI: 158522150
COG category:
COG ID:
TIGRFAM ID:
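
The start, end, and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2595460
end_bp = 2598495

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 3036 1011
```

Under these assumptions the computed values agree with the listed gene length (3036 bp) and protein length (1011 aa).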


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000321254
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTTTGC GAAAATATAA TCGAGCGGTG ATGCTTCTGC TCTGCCTGCT TCTCATTGCT 
CCTGCCGCTT ACGCCCATGG GTGGCGGCCC GCCCGTTCTC TGGAAACAAA GGCGGAAATG
GACAAAGACC GTCAGCTCAC CATCCGGACC GTTGCCATGT CAGAAAAAAC CATGGTGTTT
GAGTATGCCA TGCCCGGCCT TGAGGCGGTT TTTGCGGAAA ACAGCCCCAA AACGGCCACT
GAAAGCGATG CACCATCCGG TCCCGTAAAA CCGGTGCTGG GAAACGCAGC CCACCGTTCT
GAGCCGGGCA TGCCGGTTCT GCCGGTGGTG CCGGCCCGGA TTGTTCTGCC GGAAGGACAG
GACCTGGACG ATGTGTGGGT CACGCCGGGC AAAAAGACTG TTCTGCCGGG CAAATACCTG
GTGGCGCACG GCCAGACCCC TTATCCCCGG ATTCCGGGCG TCAAGCCGGA AAAAACGGAA
AAGAACAGGG CCGTGTATGA ATCCGATGAT CCCTACCCGG GAAAGCTGGT AGAGATCGTC
GGGGTCCAGA AAAAACGGGG GGTCTCCATC CTGCTGGTAA ACCTTTATCC CGTGGTCTAC
CGCCCAAAAA GCGGCACCCT TTCCTGGTAT GAAACCCTGA CCCTGACGGT AACCACAAAG
CCGGCGGACA GCACTTCCTC CCGTAAACCG GGCCTGCCCT ACAGGTCCTC AACTACCGGC
ATGCTGGACC GGGCGGTGGA AAATCCGGAA ATGGTGAACA CCTACACGGA TAAAAAAAAA
ACTGAATACG CGCCAGCGGA AGACGCGATC TACCCCCGGG CCACTGCCCG TTATGTGCTG
ATCACCAGCC AGGCCATTAT CGACGACGTC TCCATAGACC CTTCAGTGGC CGACTTTATC
GCCCACCGCC AGGCCCAGGG ACTGACCACG GCTGTGGTAT CCATTGACTC GGTCCTTGCC
GATGACAACT ATACCGGCAG GAATGATGCC GAAACCCTGC GCAATTTCAT CATTTACGCC
TACGAAAACC TGAACACCGA GTATGTTCTT TTGGGTGGCG ACACCGGCAT CATCCCCCCG
CGTACGCTCT GGTGCGAGGC GGGAAGATAT GAGGATTACA TTCCTTCCGA TCTTTACTAC
CAGTGCCTGG ACGGTGACTT TAACTTCGAC GGGGACGAAG AGTGGGGGGA ACGCACGGAC
GGAGAAGACG GGGGAGATGT TGACCTGATG GCCGAAATCT TCGTGGGCCG GGCCTCGGCG
GAAAACGCCG CGGAGCTGTC CAATTTTTTT TCTAAAACCA TGGCCTATGA AACCGGTGCC
GATGACGCAG CCTGTCTCAC CACGTCCCTG ATGTGCGGCG AATACCTGGG GTTTGGCGGA
ATCGCGGACT ACGCCGAACC GGGCCTGGAG GAGATACGGC TGGGATCGGA CAACCATGGC
TACACCACCG CCGGGTTTGC CGCCTTTCCA AATTTTACAG TAGACACGCT TTACGACAGC
GATTCTTACG AATGGCCTGC CGCTGAAATC ATAAACGACA TCGATTCAAA CGCCTACAGC
ATCATCGTTC ACGACGGCCA CGGGCTTGAT GACCAGATCA TGAAGCTGTA CAACGCTGAT
GCCGACGCCC TTGTCAACAC GAATTTCATA TTTGCACAAT CTCAGGCCTG CTTTTCCGGG
AACTACGAGG GGGACTGCAT TGCCGAGCAC CTCACCACAT CCACCCGCCA CGGCATGTTT
GCCGTGGTTT TCAACTCCCG TGAAGGGTGG GGCTTGCCCT ACAGCACGGA CAGCCCGAGC
CAGCGGCTGG CCCGTGAATT CTGGGACGCC TGTTTCGGTG AGGGGCTGGC CGACCTGGGG
GCGATCAACA CCGCCAGCCA CGAGGACAAC CTCTGGGACA TCGGCGATGG TTTTATTCGC
TGGTCTATCT ACGAAACCAA CCTGCTGGGT GATCCCGCTA CCGTGCTGCG GGGGCGGCTT
TCTCTTTCCC TGGAAATTCC GGCCAATGCC ATGGAAGGCG ACGTTCGCTC CGGCCGGGTG
CAGGTTTTCT CTGTCAAACA GGTTCCGGCC ATCAAAACGG TCGCAACAGA CCTATTGATA
TCGCTGACCA GCAGCGACAC AACAGAGGTA ACCGTGCCCG CTTCCGTCAC CATTCGGGCC
GGTGAATCAG AAGTCTCGTT TGACCTGACG GTGGAAGACG ACCTGTTGCT GGACGATGTT
CAGACCGCGC TGATCACCGC CTCGGCCCCT GGCTTTATCG CGGCAACCGC CGCAATCGAT
GTGGAAGACA ACGACACCGA CACCGACGAC GACGGGCTGT CCGACGACCT GGAGGCAATA
TCCCGCACCG ACCGGGATGA CGCGGATACC GATGACGACG GTATTTTCGA CGGTGATGAA
GACACCAATC AGAACGGCAC GGTGGACCCT GGAGAGACCG ACCCCTGTAA TCCCGACACA
GACGGCGACG GCATTCAGGA CGGCACCGAA CTCGGTTATA CCGATGGCAC CGGCCTGGAT
ACGGACAGCG ATTCCTTCCA GCCCGACCTT GATCCAAACA CCACTACCGA TCCCCTGGAC
ACCGACACTG ACGGCGACGG CCTGCCTGAC GGGCAGGAGG ACGCCAACCG CAACGGCCGG
GTGGATGCCG GTGAGACCGA TCCTTCCATC AATGTACGGC CCACGGCAAA TGCCGGGGCC
GACCAGGCAG TGGATGAGGG AGACCGGACC ACGCTTGACG GCACCGGCTC GTTTGATATT
GACGATGGTA TTTCATCCCA TGTCTGGACC CAGACCGCCG GGCCCACGGT AACGATTGCA
AATTCCCACA CGGTCCGGGC CTCCTTTACG GCCCCGGATG TGGAAGCCGA CCAGACCCTG
ACCTTCCGGC TTTCTGTGAG AGACAGGGCC GGCCAGTTGA CGTCCGATAC CTGCACCGTC
ACTGTCCGGT GGGACGGCAT TGCCCCACCG GATAATGACA CCCCTCCCGC AGCCGGCGGC
GGTGGGGGCG GCTGCTTTAT CGAGGCCGTC AGATAG
 
Protein sequence
MLLRKYNRAV MLLLCLLLIA PAAYAHGWRP ARSLETKAEM DKDRQLTIRT VAMSEKTMVF 
EYAMPGLEAV FAENSPKTAT ESDAPSGPVK PVLGNAAHRS EPGMPVLPVV PARIVLPEGQ
DLDDVWVTPG KKTVLPGKYL VAHGQTPYPR IPGVKPEKTE KNRAVYESDD PYPGKLVEIV
GVQKKRGVSI LLVNLYPVVY RPKSGTLSWY ETLTLTVTTK PADSTSSRKP GLPYRSSTTG
MLDRAVENPE MVNTYTDKKK TEYAPAEDAI YPRATARYVL ITSQAIIDDV SIDPSVADFI
AHRQAQGLTT AVVSIDSVLA DDNYTGRNDA ETLRNFIIYA YENLNTEYVL LGGDTGIIPP
RTLWCEAGRY EDYIPSDLYY QCLDGDFNFD GDEEWGERTD GEDGGDVDLM AEIFVGRASA
ENAAELSNFF SKTMAYETGA DDAACLTTSL MCGEYLGFGG IADYAEPGLE EIRLGSDNHG
YTTAGFAAFP NFTVDTLYDS DSYEWPAAEI INDIDSNAYS IIVHDGHGLD DQIMKLYNAD
ADALVNTNFI FAQSQACFSG NYEGDCIAEH LTTSTRHGMF AVVFNSREGW GLPYSTDSPS
QRLAREFWDA CFGEGLADLG AINTASHEDN LWDIGDGFIR WSIYETNLLG DPATVLRGRL
SLSLEIPANA MEGDVRSGRV QVFSVKQVPA IKTVATDLLI SLTSSDTTEV TVPASVTIRA
GESEVSFDLT VEDDLLLDDV QTALITASAP GFIAATAAID VEDNDTDTDD DGLSDDLEAI
SRTDRDDADT DDDGIFDGDE DTNQNGTVDP GETDPCNPDT DGDGIQDGTE LGYTDGTGLD
TDSDSFQPDL DPNTTTDPLD TDTDGDGLPD GQEDANRNGR VDAGETDPSI NVRPTANAGA
DQAVDEGDRT TLDGTGSFDI DDGISSHVWT QTAGPTVTIA NSHTVRASFT APDVEADQTL
TFRLSVRDRA GQLTSDTCTV TVRWDGIAPP DNDTPPAAGG GGGGCFIEAV R
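
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the tool that produced this record: the function name `translate` and the variable names are hypothetical, and the codon table is built from the standard NCBI codon ordering. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as alternative starts; this gene begins with ATG, so that difference does not matter here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# The amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored, and any
    trailing bases that do not form a complete codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence listed above.
first_groups = "ATGCTTTTGC GAAAATATAA"
print(translate(first_groups))  # MLLRKY
```

The output matches the first six residues of the protein sequence (MLLRKY). Passing the full gene sequence to the same function should reproduce the complete protein under the assumptions stated above.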