Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2139 |
Symbol | |
ID | 5694982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2595460 |
End bp | 2598495 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641264740 |
Product | hypothetical protein |
Protein accession | YP_001530020 |
Protein GI | 158522150 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000321254 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTGC GAAAATATAA TCGAGCGGTG ATGCTTCTGC TCTGCCTGCT TCTCATTGCT CCTGCCGCTT ACGCCCATGG GTGGCGGCCC GCCCGTTCTC TGGAAACAAA GGCGGAAATG GACAAAGACC GTCAGCTCAC CATCCGGACC GTTGCCATGT CAGAAAAAAC CATGGTGTTT GAGTATGCCA TGCCCGGCCT TGAGGCGGTT TTTGCGGAAA ACAGCCCCAA AACGGCCACT GAAAGCGATG CACCATCCGG TCCCGTAAAA CCGGTGCTGG GAAACGCAGC CCACCGTTCT GAGCCGGGCA TGCCGGTTCT GCCGGTGGTG CCGGCCCGGA TTGTTCTGCC GGAAGGACAG GACCTGGACG ATGTGTGGGT CACGCCGGGC AAAAAGACTG TTCTGCCGGG CAAATACCTG GTGGCGCACG GCCAGACCCC TTATCCCCGG ATTCCGGGCG TCAAGCCGGA AAAAACGGAA AAGAACAGGG CCGTGTATGA ATCCGATGAT CCCTACCCGG GAAAGCTGGT AGAGATCGTC GGGGTCCAGA AAAAACGGGG GGTCTCCATC CTGCTGGTAA ACCTTTATCC CGTGGTCTAC CGCCCAAAAA GCGGCACCCT TTCCTGGTAT GAAACCCTGA CCCTGACGGT AACCACAAAG CCGGCGGACA GCACTTCCTC CCGTAAACCG GGCCTGCCCT ACAGGTCCTC AACTACCGGC ATGCTGGACC GGGCGGTGGA AAATCCGGAA ATGGTGAACA CCTACACGGA TAAAAAAAAA ACTGAATACG CGCCAGCGGA AGACGCGATC TACCCCCGGG CCACTGCCCG TTATGTGCTG ATCACCAGCC AGGCCATTAT CGACGACGTC TCCATAGACC CTTCAGTGGC CGACTTTATC GCCCACCGCC AGGCCCAGGG ACTGACCACG GCTGTGGTAT CCATTGACTC GGTCCTTGCC GATGACAACT ATACCGGCAG GAATGATGCC GAAACCCTGC GCAATTTCAT CATTTACGCC TACGAAAACC TGAACACCGA GTATGTTCTT TTGGGTGGCG ACACCGGCAT CATCCCCCCG CGTACGCTCT GGTGCGAGGC GGGAAGATAT GAGGATTACA TTCCTTCCGA TCTTTACTAC CAGTGCCTGG ACGGTGACTT TAACTTCGAC GGGGACGAAG AGTGGGGGGA ACGCACGGAC GGAGAAGACG GGGGAGATGT TGACCTGATG GCCGAAATCT TCGTGGGCCG GGCCTCGGCG GAAAACGCCG CGGAGCTGTC CAATTTTTTT TCTAAAACCA TGGCCTATGA AACCGGTGCC GATGACGCAG CCTGTCTCAC CACGTCCCTG ATGTGCGGCG AATACCTGGG GTTTGGCGGA ATCGCGGACT ACGCCGAACC GGGCCTGGAG GAGATACGGC TGGGATCGGA CAACCATGGC TACACCACCG CCGGGTTTGC CGCCTTTCCA AATTTTACAG TAGACACGCT TTACGACAGC GATTCTTACG AATGGCCTGC CGCTGAAATC ATAAACGACA TCGATTCAAA CGCCTACAGC ATCATCGTTC ACGACGGCCA CGGGCTTGAT GACCAGATCA TGAAGCTGTA CAACGCTGAT GCCGACGCCC TTGTCAACAC GAATTTCATA TTTGCACAAT CTCAGGCCTG CTTTTCCGGG AACTACGAGG GGGACTGCAT TGCCGAGCAC CTCACCACAT CCACCCGCCA CGGCATGTTT GCCGTGGTTT TCAACTCCCG TGAAGGGTGG GGCTTGCCCT ACAGCACGGA CAGCCCGAGC CAGCGGCTGG CCCGTGAATT CTGGGACGCC TGTTTCGGTG AGGGGCTGGC CGACCTGGGG GCGATCAACA CCGCCAGCCA CGAGGACAAC CTCTGGGACA TCGGCGATGG TTTTATTCGC TGGTCTATCT ACGAAACCAA CCTGCTGGGT GATCCCGCTA CCGTGCTGCG GGGGCGGCTT TCTCTTTCCC TGGAAATTCC GGCCAATGCC ATGGAAGGCG ACGTTCGCTC CGGCCGGGTG CAGGTTTTCT CTGTCAAACA GGTTCCGGCC ATCAAAACGG TCGCAACAGA CCTATTGATA TCGCTGACCA GCAGCGACAC AACAGAGGTA ACCGTGCCCG CTTCCGTCAC CATTCGGGCC GGTGAATCAG AAGTCTCGTT TGACCTGACG GTGGAAGACG ACCTGTTGCT GGACGATGTT CAGACCGCGC TGATCACCGC CTCGGCCCCT GGCTTTATCG CGGCAACCGC CGCAATCGAT GTGGAAGACA ACGACACCGA CACCGACGAC GACGGGCTGT CCGACGACCT GGAGGCAATA TCCCGCACCG ACCGGGATGA CGCGGATACC GATGACGACG GTATTTTCGA CGGTGATGAA GACACCAATC AGAACGGCAC GGTGGACCCT GGAGAGACCG ACCCCTGTAA TCCCGACACA GACGGCGACG GCATTCAGGA CGGCACCGAA CTCGGTTATA CCGATGGCAC CGGCCTGGAT ACGGACAGCG ATTCCTTCCA GCCCGACCTT GATCCAAACA CCACTACCGA TCCCCTGGAC ACCGACACTG ACGGCGACGG CCTGCCTGAC GGGCAGGAGG ACGCCAACCG CAACGGCCGG GTGGATGCCG GTGAGACCGA TCCTTCCATC AATGTACGGC CCACGGCAAA TGCCGGGGCC GACCAGGCAG TGGATGAGGG AGACCGGACC ACGCTTGACG GCACCGGCTC GTTTGATATT GACGATGGTA TTTCATCCCA TGTCTGGACC CAGACCGCCG GGCCCACGGT AACGATTGCA AATTCCCACA CGGTCCGGGC CTCCTTTACG GCCCCGGATG TGGAAGCCGA CCAGACCCTG ACCTTCCGGC TTTCTGTGAG AGACAGGGCC GGCCAGTTGA CGTCCGATAC CTGCACCGTC ACTGTCCGGT GGGACGGCAT TGCCCCACCG GATAATGACA CCCCTCCCGC AGCCGGCGGC GGTGGGGGCG GCTGCTTTAT CGAGGCCGTC AGATAG
|
Protein sequence | MLLRKYNRAV MLLLCLLLIA PAAYAHGWRP ARSLETKAEM DKDRQLTIRT VAMSEKTMVF EYAMPGLEAV FAENSPKTAT ESDAPSGPVK PVLGNAAHRS EPGMPVLPVV PARIVLPEGQ DLDDVWVTPG KKTVLPGKYL VAHGQTPYPR IPGVKPEKTE KNRAVYESDD PYPGKLVEIV GVQKKRGVSI LLVNLYPVVY RPKSGTLSWY ETLTLTVTTK PADSTSSRKP GLPYRSSTTG MLDRAVENPE MVNTYTDKKK TEYAPAEDAI YPRATARYVL ITSQAIIDDV SIDPSVADFI AHRQAQGLTT AVVSIDSVLA DDNYTGRNDA ETLRNFIIYA YENLNTEYVL LGGDTGIIPP RTLWCEAGRY EDYIPSDLYY QCLDGDFNFD GDEEWGERTD GEDGGDVDLM AEIFVGRASA ENAAELSNFF SKTMAYETGA DDAACLTTSL MCGEYLGFGG IADYAEPGLE EIRLGSDNHG YTTAGFAAFP NFTVDTLYDS DSYEWPAAEI INDIDSNAYS IIVHDGHGLD DQIMKLYNAD ADALVNTNFI FAQSQACFSG NYEGDCIAEH LTTSTRHGMF AVVFNSREGW GLPYSTDSPS QRLAREFWDA CFGEGLADLG AINTASHEDN LWDIGDGFIR WSIYETNLLG DPATVLRGRL SLSLEIPANA MEGDVRSGRV QVFSVKQVPA IKTVATDLLI SLTSSDTTEV TVPASVTIRA GESEVSFDLT VEDDLLLDDV QTALITASAP GFIAATAAID VEDNDTDTDD DGLSDDLEAI SRTDRDDADT DDDGIFDGDE DTNQNGTVDP GETDPCNPDT DGDGIQDGTE LGYTDGTGLD TDSDSFQPDL DPNTTTDPLD TDTDGDGLPD GQEDANRNGR VDAGETDPSI NVRPTANAGA DQAVDEGDRT TLDGTGSFDI DDGISSHVWT QTAGPTVTIA NSHTVRASFT APDVEADQTL TFRLSVRDRA GQLTSDTCTV TVRWDGIAPP DNDTPPAAGG GGGGCFIEAV R
|
| |