Gene Dole_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1789 
Symbol 
ID5694629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2161724 
End bp2165095 
Gene Length3372 bp 
Protein Length1123 aa 
Translation table11 
GC content52% 
IMG OID641264387 
Producthypothetical protein 
Protein accessionYP_001529670 
Protein GI158521800 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCTGA AAAAAATACT CTTTATGCTG GCATTGTCTG GCCTGATGGC AATAACTGCT 
CATGCAAACA CACCAGGCGA TACGGTTCCC GGCCTGGTTA ACATTGTCTG GGACGGCTCC
GGCAGCATGG TCTGGACAAT TGCTACTGAC GAAACTTTTT ATTGGATCGA CTCAGAACCG
TATGCGTATG TTTTTCCGGA AACCGAATAC TATTATGATG GTTCAAGCTA TGGCCGGCCT
TTGCCTGCCG ACAACCGAAA CCACTGGCAA CCCCAGTGTT CTTTCTTTAA TTACATGTAT
TACAATCCCG AAGTAAACTA CACACCCTGG CCCAGATGGA CCACACTGGA GGGTCATGCC
GGAGAATTTG AAAATTTCCA GGCTGATATA AACACACCAC TTCAACATCC CATCGACATT
TCACAAGGTG TCTTTGACCT GAACGACAAT TTTTTTTCCT GGGATTTAGT ACTGCCGGAC
GGCTTGGAAC CGGAAGACGT GGAGGTTGTT GTCGATAATT CAGACATTCC CTCCATTACA
GTGGATGACA TGGATGCAGG TTTTTTATCA GACCCGGCTA TCGGTTCCTG GATACGATAT
GAAAATCCGG ATTATATAGG AAATGATTAC TATACCATAC GATATAGTGG AAAACCGACA
ACTGCGACCT GGCAGACGGA TATAACAAAC AGCGGTGAAT ATGAGGTCCT GATTAGTGTC
CCCTTTGGAA TAAACCGGAC TCGAGAAGCC AATTACACTG TTTTTCACGA CAACGGTTCA
AATGAGCAAG CAATCAGCCA GCGGGCCGAT ACCCATCAGG AGTGGATATC CTTGGGTAGT
TATAATTTTA CTGATACAGC AAAAATTATG GTAATAAGCA GTCAAGCTTT CACTCACCTG
GCTGTAGATG CTGTCATGCT TGTTCCAAAA TTTGACACAT CCGCCAACTT GCCTGTCGCT
TTTTCCTCAA CCGGGCCATG GCAGACAGTC AGCAGCAGCC AGGCCTACCC TGACGCAACT
GGAAATAGCC ACTGTCTTGT TACCGCGGAG ACCGGCACCC CCTGTTCCGC CACATGGCAG
GCCAACAACC TGGATCCTTC CATAACCTAT GATGTCTATG CCCGTTGGGT GGAAGGCAGC
GACCGCTCAA CCGCTGTTGA ATACAATATT GATCATATAG ACGGCAGCCA GGCTGTAACA
GTAAACCAGC GCCTCAATGG CGGCGCATGG TGCCTCCTGG CTTCCGGCCT GGGCTTTGGC
TGCACCGGTA CAGTCACCTT GTCCCATACC CCCACGGATT TGGCATCAGA CAGTGCCTGC
GCCGACGCAG TCTTATTTGT TAAATCCGGG CTGCGGTTGA CAGGAGATAC AATCTTTGCC
CATTATTTTA TTCAAAAATC CCCCGACGAT GTCTTTCTGA TCGATCTTAA CGGCGGGATC
GCCTACTTCA GGGCACTGCT CGCGCCAGAT GGCAAAACCG TCGAGGCACT GACCCAGATT
ACGGAAGAAG AAGCCGGCGC GGAAGGGCTT GTCACCGGCC GCACCTACCT GGAGGAACGA
CAAAATTTTG CCAACTGGTA TCAATTTTAC CGCAGGCGTT ATTTCACCGG TATCGCGACC
CTTGGACAGA TGATAAATGA CTGGTCTAAT GTTTTTCTGC GTATCACCTC GTTTCCGCCT
GATAATTTTT TCAGAAGAAT CTCTCCTGTT GATGTAACGA TCCGCAATAC CGATGGGGAA
CTGACCCACT ACAACGAAAA AGACGACATC CTTTATGATT TATACACAAT GATGCACCCC
TGGGGCGGGA CCTTTCTGCG CGAAGGGCTG TACCAGGCCG GCCAGTTTTT CGAACTGGGC
CGAAGTGACT TCTGGCCTGA GGACAGATGG GAGAACCTGC CGCCCGGGGC GTTCCAGGTA
TATTCCAGCC CGGATTACTA CCCGTTTTTT ACCCCGGAAA ACGGTGGTGC GTATCAGCAG
GCATTCACCG TTGTAATGAC CGACGGCCTG TGGAACGGCA CATTGAGATA TCCGGTGGGC
AATGTCGACG GGGGTTGGAT GTACAACCCG CCTTTCACCG GCGGCGTTTT CGGTGACGAC
TACTCTGACA CTGCCGCGGA TGTAGCCATG TACTTTTATT GTCGGGACCT GCGCCCGATT
CTAAATGATA TTGTGCCGCC AGGTGACTTT GATTTCGCCC CCCATCAACA CATGGTGACA
TACGCTGTTG CACTAGGCGT TCGTGGGGCA CACTTCAGCG AAGCGTATCG TCAGATGACC
GAACCCTATC TGAGATATGG CGCGGAGCCC CCATCCTGGA TCGGCTGGCC TCAGATCGTG
CCTGATACAA ACAGCACTAT TGATGATCTC TGGCATGCTA CCATAAATGG CCGGGGGTCG
TTTTTTTCAG CCATGGATGT AAATAATTTG GCTGCTGCCA TGGCCGCAAT ACGGCAGGAT
ATCACGCAGC GACTGGAAAT TCCTTCCACC CCCGCCAATC TGATCAACTG GAACGGCAAC
CTGGTGGCCG ACTTCGGCGA CAACGGCCTG TGGTACCACA ACGGCAGCAA CTGGAACTGG
ATGACCAACC GGGGCCATGT CAACCAGATG GTAGTGTGGG ACGGCAAACT GGTTGTGGAT
TTCGGGGCTG ACCACGGCAT GCACTACTAT GATGGTTCGT GGCACTGGAT GACCAACAAG
GATGGTGTGG CCATGATGAC CGTTTGGGGC AACAAGCTGG TTGTCGATTT CGGCAACGGT
CGGAGTGTCT ACAACTATGA CGGTGCATGG CACTGGATGT CCAACAAGGA TGACGTGGCC
GACATGACCG TTTGGAACAA CAAACTGGTT GTTGATTTCG GCGGCGGTCG GAGTGTCTAC
AACTATGACG GTGCATGGCA CTGGATGTCC AACAAGGATG ACGTGGCCGA CATGACCGTT
TGGAACAACA AACTGGTTGT TGATTTCGGC GGCGGTCGTG GAATGTTCAA CAACGACGGC
ACCTGGCACT GGATGACCAA CAAGGATGAT GTGGCCATGA TGACCGTTTG GGGCAACAAG
CTGGTTGTCG ATTTTGGCAA CGGTCGAAGC GTCTACAACT ACGACGGCTC ATGGCACTGG
ATGACCAACA AAGATGACGT GGCGAAAATG GTGACCTGGA AGGACGGTGC CGACAAACTG
GCCGTAGACT TCGGCTCAGG CCGGGGGATG TTCTATTATG ACGGTGTATG GCACTGGATG
TCCAACAAGG ACGATATTAC GGATATGATC GCCTGGGGCA CCTGCCTGGC CGTGGATTTT
GGCTCAGGCC GGGGGATGTA CAACTACGAC GGTGCCTGGC ACTGGATGAA AAACTGGAGC
ACGGCGGATT AA
 
Protein sequence
MRLKKILFML ALSGLMAITA HANTPGDTVP GLVNIVWDGS GSMVWTIATD ETFYWIDSEP 
YAYVFPETEY YYDGSSYGRP LPADNRNHWQ PQCSFFNYMY YNPEVNYTPW PRWTTLEGHA
GEFENFQADI NTPLQHPIDI SQGVFDLNDN FFSWDLVLPD GLEPEDVEVV VDNSDIPSIT
VDDMDAGFLS DPAIGSWIRY ENPDYIGNDY YTIRYSGKPT TATWQTDITN SGEYEVLISV
PFGINRTREA NYTVFHDNGS NEQAISQRAD THQEWISLGS YNFTDTAKIM VISSQAFTHL
AVDAVMLVPK FDTSANLPVA FSSTGPWQTV SSSQAYPDAT GNSHCLVTAE TGTPCSATWQ
ANNLDPSITY DVYARWVEGS DRSTAVEYNI DHIDGSQAVT VNQRLNGGAW CLLASGLGFG
CTGTVTLSHT PTDLASDSAC ADAVLFVKSG LRLTGDTIFA HYFIQKSPDD VFLIDLNGGI
AYFRALLAPD GKTVEALTQI TEEEAGAEGL VTGRTYLEER QNFANWYQFY RRRYFTGIAT
LGQMINDWSN VFLRITSFPP DNFFRRISPV DVTIRNTDGE LTHYNEKDDI LYDLYTMMHP
WGGTFLREGL YQAGQFFELG RSDFWPEDRW ENLPPGAFQV YSSPDYYPFF TPENGGAYQQ
AFTVVMTDGL WNGTLRYPVG NVDGGWMYNP PFTGGVFGDD YSDTAADVAM YFYCRDLRPI
LNDIVPPGDF DFAPHQHMVT YAVALGVRGA HFSEAYRQMT EPYLRYGAEP PSWIGWPQIV
PDTNSTIDDL WHATINGRGS FFSAMDVNNL AAAMAAIRQD ITQRLEIPST PANLINWNGN
LVADFGDNGL WYHNGSNWNW MTNRGHVNQM VVWDGKLVVD FGADHGMHYY DGSWHWMTNK
DGVAMMTVWG NKLVVDFGNG RSVYNYDGAW HWMSNKDDVA DMTVWNNKLV VDFGGGRSVY
NYDGAWHWMS NKDDVADMTV WNNKLVVDFG GGRGMFNNDG TWHWMTNKDD VAMMTVWGNK
LVVDFGNGRS VYNYDGSWHW MTNKDDVAKM VTWKDGADKL AVDFGSGRGM FYYDGVWHWM
SNKDDITDMI AWGTCLAVDF GSGRGMYNYD GAWHWMKNWS TAD