Gene Dole_2755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2755 
Symbol 
ID5695611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3321610 
End bp3324300 
Gene Length2691 bp 
Protein Length896 aa 
Translation table11 
GC content61% 
IMG OID641265368 
ProductCBS domain-containing protein 
Protein accessionYP_001530635 
Protein GI158522765 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[R] General function prediction only 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
[COG0618] Exopolyphosphatase-related proteins
[COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.1635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCAGA CAGCAGACAA AGACGGCCTC ACGGTCGTCT CCACCCACAT CAACGCCGAC 
TTTGACGCCA TCGCCTCGGT GCTGGCGGCC CAGAAGCTCT ACCCCGGCTC CATCGTGGTG
CTGCCCGGCT CCAGCGAAAA GAACCTGCGC AACTTCTTCA TCAATTCCAT GGCCTACCTG
TTCAACATGA GCGACATCGG CCAGATCGAC GGGCCAAAGG TCTCCCGGCT GGTGCTGGTG
GACACCAGCC AGAAGGACCG CATCGGGAAA GCGGCCGACC TGCTGGCCAA CCCGGGCCTG
GAGGTCCATG TGTACGACCA CCACCCGGCC GCCGACGGTG ACGTGACCGC CGACCTTCGG
GTCTACGAGG CCACCGGGGC AACGGTGTCG ATTCTGGCAA AAATGCTTCA GGAGCAAAAC
ATCCCCATCT CACCGGACGA AGCCACGGTG ATGTGCCTGG GCATCTATGA GGACACCGGC
AGCTTCACCT TTACCTCCAC CACGCCCAAG GATTTTAGCG CCGCCGCCTT TTTTCTTGAA
AAAGGGGCCA GCATCAACAC CATCGCCAAC ATCGTCTCCC GGGAGATGAC CCCGGAGCAG
GTGGGCATTC TGAACAACAT GATCAACAAC TCCCGGACCC ACAAGATCAA CGGCATGGAC
GTGACCCTTG CCTCCATCTA CACCGAGGAG TATGTGCCGG ACTTTGCCTT CCTGGTTCAC
AAGATGCAGA AGATGAAGGG CATCAACGTT TTGTTCGCCC TGGCCCAGAT GGGCAACAAG
GTCTATATCG TGGGCCGCTC CAAGGCGGAC GAGGTGGATG CCGGCCAGAT TCTCAACCCC
TTTGGCGGCG GGGGCCACCC CTTTGCCGCT TCGGCCAGCA TCAAGCACAT GGCCCTGCCC
CAGGTGGAGC AGGAGCTGCT GGCCATCCTG CGCATGCAGG TCCGCAAAAC CACCCTGGCC
AGGGAAATCA TGTCCACGCC GGTGGTCGCG GCCACCCCGG ACATCTCCTG CCGGGCCGCC
GGGGAGCTTT TGACCCGCTA CAACATAAAC GCCCTGCTGG TCACCGAAAA ACCGGAGGCC
AAAGGAAGGC TCCTGGGGTT TATCACGCGC CAGGTGATAG AAAAGATCCT CTATCACAAG
CTGGAGGAAG CGCCGGTAAG TGAATACATG AACACCGACC TCTCTCTGGC CGGGCCCGAC
GACGAGCTGG CCGATATTCA GCGCAAGATC ATTGAAAACA GCCAGCGCCT GCTGCCCGTG
GTGGAAAACG GGGCCGTCAT CGGTGTGATC ACCCGCACCG ACCTGCTCAA CACCCTGGTC
TACCAGCGGG AGGCGGGCAA CCAGCGACAG CCGGCCCCCA CCCAGATTCA GGCCCATCCC
AAGACCCGGG ACATCAAACG GATGATGAAC GAGCGGCTGA CGCCCCCTGT TCTGGACATT
CTCAAAAACG CGGGCAACAC CGCCGCGGAG CTGGAATACA GCGCCTACGT GGTGGGCGGA
TTCGTGCGGG ACCTGTTTCT TTCCCGGTCC ACCGAGGATG TGGACATCGT GATCGAAGGC
GACGGCATCG CCTTTGCCAG GGAGTTTGCC GGCCGAATGA AGGCCCGGGT CCACTACTAC
AAAAAGTTCG GCACCGCGGT GATCACCTTT GCCGACGGTT CCAAGATCGA CGTGGCCTCG
GCCCGGCTGG AGTATTACCA GTTTCCCGCG GCCCTGCCCA CCGTGGAGAT GAGCTCCATC
AAGCTGGATC TGTTCCGCCG GGATTTTACC ATCAACACCC TGGCCGTTTG CCTGAACCCG
GACAAGTTCG GCCTGCTGGT GGACTTTTTC TCGGCCCAGC GGGACATCAA GGAAAAGACC
ATCCGCGTGC TGCACAGCTT AAGCTTTGTG GAAGACCCCA CCCGCATCTT CCGGGCCGTG
CGGTTTGAGC AGCGGTTCGG GTTTACCATC GGCAAGATGA CTGAAGGGCT GATCAAAAAC
GCGGTAAAAA TGGAGTTCTT CCGGCGCTTA AGCGGCCACC GGGTCTTTGG CGAGCTGCGG
CAGATCCTGG AAGAGGATGA TCCGGTGCCG GCCATTGAGC GGCTGGCCGA GTTCAATCTG
CTGGTCTCCC TGCACGAGGC CCTGAAAATC GACAAGAAGA CCGTGGCCGC GCTGCACGCC
ACCCGGGAGG TGGTATCGTG GTACGACCTG CTGTTCGTGG ACAAGCCCTA CATGAAGTGG
ATGGTCTACC TGATGGTCCT GATGCGGGGC ATGGCCCAGC AGACCACCGA GGATCTGTGC
GACCGGCTGG AGCTGCCGCC CCGCCACCGG GAGATGGCCG ACGCCGGCCG GCGCGAGGCA
GACACCTTTC TCCACTGGAT TCAGCGCAAT CCGGGGATCA AAAACAGCGA GCTCTACCAG
CGGCTGTTCG GCTTCAGGGT GGAGCAGATG CTGTATGTCA TGTCCGTGAC CGACAGCGAC
ACCGTGAAAA AACACATCTC CCGCTACATC CTGACCCTGC AGCACGTTGC GCCCCTGATC
AAGGGCAAGG ACTTAAACGA GATCGGCATC GCCCCGGGCC CGCTCTACAG CGAAATTTTA
AGAAAGATCC TCTACGCCCG GCTGGATGAA AAGGTCCGCA CCCGGGAGGA CGAGCTGGAA
TTTGCCATGC GCTACGCAAA TGACCCCGAC GGCTGGTGGA AACGCAGGTA G
 
Protein sequence
MNQTADKDGL TVVSTHINAD FDAIASVLAA QKLYPGSIVV LPGSSEKNLR NFFINSMAYL 
FNMSDIGQID GPKVSRLVLV DTSQKDRIGK AADLLANPGL EVHVYDHHPA ADGDVTADLR
VYEATGATVS ILAKMLQEQN IPISPDEATV MCLGIYEDTG SFTFTSTTPK DFSAAAFFLE
KGASINTIAN IVSREMTPEQ VGILNNMINN SRTHKINGMD VTLASIYTEE YVPDFAFLVH
KMQKMKGINV LFALAQMGNK VYIVGRSKAD EVDAGQILNP FGGGGHPFAA SASIKHMALP
QVEQELLAIL RMQVRKTTLA REIMSTPVVA ATPDISCRAA GELLTRYNIN ALLVTEKPEA
KGRLLGFITR QVIEKILYHK LEEAPVSEYM NTDLSLAGPD DELADIQRKI IENSQRLLPV
VENGAVIGVI TRTDLLNTLV YQREAGNQRQ PAPTQIQAHP KTRDIKRMMN ERLTPPVLDI
LKNAGNTAAE LEYSAYVVGG FVRDLFLSRS TEDVDIVIEG DGIAFAREFA GRMKARVHYY
KKFGTAVITF ADGSKIDVAS ARLEYYQFPA ALPTVEMSSI KLDLFRRDFT INTLAVCLNP
DKFGLLVDFF SAQRDIKEKT IRVLHSLSFV EDPTRIFRAV RFEQRFGFTI GKMTEGLIKN
AVKMEFFRRL SGHRVFGELR QILEEDDPVP AIERLAEFNL LVSLHEALKI DKKTVAALHA
TREVVSWYDL LFVDKPYMKW MVYLMVLMRG MAQQTTEDLC DRLELPPRHR EMADAGRREA
DTFLHWIQRN PGIKNSELYQ RLFGFRVEQM LYVMSVTDSD TVKKHISRYI LTLQHVAPLI
KGKDLNEIGI APGPLYSEIL RKILYARLDE KVRTREDELE FAMRYANDPD GWWKRR