Gene Information |
Locus tag | Dole_2755 |
Symbol | |
ID | 5695611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3321610 |
End bp | 3324300 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641265368 |
Product | CBS domain-containing protein |
Protein accession | YP_001530635 |
Protein GI | 158522765 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [R] General function prediction only |
COG ID | [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase [COG0618] Exopolyphosphatase-related proteins [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | |
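The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the reported Gene Length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3321610
END_BP = 3324300


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2691, matches "Gene Length"
print(protein_length(length_bp))  # 896, matches "Protein Length"
```

Under those assumptions, both derived values agree with the Gene Length (2691 bp) and Protein Length (896 aa) rows.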
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.1635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACCAGA CAGCAGACAA AGACGGCCTC ACGGTCGTCT CCACCCACAT CAACGCCGAC TTTGACGCCA TCGCCTCGGT GCTGGCGGCC CAGAAGCTCT ACCCCGGCTC CATCGTGGTG CTGCCCGGCT CCAGCGAAAA GAACCTGCGC AACTTCTTCA TCAATTCCAT GGCCTACCTG TTCAACATGA GCGACATCGG CCAGATCGAC GGGCCAAAGG TCTCCCGGCT GGTGCTGGTG GACACCAGCC AGAAGGACCG CATCGGGAAA GCGGCCGACC TGCTGGCCAA CCCGGGCCTG GAGGTCCATG TGTACGACCA CCACCCGGCC GCCGACGGTG ACGTGACCGC CGACCTTCGG GTCTACGAGG CCACCGGGGC AACGGTGTCG ATTCTGGCAA AAATGCTTCA GGAGCAAAAC ATCCCCATCT CACCGGACGA AGCCACGGTG ATGTGCCTGG GCATCTATGA GGACACCGGC AGCTTCACCT TTACCTCCAC CACGCCCAAG GATTTTAGCG CCGCCGCCTT TTTTCTTGAA AAAGGGGCCA GCATCAACAC CATCGCCAAC ATCGTCTCCC GGGAGATGAC CCCGGAGCAG GTGGGCATTC TGAACAACAT GATCAACAAC TCCCGGACCC ACAAGATCAA CGGCATGGAC GTGACCCTTG CCTCCATCTA CACCGAGGAG TATGTGCCGG ACTTTGCCTT CCTGGTTCAC AAGATGCAGA AGATGAAGGG CATCAACGTT TTGTTCGCCC TGGCCCAGAT GGGCAACAAG GTCTATATCG TGGGCCGCTC CAAGGCGGAC GAGGTGGATG CCGGCCAGAT TCTCAACCCC TTTGGCGGCG GGGGCCACCC CTTTGCCGCT TCGGCCAGCA TCAAGCACAT GGCCCTGCCC CAGGTGGAGC AGGAGCTGCT GGCCATCCTG CGCATGCAGG TCCGCAAAAC CACCCTGGCC AGGGAAATCA TGTCCACGCC GGTGGTCGCG GCCACCCCGG ACATCTCCTG CCGGGCCGCC GGGGAGCTTT TGACCCGCTA CAACATAAAC GCCCTGCTGG TCACCGAAAA ACCGGAGGCC AAAGGAAGGC TCCTGGGGTT TATCACGCGC CAGGTGATAG AAAAGATCCT CTATCACAAG CTGGAGGAAG CGCCGGTAAG TGAATACATG AACACCGACC TCTCTCTGGC CGGGCCCGAC GACGAGCTGG CCGATATTCA GCGCAAGATC ATTGAAAACA GCCAGCGCCT GCTGCCCGTG GTGGAAAACG GGGCCGTCAT CGGTGTGATC ACCCGCACCG ACCTGCTCAA CACCCTGGTC TACCAGCGGG AGGCGGGCAA CCAGCGACAG CCGGCCCCCA CCCAGATTCA GGCCCATCCC AAGACCCGGG ACATCAAACG GATGATGAAC GAGCGGCTGA CGCCCCCTGT TCTGGACATT CTCAAAAACG CGGGCAACAC CGCCGCGGAG CTGGAATACA GCGCCTACGT GGTGGGCGGA TTCGTGCGGG ACCTGTTTCT TTCCCGGTCC ACCGAGGATG TGGACATCGT GATCGAAGGC GACGGCATCG CCTTTGCCAG GGAGTTTGCC GGCCGAATGA AGGCCCGGGT CCACTACTAC AAAAAGTTCG GCACCGCGGT GATCACCTTT GCCGACGGTT CCAAGATCGA CGTGGCCTCG GCCCGGCTGG AGTATTACCA GTTTCCCGCG GCCCTGCCCA CCGTGGAGAT GAGCTCCATC AAGCTGGATC TGTTCCGCCG GGATTTTACC ATCAACACCC TGGCCGTTTG CCTGAACCCG 
GACAAGTTCG GCCTGCTGGT GGACTTTTTC TCGGCCCAGC GGGACATCAA GGAAAAGACC ATCCGCGTGC TGCACAGCTT AAGCTTTGTG GAAGACCCCA CCCGCATCTT CCGGGCCGTG CGGTTTGAGC AGCGGTTCGG GTTTACCATC GGCAAGATGA CTGAAGGGCT GATCAAAAAC GCGGTAAAAA TGGAGTTCTT CCGGCGCTTA AGCGGCCACC GGGTCTTTGG CGAGCTGCGG CAGATCCTGG AAGAGGATGA TCCGGTGCCG GCCATTGAGC GGCTGGCCGA GTTCAATCTG CTGGTCTCCC TGCACGAGGC CCTGAAAATC GACAAGAAGA CCGTGGCCGC GCTGCACGCC ACCCGGGAGG TGGTATCGTG GTACGACCTG CTGTTCGTGG ACAAGCCCTA CATGAAGTGG ATGGTCTACC TGATGGTCCT GATGCGGGGC ATGGCCCAGC AGACCACCGA GGATCTGTGC GACCGGCTGG AGCTGCCGCC CCGCCACCGG GAGATGGCCG ACGCCGGCCG GCGCGAGGCA GACACCTTTC TCCACTGGAT TCAGCGCAAT CCGGGGATCA AAAACAGCGA GCTCTACCAG CGGCTGTTCG GCTTCAGGGT GGAGCAGATG CTGTATGTCA TGTCCGTGAC CGACAGCGAC ACCGTGAAAA AACACATCTC CCGCTACATC CTGACCCTGC AGCACGTTGC GCCCCTGATC AAGGGCAAGG ACTTAAACGA GATCGGCATC GCCCCGGGCC CGCTCTACAG CGAAATTTTA AGAAAGATCC TCTACGCCCG GCTGGATGAA AAGGTCCGCA CCCGGGAGGA CGAGCTGGAA TTTGCCATGC GCTACGCAAA TGACCCCGAC GGCTGGTGGA AACGCAGGTA G
|
Protein sequence | MNQTADKDGL TVVSTHINAD FDAIASVLAA QKLYPGSIVV LPGSSEKNLR NFFINSMAYL FNMSDIGQID GPKVSRLVLV DTSQKDRIGK AADLLANPGL EVHVYDHHPA ADGDVTADLR VYEATGATVS ILAKMLQEQN IPISPDEATV MCLGIYEDTG SFTFTSTTPK DFSAAAFFLE KGASINTIAN IVSREMTPEQ VGILNNMINN SRTHKINGMD VTLASIYTEE YVPDFAFLVH KMQKMKGINV LFALAQMGNK VYIVGRSKAD EVDAGQILNP FGGGGHPFAA SASIKHMALP QVEQELLAIL RMQVRKTTLA REIMSTPVVA ATPDISCRAA GELLTRYNIN ALLVTEKPEA KGRLLGFITR QVIEKILYHK LEEAPVSEYM NTDLSLAGPD DELADIQRKI IENSQRLLPV VENGAVIGVI TRTDLLNTLV YQREAGNQRQ PAPTQIQAHP KTRDIKRMMN ERLTPPVLDI LKNAGNTAAE LEYSAYVVGG FVRDLFLSRS TEDVDIVIEG DGIAFAREFA GRMKARVHYY KKFGTAVITF ADGSKIDVAS ARLEYYQFPA ALPTVEMSSI KLDLFRRDFT INTLAVCLNP DKFGLLVDFF SAQRDIKEKT IRVLHSLSFV EDPTRIFRAV RFEQRFGFTI GKMTEGLIKN AVKMEFFRRL SGHRVFGELR QILEEDDPVP AIERLAEFNL LVSLHEALKI DKKTVAALHA TREVVSWYDL LFVDKPYMKW MVYLMVLMRG MAQQTTEDLC DRLELPPRHR EMADAGRREA DTFLHWIQRN PGIKNSELYQ RLFGFRVEQM LYVMSVTDSD TVKKHISRYI LTLQHVAPLI KGKDLNEIGI APGPLYSEIL RKILYARLDE KVRTREDELE FAMRYANDPD GWWKRR
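The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below is a minimal, dependency-free illustration of that relationship and not the method used to produce this record. The helper name `translate_cds` is hypothetical, and the start-codon set is the one listed for NCBI translation table 11 as best recalled here, so treat it as an assumption to verify.

```python
# Standard amino-acid assignments, which table 11 shares, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumed start codons for translation table 11 (bacterial, archaeal, plastid).
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(grouped_dna: str) -> str:
    """Translate a CDS given as space-separated blocks of bases.

    The first codon, if it is a recognised start codon, is read as M.
    Translation stops at the first stop codon; a trailing partial codon
    is ignored.
    """
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "GTGAACCAGA CAGCAGACAA AGACGGCCTC"
print(translate_cds(first_blocks))  # MNQTADKDGL
```

The output matches the first block of the protein sequence. Passing the full gene sequence, with its block spacing left in place, should reproduce the complete protein under the same assumptions.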