Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_1972 |
Symbol | |
ID | 3756978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 2015901 |
End bp | 2018597 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637782858 |
Product | CBS |
Protein accession | YP_388464 |
Protein GI | 78357015 |
COG category | [J] Translation, ribosomal structure and biogenesis [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase [COG0618] Exopolyphosphatase-related proteins [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.03578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTT CACCCCCTCT TATAGCCGCC CCCGTGGTTG TGACCGCCCA TGCCAATGCC GATTTTGACG CGCTGGCTTC CATTGTCGCC GCAGGCAGAC TATACCCCGG TGCCACACTG CTTTTCCCCG GCAGCCAGGA ACGCACGCTG AGACACTTCT TCATCAAAAG CGCCACCTAT CTTTTCAACT TCCGGCAGGC AAAGGACATT GATGTAACCA GCGTTGAAAA GCTGGTCATA GTCGACACCC GCCAGCACTC AAGGGTGCCT CATGTTCATG CGATACTTGA ACGTCCGGAA GTTGAAGTAC ACCTTTATGA CCACCACCCC GACACGGACG AGGACATACA GGGAAGCCTT GTGACGGTCC GCCAGTGGGG AGCCACAGCC ACCATCCTGA CCACCGAAAT ACAACAGGCC GGACTCGCGC TTACTCCGGA TGAAGCTACC ATCATCGGGC TGGGTATTTT TGAGGATACC GGCTCCTTCA CCTTTGCCTC AACCACCGAA CATGACTTTG CCGCCGCAGG CTGGCTGAAA ACGCAGGGAA TGGACCTGAA CACCATTTCT GAACTGCTTA CCCGTGATCT GACCAGCGAA CAGATATCCG CCCTTAATGA ACTGCTTGAA ACAGCTCACA CGCACGACAT CAACGGTGTT CCCGTTGTGA TAGCCGAAGC CAGCATGGAT GAATACATGG GCGACTTTGC CCTGCTGGCC CACAAGCTGA TGGATATGGA AAACATCAGC GTGCTCTTTG CTTTGGGACG CATGGAAGAC AGGGTACAGA TAGTGGCCCG CGCACGCTCT TCAGACGTTG ATGTGGGAGC CATATGCTCC GGTTTCGGCG GTGGCGGACA CACATACGCA GCTTCAGCAT CCGTCAAATT CAAAACACTT TCTCAAGTAA AGGATGAACT TTTTACGCTT CTCTATTCCG CGGTCAACAA ACAGCTGACT GTCGGCTCCA TCATGTCTGC GCCGGTCATT GCCGAACCGG AAACGGCAAC CATTGCCAAA GCGGCGGAAA CCATGCAGCG CTTCGGCCTG AAGGCCGTGC CGGTGGTGAT GCCGGGTACC GCACGCTGTA CGGGCTACAT AGAATACCAG ACCGCCACCC GCGCAGTCGC TCACGGGCTG GGCGCAGCCC CCGTGACCGA CTACATGAAC CGCCATGTGC ATACTGCATC GCCCGAAGAC GACCTGCACA CCATGATGGA AATAATAGTA GGTCAGCGTC AGCGTCTGGT GCCCGTGGTG GACGAAAACG GCAACGCCAT CGGAGTGATC ACCCGGACGG ATCTCATCAA TACAATAATT GAAGAGCCCG CCCGGATTCC AGAAACGCTT CAGCCCGACA GAAAGCGTGA CCGGAACATC CGTTCGCTGA TACGTGAACG GCTGCCCCGC ATGCACTGCG GCTGGCTGGA AGCCGCAGGA AAGCTGGGGG ATGTTTCAGG GGTCGCTGTA TACGCTGTTG GCGGCTTTAT ACGCGACCTG CTGCTGGGCA GGCCCAATCT GGACCTTGAT CTGGTTGTTG AAGGAGACGG GATAACATTT GCCCGCAGAC TGGCAAAACA ACTCAACGGA CGCATACGCG AGCACCAGAA GTTCAAAACC GCAGTTGTAA TCTTTGATGA CGAAAACGGT GTAGAAAGCC GGATAGATGT GGCTACGGCC AGACTGGAAT ATTATGAATA CCCTGCGGCC CTGCCCACTG TAGAGCTATC TTCCATCAAG ATGGATCTCT TCCGGCGGGA TTTCACCATT AACGCTCTGG CAGTTCAGCT CAATGAAAGC AGTTTCGGCA GACTGGTGGA CTTTTTCGGC TCGCAGCAGG ATATCAAAGA CAAGCGTATC CGGGTACTCC ATTCGCTGAG CTTCGTGGAA GACCCCACCA GGATTCTTAG AGCCATCCGC TTTGAAAAAC GATACAATTT TGCTATAGGC CCGCAGACTT CACGGCTCAT CAAAAACGCT CTCGGTCTCG GACTCATGGA AAAGTTGTCC GGTTCACGCC TGTTCCACGA ACTGCGCCTT ATCACTGAAG AGGCCTCCCC GCTTGCCTGC TTTAAAAGGA TGGAAGAGTT CGACCTGTTG CGCAGTGTGC ACCCGGTGCT GGCACTGAGC TATGGCAAAG AACAAATTAT CGCCGACGTG GAAAAGGTGC TCGGCTGGCA CAGGCTGCTG TATCAGGACC CTGCGGCGGA AGCATGGACT GTGCATCTGC TTGCCTTGTG CCACAATGCC AAATATGCCG ACGTGCGCGA TCTGCTTAAA AGACTGTATC TGCCGCGTAA GCAGAGCCGT GATTTTATGC TGCTGCGTGA AAACACCCGT GAAGCGGCGG CCCTGCTTTC CACATGGCTG AAAAAAGACC GTTCCATGAG CGGACTTTAC AACATCCTGC ATTCTTTGCC GCCCGAAGGC CTTCTTTACA TCATGGCAAG AACACGTGTG GATGAAATCC GCAAATTCGT CTCGCATTAC CTCACCAAGC TGCGGGATAT CCGGACAGAC ATCAGCGGCG AAGACCTGCA GGCCATGGGC GGGCACCCCG GCCCTTGCTT CGGCAAGGTG CTGAACCATG TGCTGGCAGC CAAACTGGAC GGAGCAGCCC ACAGCCGTAA AGAACAGCTG GAGCTGGCCC ACAGCATGCT GCGACGCTAT AATTACGGCC GCGACGAACA GCAATAG
|
Protein sequence | MTTSPPLIAA PVVVTAHANA DFDALASIVA AGRLYPGATL LFPGSQERTL RHFFIKSATY LFNFRQAKDI DVTSVEKLVI VDTRQHSRVP HVHAILERPE VEVHLYDHHP DTDEDIQGSL VTVRQWGATA TILTTEIQQA GLALTPDEAT IIGLGIFEDT GSFTFASTTE HDFAAAGWLK TQGMDLNTIS ELLTRDLTSE QISALNELLE TAHTHDINGV PVVIAEASMD EYMGDFALLA HKLMDMENIS VLFALGRMED RVQIVARARS SDVDVGAICS GFGGGGHTYA ASASVKFKTL SQVKDELFTL LYSAVNKQLT VGSIMSAPVI AEPETATIAK AAETMQRFGL KAVPVVMPGT ARCTGYIEYQ TATRAVAHGL GAAPVTDYMN RHVHTASPED DLHTMMEIIV GQRQRLVPVV DENGNAIGVI TRTDLINTII EEPARIPETL QPDRKRDRNI RSLIRERLPR MHCGWLEAAG KLGDVSGVAV YAVGGFIRDL LLGRPNLDLD LVVEGDGITF ARRLAKQLNG RIREHQKFKT AVVIFDDENG VESRIDVATA RLEYYEYPAA LPTVELSSIK MDLFRRDFTI NALAVQLNES SFGRLVDFFG SQQDIKDKRI RVLHSLSFVE DPTRILRAIR FEKRYNFAIG PQTSRLIKNA LGLGLMEKLS GSRLFHELRL ITEEASPLAC FKRMEEFDLL RSVHPVLALS YGKEQIIADV EKVLGWHRLL YQDPAAEAWT VHLLALCHNA KYADVRDLLK RLYLPRKQSR DFMLLRENTR EAAALLSTWL KKDRSMSGLY NILHSLPPEG LLYIMARTRV DEIRKFVSHY LTKLRDIRTD ISGEDLQAMG GHPGPCFGKV LNHVLAAKLD GAAHSRKEQL ELAHSMLRRY NYGRDEQQ
|
| |