Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0130 |
Symbol | |
ID | 8417934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 170552 |
End bp | 172195 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645036695 |
Product | choline dehydrogenase |
Protein accession | YP_003197010 |
Protein GI | 258404268 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000105235 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.777409 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAAA AAAAATACGA TTACATCATC GTTGGCGGGG GTTCTGCCGG AAGTGTGTTG GCCAATCGGC TGAGCGCCAA CCCCAAAAAC AAGGTCCTCG TCCTCGAAGC GGGGCTTCCC GATTACCGTC TTGATTTCCG CATCCACATG CCCGCGGCGC TGACCTACCC CTTGCAAGGG AAGACCTACA ATTGGTGGTA CGAATCCGAT CCCGAGCCGT ACATGCACAA CCGGCGCATC TATCAACCCC GCGGCAAGGT CCTGGGAGGG TCGAGCTGTA TCAACGGCAT GATCTATATC CGCGGCAACG CCATGGATTA CGAAAAATGG GCCAGCTTTG AGGGATTGGA AGACTGGGAT TACGCCCGCT GCCTGCCCTA TTTCAACCGC GCCGAATACC GGCTCAGTGG TGCGGACGCC TACCAGGGCG TCGGCGGCCC CCTGTACCTG ACCACGCCGG AATGCGACAA TCCCCTGTTC GAAGCCTTTT TCAAGGCCGT CCAGCAAGCC GGGCACCCTG TTGTGGACAA TGTCAACGGC TACCGGCAGG AAGGATTTTC CAAATTTGAC GCCAATATCT ACCGCGGCCG GCGGTGGAAC GCGGCCCGGG CCTACGTGCA CCCGGTCAAA AACCGCAAGA ACCTGGACAT CAGGTGCCGG GCGATGAGTA CCCGGATCCT GTTCGAAGGC AAGAAGGCGA TCGGGGTAGA ATACAAGAAG GGCAACACTA CCCATAAAGT CTACGGCGGC GAGATCATCA GCTGCGGTGG GGCCATCAAT TCGCCCCAGC TCTTGCAGCT CTCCGGCGTC GGCGCCGGGG ATCACCTGCG CCAGCTCGGC ATCGACGTGG TCCAGGACCT GCCCGGAGTC GGTGAAAACC TGCAGGACCA CCTCGAACTC TATGTCCAAT GGGCGGCCAA AAAACCGGTC AGCATGTTCC CAGCCCTGAA GTGGTACAAC CAGCCCAAGA TCGGCATGGA ATGGCTCTTT GCCAACAAGG GAGCGGCCGC GACCAACCAT TTTGAGGCTG GCGGCTTTAT CCGCGGCAAC GACCAGGTCG ACTATCCGAA CCTGCAGTTC CACTTCCTGC CCTTGGCGAT CCGCTACGAC GGCACCGCAC CCAACGAAGG ACACGGCTTC CAGCTCCACG TCGGCCCCAT GAACTCCGAC GTCCGCGGTC GGGTCAAGAT TACCTCGGCC GACCCCGGGG ACTATCCGAG CATCCTGTTC AACTACCTCT CCACGGAACA GGAACGCCGT GAATGGGTTG AGGCCATACG CGCATCGCGC CACATCGTGG AACAGTCCGC TTTTGACGAA TTGCGGGGCA AGGAACTCGC TCCGGGCAGC GACGCCCAGA CCGACGAGGA GATCCTGGAC TTTGTTGCCC GGGAGGGCGA AAGCGCTTAC CATCCGAGTT GCACCTGCAA AATGGGCTAC GACGATATGG CCGTGGTCGA CAGTGATCTG CGCGTGCACG GCGTCGAAAA CCTCCGCGTT GTCGATGCCT CGATCATGCC CACCATCACC AACGGCAATA TCTACGCTCC GACAATGATG CTCGCGGAAA AGGCGGCGGA CAAAATCCTG GGCAACACCC CCCCGGAACC GGCGCAAGCC CCGTTTTACA AAACCGAAGT CTAG
|
Protein sequence | MAQKKYDYII VGGGSAGSVL ANRLSANPKN KVLVLEAGLP DYRLDFRIHM PAALTYPLQG KTYNWWYESD PEPYMHNRRI YQPRGKVLGG SSCINGMIYI RGNAMDYEKW ASFEGLEDWD YARCLPYFNR AEYRLSGADA YQGVGGPLYL TTPECDNPLF EAFFKAVQQA GHPVVDNVNG YRQEGFSKFD ANIYRGRRWN AARAYVHPVK NRKNLDIRCR AMSTRILFEG KKAIGVEYKK GNTTHKVYGG EIISCGGAIN SPQLLQLSGV GAGDHLRQLG IDVVQDLPGV GENLQDHLEL YVQWAAKKPV SMFPALKWYN QPKIGMEWLF ANKGAAATNH FEAGGFIRGN DQVDYPNLQF HFLPLAIRYD GTAPNEGHGF QLHVGPMNSD VRGRVKITSA DPGDYPSILF NYLSTEQERR EWVEAIRASR HIVEQSAFDE LRGKELAPGS DAQTDEEILD FVAREGESAY HPSCTCKMGY DDMAVVDSDL RVHGVENLRV VDASIMPTIT NGNIYAPTMM LAEKAADKIL GNTPPEPAQA PFYKTEV
|
| |