Gene Dret_0130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0130 
Symbol 
ID8417934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp170552 
End bp172195 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content60% 
IMG OID645036695 
Productcholine dehydrogenase 
Protein accessionYP_003197010 
Protein GI258404268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000105235 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.777409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAA AAAAATACGA TTACATCATC GTTGGCGGGG GTTCTGCCGG AAGTGTGTTG 
GCCAATCGGC TGAGCGCCAA CCCCAAAAAC AAGGTCCTCG TCCTCGAAGC GGGGCTTCCC
GATTACCGTC TTGATTTCCG CATCCACATG CCCGCGGCGC TGACCTACCC CTTGCAAGGG
AAGACCTACA ATTGGTGGTA CGAATCCGAT CCCGAGCCGT ACATGCACAA CCGGCGCATC
TATCAACCCC GCGGCAAGGT CCTGGGAGGG TCGAGCTGTA TCAACGGCAT GATCTATATC
CGCGGCAACG CCATGGATTA CGAAAAATGG GCCAGCTTTG AGGGATTGGA AGACTGGGAT
TACGCCCGCT GCCTGCCCTA TTTCAACCGC GCCGAATACC GGCTCAGTGG TGCGGACGCC
TACCAGGGCG TCGGCGGCCC CCTGTACCTG ACCACGCCGG AATGCGACAA TCCCCTGTTC
GAAGCCTTTT TCAAGGCCGT CCAGCAAGCC GGGCACCCTG TTGTGGACAA TGTCAACGGC
TACCGGCAGG AAGGATTTTC CAAATTTGAC GCCAATATCT ACCGCGGCCG GCGGTGGAAC
GCGGCCCGGG CCTACGTGCA CCCGGTCAAA AACCGCAAGA ACCTGGACAT CAGGTGCCGG
GCGATGAGTA CCCGGATCCT GTTCGAAGGC AAGAAGGCGA TCGGGGTAGA ATACAAGAAG
GGCAACACTA CCCATAAAGT CTACGGCGGC GAGATCATCA GCTGCGGTGG GGCCATCAAT
TCGCCCCAGC TCTTGCAGCT CTCCGGCGTC GGCGCCGGGG ATCACCTGCG CCAGCTCGGC
ATCGACGTGG TCCAGGACCT GCCCGGAGTC GGTGAAAACC TGCAGGACCA CCTCGAACTC
TATGTCCAAT GGGCGGCCAA AAAACCGGTC AGCATGTTCC CAGCCCTGAA GTGGTACAAC
CAGCCCAAGA TCGGCATGGA ATGGCTCTTT GCCAACAAGG GAGCGGCCGC GACCAACCAT
TTTGAGGCTG GCGGCTTTAT CCGCGGCAAC GACCAGGTCG ACTATCCGAA CCTGCAGTTC
CACTTCCTGC CCTTGGCGAT CCGCTACGAC GGCACCGCAC CCAACGAAGG ACACGGCTTC
CAGCTCCACG TCGGCCCCAT GAACTCCGAC GTCCGCGGTC GGGTCAAGAT TACCTCGGCC
GACCCCGGGG ACTATCCGAG CATCCTGTTC AACTACCTCT CCACGGAACA GGAACGCCGT
GAATGGGTTG AGGCCATACG CGCATCGCGC CACATCGTGG AACAGTCCGC TTTTGACGAA
TTGCGGGGCA AGGAACTCGC TCCGGGCAGC GACGCCCAGA CCGACGAGGA GATCCTGGAC
TTTGTTGCCC GGGAGGGCGA AAGCGCTTAC CATCCGAGTT GCACCTGCAA AATGGGCTAC
GACGATATGG CCGTGGTCGA CAGTGATCTG CGCGTGCACG GCGTCGAAAA CCTCCGCGTT
GTCGATGCCT CGATCATGCC CACCATCACC AACGGCAATA TCTACGCTCC GACAATGATG
CTCGCGGAAA AGGCGGCGGA CAAAATCCTG GGCAACACCC CCCCGGAACC GGCGCAAGCC
CCGTTTTACA AAACCGAAGT CTAG
 
Protein sequence
MAQKKYDYII VGGGSAGSVL ANRLSANPKN KVLVLEAGLP DYRLDFRIHM PAALTYPLQG 
KTYNWWYESD PEPYMHNRRI YQPRGKVLGG SSCINGMIYI RGNAMDYEKW ASFEGLEDWD
YARCLPYFNR AEYRLSGADA YQGVGGPLYL TTPECDNPLF EAFFKAVQQA GHPVVDNVNG
YRQEGFSKFD ANIYRGRRWN AARAYVHPVK NRKNLDIRCR AMSTRILFEG KKAIGVEYKK
GNTTHKVYGG EIISCGGAIN SPQLLQLSGV GAGDHLRQLG IDVVQDLPGV GENLQDHLEL
YVQWAAKKPV SMFPALKWYN QPKIGMEWLF ANKGAAATNH FEAGGFIRGN DQVDYPNLQF
HFLPLAIRYD GTAPNEGHGF QLHVGPMNSD VRGRVKITSA DPGDYPSILF NYLSTEQERR
EWVEAIRASR HIVEQSAFDE LRGKELAPGS DAQTDEEILD FVAREGESAY HPSCTCKMGY
DDMAVVDSDL RVHGVENLRV VDASIMPTIT NGNIYAPTMM LAEKAADKIL GNTPPEPAQA
PFYKTEV