Gene Information |
Locus tag | Sare_4942 |
Symbol | |
ID | 5706492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5611099 |
End bp | 5612661 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641274337 |
Product | choline dehydrogenase |
Protein accession | YP_001539679 |
Protein GI | 159040426 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
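
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5611099
end_bp = 5612661

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon
# and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1563, matching "Gene Length"
print(protein_length)  # 520, matching "Protein Length"
```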
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.953244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGACT TCGTAGTGGT CGGCGGCGGT ACAGCGGGTT GCGTCCTGGC AAGCCGGCTA AGCGAGGACC CCTCCGTCAC GGTGTGCCTG GTCGAAGCCG GGCCAGCCGA CAATCACGAT AACTTTCGTA TCCCGGTAGC TGGCGGGAAG TTCTTCAAAA CACGGTTCGA CTGGGACTAC GACAGTCATC CTGAACAGTT CTGTGATGGC CGCCGTGTTT ACCTTCCGCA AGCGCGAGTG CTCGGTGGCG GAAGCTCGGT TAATGGCATG GTCTACATTC GCGGGAATCG CGCCGACTAC GACGAATGGC AGCAGCCGGG ATGGAGCTAC GACGAGTTAC TGCCGTTTTT CAAACGGTCC GAGGACAACG AGCGGGGCGC TGATGAGTTC CACGGGGCCG GTGGACCGAT GCGGGTCAGT GACGGACGCG CGCACAGCCC GAGCGCCATG GCCTTCACCC AGGCGGCACT CGACGCCGGC TACCCGGCCA ACCCCGACTT CAACGGCGCG GTCCAGGAGG GCTTCGGGGA GTACCAGGTG ACCCAGCGGG ACGGCCGTCG GGCCAGCGCG GTCACCGAGT TCCTGCATCC GGCGAGGCAC CGTCCGAACC TCGTCGTCGA AACTAATCTG CAGGTACAGC GGATCATGAT CGAGAACGGG CGGGCGGCCG GTGTGGTCGG CAACCGGTTC GACGACCTGG TCGAACTTCG GGCCGAGCGG GAGGTCATTG TCTCCGCAGG CACGTACAAC TCACCACACC TGCTCATGCT CTCCGGGATC GGGCCCGCCG ATCTACTGCG CGCCTTCGAG CTGCCGGTCT TTGTCGACCA GCCCCAGGTC GGGCAGAACC TCCAGGACCA CCCGCACATC TGGCTCAGCT ACCGCCACGA TCTGCCGGTG AGCCTACTGG CAGCGGCCGA GTCCGAGCGC GTCCACCAGT ACGAACGCGA TCGCACCGGC ATGCTCGCCT CGAACGGTCC GGAGAGCGGC GGCTTCGTCC GGACCAGTGC GGCGCTGGCC GGCCCCGACC TCCAGTTCAT CTGCCTGCCG ATGATGGTCG CGGACACCTT CCTCTCGCCA CCGACCGGGC ACGGAGTCTC CTTCGGTGCC TCGGTGATGA GGCCGGTGAG CAGCGGCCAC GTGACGCTGT TCAGCGGCGA GCCGACCGCC AAGCCCAAGA TCGTGCAGAA CTACCTCGCC GATCCCGCCG ACCTGCAGAC GGCGGTCAGC GGCCTGCGGA TCAGCCTGGA GCTGTCCCGC CAGGCCGCGC TGAAGCCCTA CGCCGTCGAG CCGTCCGCGG CGCCGAGTTC CGACACGGAA ACCGACCTGC GGGCGTATGC GCGCAGCCAC GTCCAGACCG GGCTGCATCC GGTCGGTACC TGCGCGATGG GCCGGGTCGT TGACGCGGAA CTGCGGGTGT TCGGAGTCGA CGGGCTGAGG GTCGTGGACG CCTCCGTCAT TCCCTTGATC ATCCGGGGTA ACACGAACGC GCCGGTGATG GCCGTGGCCG AGAGGGCGGC AGATCTCGTC CGCGGCGCAC AATCCCTGCC CGGCGCGAGG TAG
|
Protein sequence | MYDFVVVGGG TAGCVLASRL SEDPSVTVCL VEAGPADNHD NFRIPVAGGK FFKTRFDWDY DSHPEQFCDG RRVYLPQARV LGGGSSVNGM VYIRGNRADY DEWQQPGWSY DELLPFFKRS EDNERGADEF HGAGGPMRVS DGRAHSPSAM AFTQAALDAG YPANPDFNGA VQEGFGEYQV TQRDGRRASA VTEFLHPARH RPNLVVETNL QVQRIMIENG RAAGVVGNRF DDLVELRAER EVIVSAGTYN SPHLLMLSGI GPADLLRAFE LPVFVDQPQV GQNLQDHPHI WLSYRHDLPV SLLAAAESER VHQYERDRTG MLASNGPESG GFVRTSAALA GPDLQFICLP MMVADTFLSP PTGHGVSFGA SVMRPVSSGH VTLFSGEPTA KPKIVQNYLA DPADLQTAVS GLRISLELSR QAALKPYAVE PSAAPSSDTE TDLRAYARSH VQTGLHPVGT CAMGRVVDAE LRVFGVDGLR VVDASVIPLI IRGNTNAPVM AVAERAADLV RGAQSLPGAR
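
The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It is not the database's own tooling: the `translate` helper and its constants are hypothetical names introduced here, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), which is the property the sketch relies on. Only the first three 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string written in space-separated blocks.

    Whitespace is removed first; translation stops at the first stop codon,
    and any trailing partial codon is ignored.
    """
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
prefix = "ATGTATGACT TCGTAGTGGT CGGCGGCGGT"
print(translate(prefix))  # MYDFVVVGGG, the first block of the protein sequence
```

Passing the full gene sequence to the same helper should stop at the terminal TAG codon, which is why the protein is one residue shorter than the number of codons.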