Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1514 |
Symbol | |
ID | 4029210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1721738 |
End bp | 1723420 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637966697 |
Product | choline dehydrogenase |
Protein accession | YP_573566 |
Protein GI | 92113638 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0264792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAGG CTCGTGAATA CGACTACATC ATCATCGGGG CCGGTTCCGC CGGCAACGTA CTCGCCACTC GCCTGACCGA GGATCCGGAC GTCCAGGTGC TGCTGCTCGA GGCCGGCGGT CCCGACTACC GCTTCGACTT CCGCACGCAG ATGCCGGCGG CGCTGGCCTA CCCCCTGCAG GGCAAGCGCT ACAACTGGGC GTTCGAGACC GACCCCGAAC CCTACATGAA CAATCGCCGC ATGGAGTGCG GACGCGGCAA GGGCCTGGGC GGGTCGTCGT TGATCAACGG CATGTGCTAC TTGCGCGGCA ACGCGCTGGA TTACGACAAC TGGGCCAAGA TACCGGGCCT GGAGGACTGG AACTACCTGC AGTGCCTGCC CTACTTCAAG CGCGCCGAGA CCCGCGACAT CGGCCCCAAC GATTATCATG GCGGTGACGG CCCGGTGTCG GTGGCCACAC CCAAGGAAGG CAACAACGAG CTCTACGGCG CCTTCATCCG CGCAGGCATC GAGGCCGGCT ATCCGGCCAC CGAGGACGTC AACGGCTATC AGCAGGAAGG CTTCGGCCCC ATGGACCGCA CGACGACGCC CAACGGACGT CGTGCCTCCA CGGCGCGCGG CTACCTGGAT ATCGCCAAGC AACGCCCCAA CCTGACCATC GAGACGCACG CCACGACCGA TGTCATCGAA TTCGAGGGCA AGCGCGCCGT CGGCGTGAGC TACGAGCGCA AGGGACAGGC CCAGCGTGTT CGCGCACGCC GCGAAGTGCT GCTGTGCGCG GGCGCCATCG CCTCGCCGCA GATCCTGCAG CGTTCCGGCG TGGGCAATCC CGAGCATCTC GAGGAATTCG ACATTCCCGT GGTGCACGAG CTGCCGGGCG TCGGCGAAAA CCTCCAGGAT CACCTGGAAA TGTACATTCA GTACGAGTGC AAGAAGCCCA TTTCGCTGTA CCCGGCGCTC AAGTGGTACA ACCAGCCCAA GATCGGTGCC GAGTGGCTGT TTTTCGGCAA GGGCATCGGC GCCAGCAACC AGTTCGAGGC GGCCGGCTTC ATTCGTACCA ACGACCAGGA AGAGTGGCCC AATCTGCAGT ACCACTTCTT GCCGATCGCC ATCAGCTACA ACGGCAAGAG CGCGGTGCAG GCCCACGGCT TCCAGGCCCA CGTCGGCTCC ATGCGCTCCA TGAGCCGCGG TCGCATTCGC CTGACATCGC GCGACCCCAA GGCCGCGCCG AGCATCCTGT TCAACTACAT GTCCCACGAC AAGGATTGGC AGGAATTCCG CGACGCCATT CGCATCACGC GCGAGATCAT CGAGCAGCCG ACGATGGACG AGTACCGCGG CCGCGAAATC TCGCCGGGGC CGAATGTGCA AAGCGACGCC GAGCTCGACG AGTTCGTGCG CCAGCACGCC GAGACCGCCT ATCACCCCGC CGGCTCCTGC AAGATGGGCA GTGCCGATGA CGCGATGGCG GTGGTCGATG GTGCGGGACG CGTGCATGGC CTCGAAGGGC TGCGTGTCAT CGATGCCTCG ATCATGCCCG TGATCGCCAC CGGCAACCTC AATGCGCCGA CGATCATGAT CGCCGAAAAG ATGGCCGACA AGGTTCGCGG TCGCGATCCG CTGCCGCCGG CCAAGGTCGA CTACTACGTG GCCAACGGTG CGCCGGCCCG CCGTCGGGCG TGA
|
Protein sequence | MTQAREYDYI IIGAGSAGNV LATRLTEDPD VQVLLLEAGG PDYRFDFRTQ MPAALAYPLQ GKRYNWAFET DPEPYMNNRR MECGRGKGLG GSSLINGMCY LRGNALDYDN WAKIPGLEDW NYLQCLPYFK RAETRDIGPN DYHGGDGPVS VATPKEGNNE LYGAFIRAGI EAGYPATEDV NGYQQEGFGP MDRTTTPNGR RASTARGYLD IAKQRPNLTI ETHATTDVIE FEGKRAVGVS YERKGQAQRV RARREVLLCA GAIASPQILQ RSGVGNPEHL EEFDIPVVHE LPGVGENLQD HLEMYIQYEC KKPISLYPAL KWYNQPKIGA EWLFFGKGIG ASNQFEAAGF IRTNDQEEWP NLQYHFLPIA ISYNGKSAVQ AHGFQAHVGS MRSMSRGRIR LTSRDPKAAP SILFNYMSHD KDWQEFRDAI RITREIIEQP TMDEYRGREI SPGPNVQSDA ELDEFVRQHA ETAYHPAGSC KMGSADDAMA VVDGAGRVHG LEGLRVIDAS IMPVIATGNL NAPTIMIAEK MADKVRGRDP LPPAKVDYYV ANGAPARRRA
|
| |