Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1882 |
Symbol | |
ID | 4077379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1980763 |
End bp | 1982418 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007198 |
Product | choline dehydrogenase |
Protein accession | YP_613877 |
Protein GI | 99081723 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGG ATTATGTGAT TGTTGGGGCG GGCAGCGCAG GCTGCGCCAT GGCCTACCGT CTGAGCGAGG CGGGCAAATC GGTGCTGGTG ATCGAGCATG GTGGCACGGA TGCGGGCCCC TTCATCCAGA TGCCCGGCGC ATTGAGCTAC CCCATGAACA TGTCGATGTA TGACTGGGGT TACAAATCCC AGCCCGAGCC GCATCTGGGC GGGCGCGAGC TGGTGACGCC GCGCGGCAAG GTCATTGGCG GATCCTCTTC GATCAACGGC ATGGTCTATG TGCGCGGCCA CGCGGGCGAC TATAACCACT GGGCCGAAAC GGGCGCGACC GGCTGGTCCT ATGCCGACGT GCTGCCCTAT TTCAAACGTA TGGAAACCTG GGATGATCGC GGCCATGGCG GCGATCCCGA CTGGCGCGGC ACCGACGGCC CGCTGCATGT CACCCGTGGC CCCCGCGACA ACCCGCTACA TGATGCCTTT GTGAAGTCCG GGCAGCAGGC GGGGTATCCG GTCACCAAGG ATTATAACGG CCAGCAGCAA GAGGGCTTTG GCCCGATGGA GATGACCGTC CACAAGGGCC GCCGCTGGTC TGCCGCCAAT GCCTATCTGA AACCTGCGCT CAAGCGCGAC AATTGCGATC TGATCCGCGC GCTGGCCCGC AAGGTGGTGA TCGAGGATGG CCGCGCCGTC GGTGTCGAAG TCGAGCGCGG CGGCAAGATC GAAGTCATCC GCGCCAATAT CGAGGTGATC CTCGCGGCGT CTTCGCTCAA CTCGCCCAAG CTCCTGATGC TCTCGGGCAT TGGCCCCGCC GCACATCTGG CCGAACATGG CATCGACGTC ATCGCGGACC GGCCCGGCGT TGGCCAGAAC CTGCAGGACC ATCTGGAGTT CTATTTCCAG TTTGCCTCCA AGAAGCCGAT CACGCTCTAT AAATACTGGA ACCTCTTCGG CAAGGCCTTG GTCGGGGCGC AGTGGCTCTT TACCAAGACC GGGCTCGGGG CCTCGAACCA GTTCGAGAGC GCGGCCTTCA TTCGCTCGGA CAAGGGGATC GACTATCCCG ACATCCAGTA TCACTTCCTG CCGATCGCCG TGCGCTATGA CGGGCAGGCG GCGGCCGAGG GCCACGGCTT TCAGGCCCAT GTCGGCCCGA TGCGCTCGCA GTCGCGCGGC GAGGTAACGC TGGCCAGCGC CGATCCCAAC GCCGCGCCAA AGATCCTGTT CAACTACATG TCTACCGAGC AGGACTGGAT CGATTTCCGC AAATGCGTCC GCCTCACGCG TGAGATCTTT GCACAGGATG CGATGAAGCC TTTTGTGAAA CACGAGATCC AGCCGGGCAC CGACCTGCAA ACGGACGAGG AGATCGACGG ATTCCTGCGC GAACATGTCG AGAGCGCCTA TCACCCCTGC GGCACCTGCA AGATGGGTGC GGTGGATGAT CCGATGGCGG TGGTTGACCC CGAATGCCGG GTGATTGGCG TCGAGGGGCT GCGGGTGGCG GATAGTTCGA TCTTCCCGCG CATCACCAAC GGCAACCTCA ACGGGCCCTC GATCATGACC GGCGAGAAAG CCTCCGATCA CATTCTGGGG CGCCGTCTGC CTTCGTCGAA TGCCGAGCCG TGGTTCAACC CGAACTGGCA GACCTCGCAG CGTTGA
|
Protein sequence | MNADYVIVGA GSAGCAMAYR LSEAGKSVLV IEHGGTDAGP FIQMPGALSY PMNMSMYDWG YKSQPEPHLG GRELVTPRGK VIGGSSSING MVYVRGHAGD YNHWAETGAT GWSYADVLPY FKRMETWDDR GHGGDPDWRG TDGPLHVTRG PRDNPLHDAF VKSGQQAGYP VTKDYNGQQQ EGFGPMEMTV HKGRRWSAAN AYLKPALKRD NCDLIRALAR KVVIEDGRAV GVEVERGGKI EVIRANIEVI LAASSLNSPK LLMLSGIGPA AHLAEHGIDV IADRPGVGQN LQDHLEFYFQ FASKKPITLY KYWNLFGKAL VGAQWLFTKT GLGASNQFES AAFIRSDKGI DYPDIQYHFL PIAVRYDGQA AAEGHGFQAH VGPMRSQSRG EVTLASADPN AAPKILFNYM STEQDWIDFR KCVRLTREIF AQDAMKPFVK HEIQPGTDLQ TDEEIDGFLR EHVESAYHPC GTCKMGAVDD PMAVVDPECR VIGVEGLRVA DSSIFPRITN GNLNGPSIMT GEKASDHILG RRLPSSNAEP WFNPNWQTSQ R
|
| |