Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3744 |
Symbol | |
ID | 5901206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4059489 |
End bp | 4061153 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564267 |
Product | choline dehydrogenase |
Protein accession | YP_001685369 |
Protein GI | 167647706 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0339169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCA CGCGCTACGA TTACGTCATC ATCGGCGCCG GCTCGGCCGG CTGCGTCCTG GCGGCTCGGC TGACCGAGGA CGCCAACGTC AAGGTCCTGC TGCTGGAGGC CGGCGGCAAG AACACCTCGA TCCTGGTCAA GATGCCCGCC GGGGTGGGCG AGTTGATCAA GGCCAAGGGC GATCAGAACT GGGGCTTCTG GACCGAGGCC GAGCCGCACC TGAATGACCG CAAGCTGTGG TGGCCGCGCG GCAAGGGCCT GGGCGGCAGC TCGGCCATCA ACGGCATGAT CTACATCCGC GGCCACGCCC GCGACTATGA CCAGTGGCGG CAGATGGGGC TGTCGGGCTG GTCCTATGCC GAGGTGCTGC CCTACTTCAA GCGTTCGGAG ACCCACCATG GCGGCGGCGA CGCCTATCAC GGCGGGGCCG GACCGCTGCA CGTGTCGGGC GGCGAGAGCA AGAGCCCGTT CTACCCCGCC CTGATCGAGG CCGGCCGCCA GGCGGGCCAT GCGACCACGA AGGATTTCAA CGGCTTCCGG CAGGAAGGCT TTGGCCCCTA CGATCTGACG ATCCGCGACG GCAAGCGCTG GAGCGCGGCG GCGGCCTATC TGACGGCGGC CCTGGCCCGT CCGAACCTGA CCTGCGTGAC CGAGGCCCGC ACCACGCGGA TCCTGATCGA GAACGGCAAG GCGATCGGCG TGGAATATGT GGTCGGGACC GATCCGGCGC GGCTGGTCGC CCATGCCGAC GCCGAGGTGC TGCTCAGCGC CGGCGCCGTG CAGTCGCCGC ATATCCTGCA GCTGTCGGGC GTCGGCGATC CGGACGACCT GAAGGCCCAC GGCATCGCCC CCGTGCACGA GGCCAAGGGC GTCGGCGCCA ACCTGCAGGA TCACCTGGAC GTCTGCCTGT CGTGGACCAG CAAGAACCTG GTCACCGCCT ATTCGGCCAA CAAGGGCCTC AAGAAGCTGG GCACGGGCCT GTCCTACATG CTGCTGGGCA AGGGCCTGGG TCGTCAGCAG TTCCTGGAGA GCGGAGCCTT CCTGAAGTCG CGCCCCGATC TGGACCGCCC CGACCTGCAG ATCCACGGCG TGCTGGCGAT CATGCAGGAC CACGGCAAGA CGATGATCGA GAAGGACGGC TTCACCCTGC ACGTCTGCCA GCTTCGTCCC GAGAGTCGCG GAAAGGTCGG GTTGCGCTCG GCCGACCCGT TCGACGACCC GACCATCCTG GGCAACTACC TGGCGACCGA CGAGGACCGG CGCGCGATCC GCGAGGGGGT GCGCATCGGC CGCGACGTGG CCGCCCAGGC GGCGCTGGAT CCCTATCGGG AGTCCGAATA CGCGCCAGGC GCCGACATCA AGACCGACGC CGAGATCGAC GCCTGGGTCC GTGCCAAGGC CGAGACCATC TATCACCCGG TCGGCACCTG CCGCATGGGC GCGGCGGGCG ACCCGCTGGC CGTGGTCGAT GACCAGCTGC GCGTACAGGG GATCGAAGGC CTGCGGGTGA TCGACGCCTC GGTGATGCCC ACCCTGATCG GCGGCAACAC CAACGCCCCC ACGATCATGA TCGCCGAACG GGCTTCCGAC CTGATCCGCG GCAAGGTCCT GCTGCCGCCG GTCGAGGTTC CGGTGTTCGA GGACGGAAGG GCGGTCGCGG CTTAA
|
Protein sequence | MASTRYDYVI IGAGSAGCVL AARLTEDANV KVLLLEAGGK NTSILVKMPA GVGELIKAKG DQNWGFWTEA EPHLNDRKLW WPRGKGLGGS SAINGMIYIR GHARDYDQWR QMGLSGWSYA EVLPYFKRSE THHGGGDAYH GGAGPLHVSG GESKSPFYPA LIEAGRQAGH ATTKDFNGFR QEGFGPYDLT IRDGKRWSAA AAYLTAALAR PNLTCVTEAR TTRILIENGK AIGVEYVVGT DPARLVAHAD AEVLLSAGAV QSPHILQLSG VGDPDDLKAH GIAPVHEAKG VGANLQDHLD VCLSWTSKNL VTAYSANKGL KKLGTGLSYM LLGKGLGRQQ FLESGAFLKS RPDLDRPDLQ IHGVLAIMQD HGKTMIEKDG FTLHVCQLRP ESRGKVGLRS ADPFDDPTIL GNYLATDEDR RAIREGVRIG RDVAAQAALD PYRESEYAPG ADIKTDAEID AWVRAKAETI YHPVGTCRMG AAGDPLAVVD DQLRVQGIEG LRVIDASVMP TLIGGNTNAP TIMIAERASD LIRGKVLLPP VEVPVFEDGR AVAA
|
| |