Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0845 |
Symbol | |
ID | 4076020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 895892 |
End bp | 897502 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638006143 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_612840 |
Protein GI | 99080686 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.189337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGACT TTGACTATAT CATTGTGGGA GCCGGATCCG CGGGCTGTGT GCTGGCCGAG CGCCTGAGCG CCAATGGCCG CCATAGTGTG CTTGTGCTGG AGGCCGGAGG TCGCCCGCGC ACACCATGGA TCGCGTTGCC GCTTGGCTAC GGCAAGACCT TCTATGACCC GGCGGTGAAC TGGAAATATC AGACTGAACC CGAAGAGACA CTGGGCGGAC GCGCCGGATA TTGGCCGCGT GGCAAGGTCG TGGGGGGATC GGGTGCGATC AATGCGCTTG TCTACGCCCG TGGCCTGGCG CGCGATTTCG ACGATTGGGA AGAGGCGGGC GCGACGGGCT GGAACTGGGA CGCGGTCCAG AAAACCTATG AGCGCCTTGA GAGCCGCTTT GATGTCGATG GCACCCGCAC CGGCGAGGGG CCGATTCACG TTCAGGATGT CTCGGACCAG ATCCACCGGG CCAACCGGCA TTTCTTTGCC GCAGCGAAAG AGCTGGGTCT GCCACGGACA CCCGATATGA ACGGTATCAC CCCCGAAGGC GCGGGCGTCT ACCGGATCAA CACCAGCGGT GGGCGCAGGA TGCATTCGGC GCGCGCCTGT TTGGCTCCTG CGCTCCGGCG CGCAAATGTG ACGCTGATGA CGGGCGTTCT GGTGGAGCGG ATCGGCTTTG AGGGAAAGCG GGCCACCTCC GTCGAGGTGG TCCACAAGGG GCGCGCGCAG TCCTTGCAGG CCGGGCGAGA GATCATTCTC GCGGCAGGGG CTGTAAATTC ACCGCGCATC TTGCAACTCT CGGGGCTTGG CCCCGCGGAG CTGCTGCGTG AGCATGGGAT CGCGCCGCTG ATGGATGCGC CTCATGTAGG TGGCAACCTG CAGGATCATC TGGGCATAAA CTATTATTTC CGTGCCACCG AACCCACGCT CAACAACGTG CTGAGGCCGC TCCATGGCAA GATCCGCGCA GCGCTGCAAT ATGCGCTCAC GCGGCGCGGG CCGCTCGCGC TCTCGGTCAA CCAATGTGGT GGATTTTTTC GCTCGGATGC GGGGCAGCGG GCGGCTGATC AGCAGCTTTA CTTCAACCCC GTGACCTATA CCACCACACC GGACGGCAAA CGCACGGTGG TGCAGCCCGA CCCCTTTGCG GGCTTTATCC TTGGGTTTCA GCCCACCCGG CCCATCAGCC GGGGGCGAAT CGACATTTCC GCCGCCGACG CGCTTGCGCC GCCCCGGATC AGGCCGGACT CGCTGGCTGC TCAGGAAGAT CAGGCGCAGG TGATCGCAGG CGGGCTGCTC TGTCAGAAGA TCGCCAAGAC CGAGGCGCTC AGCCGCTTGA TCGCCGCGCC CATGGGCGAG GATCTGCGCG AGATGACACC GGAGCAGATC CTAGCGGACT TTCGCGAGCG CTGCGGCACC GTGTTTCACC CGGTCGGCAC CTGTCGCATG GGTGCAGACA GCACCAAGTC CGTGGTTTGC CCTCGGCTCA AGGTGCATGG GGTCGCGGGG CTGCGGGTCG TTGATGCCTC GGTCTTCCCG AATATCACCT CGGGCAACAC CAACGCCCCA ACCATGATGC TTGCCACCCG CGCGGCCGGT CTCATTCTGG AGGACGCATG A
|
Protein sequence | MRDFDYIIVG AGSAGCVLAE RLSANGRHSV LVLEAGGRPR TPWIALPLGY GKTFYDPAVN WKYQTEPEET LGGRAGYWPR GKVVGGSGAI NALVYARGLA RDFDDWEEAG ATGWNWDAVQ KTYERLESRF DVDGTRTGEG PIHVQDVSDQ IHRANRHFFA AAKELGLPRT PDMNGITPEG AGVYRINTSG GRRMHSARAC LAPALRRANV TLMTGVLVER IGFEGKRATS VEVVHKGRAQ SLQAGREIIL AAGAVNSPRI LQLSGLGPAE LLREHGIAPL MDAPHVGGNL QDHLGINYYF RATEPTLNNV LRPLHGKIRA ALQYALTRRG PLALSVNQCG GFFRSDAGQR AADQQLYFNP VTYTTTPDGK RTVVQPDPFA GFILGFQPTR PISRGRIDIS AADALAPPRI RPDSLAAQED QAQVIAGGLL CQKIAKTEAL SRLIAAPMGE DLREMTPEQI LADFRERCGT VFHPVGTCRM GADSTKSVVC PRLKVHGVAG LRVVDASVFP NITSGNTNAP TMMLATRAAG LILEDA
|
| |