Gene Information |
Locus tag | M446_4716 |
Symbol | |
ID | 6130958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 5189072 |
End bp | 5190757 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641644858 |
Product | choline dehydrogenase |
Protein accession | YP_001771486 |
Protein GI | 170742831 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
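The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the CDS ends in a single stop codon that adds no residue to the protein. The record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5189072
end_bp = 5190757
gene_length_bp = 1686
protein_length_aa = 561

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
codons, remainder = divmod(span, 3)


def record_is_consistent():
    """Return True when the coordinates, gene length, and protein length agree."""
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons, record_is_consistent())  # prints: 1686 562 True
```

Under these assumptions the three fields agree: 1686 bp gives 562 codons, and dropping the stop codon leaves 561 residues.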
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.959774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAGG AATACGATTT CATCATCGTC GGTGCCGGCT CGGCCGGAAA CGTGCTGGCC GCCCGGTTGA CCGAGGACGC GGACGTCAGC GTCCTGCTCC TGGAGGCGGG TGGTCCCGAC TACCGCTTCG ACTTTCGCAC CCAGATGCCA GCGGCGCTCG CCTATCCACT CCAGGGGCGG CGCTACAACT GGGCGTATCT CACCGATCCG GAGCCTCACA TGAACAACCG GCGCATGGAG TGCGGCCGGG GCAAGGGGCT CGGCGGCTCG TCGCTGATCA ACGGCATGTG CTACATCCGC GGCAATCCCC TCGACTACGA CGGCTGGGCG CGATTGCCCA GCCTCGCGGA CTGGTCCTAC GGCGATTGCC TGCCGTACTT CCGCAGGGCG GAGACACGCG ACCTTGGGCC GGACACCTAT CACGGCGGAG ACGGACCGCT CTACGTCACC GCGGCCAAGC CCGGCGGAAA CGTTCTGTTC GAGGCCATGA TCGAGGCCGG CGTGCAGGCG GGCTACCCCC GCACCGCGGA CCTCAACGGC TACCAGCAGG AAGGCTTCGG TCCGATGGAC AGGAGCGTCA CCGCGAAGGG ACGCCGGTCA AGCACCGCCC GGGGCTACCT CGACCAGGCG CGGGAGCGGC CCAACCTCAC TATCGTCACC CACGCACTCA CCGACCGCAT CCTGTTCAAT GGCAAACGCG CGAGCGGCGT CGTCTATCTG CGCGGGAACA AGGAGCCGTC CTTTGCGCGG GCACGCCGCG AGGTGCTGGT CTGCTCCGGC GCGATCGCCT CGCCGCAGAT CTTGCAGCGC TCCGGCGTCG GGCCTGCCAA TCTCCTGCGC AATCTCGACG TCCCCCTCGT TCTCGACCTG CCCGGCGTCG GCGAGAACCT GCAGGACCAT CTTGAGATGT ACGTGCAATA CGAGTGCCGG CAGCCCGTCT CCCTGGCACC CGCCCTCAAG CCCTGGAACC AGCCCGCCAT CGGCGCGCGA TGGCTGTTCG CGGGGACGGG GATCGGCGCA AGCAACCAGT TCGAAGCGGG CGGCTTCATC CGCTCCGACT CCGAATTCGC CTGGCCGAAC TTGCAATATC ACTTCCTACC GCTGGCGATC AGCTACAACG GCAGCCACGC GGTGAAATCG CACAGCTTTC AGGCCCATGT CGGCTCGATG CGCTCACCGA GCCGGGGTCG GATCCGGCTC ACTTCGCGGG ACCCGCACGC CCATCCGAGC ATCCTGTTCA ACTACATGTC GGCCGACCAG GACTGGCGCG AGTTCCGCGT TGCGATTCGG ATCACCCGCG AGATCATGGC CCAGCCGGCC CTTGATCCGT ACCGCGGGCG CGAGATCAGC CCAGGCGCGG CGCTACGTTC CGACGCGGAG CTCGACGCCT TCGTGCGCGC CCATGCCGAG ACCGCCTACC ACCCTTCCTG TTCCTGCAAG ATGGGCGAGG ATGCGATGGC GGTGGTCGAC GGCCGGGGCC GGGTGCATGG GCTGGAGGGT CTGCGCGTGG TCGACGCCTC GATCATGCCG CAGATCGTCA CGGGCAACCT CAACGCCCCG ACGATCATGA TCGCGGAGAA GATCGCCGAC GACATTCGCG GCCGGACCCC CCTGCCGCGC AGCCAGGCAG CCTATTTCGT GGCCGGCGAT ACGCCGCCGC GGAGGAGGCC GGTTCGCGCA AGCTGA
|
Protein sequence | MTQEYDFIIV GAGSAGNVLA ARLTEDADVS VLLLEAGGPD YRFDFRTQMP AALAYPLQGR RYNWAYLTDP EPHMNNRRME CGRGKGLGGS SLINGMCYIR GNPLDYDGWA RLPSLADWSY GDCLPYFRRA ETRDLGPDTY HGGDGPLYVT AAKPGGNVLF EAMIEAGVQA GYPRTADLNG YQQEGFGPMD RSVTAKGRRS STARGYLDQA RERPNLTIVT HALTDRILFN GKRASGVVYL RGNKEPSFAR ARREVLVCSG AIASPQILQR SGVGPANLLR NLDVPLVLDL PGVGENLQDH LEMYVQYECR QPVSLAPALK PWNQPAIGAR WLFAGTGIGA SNQFEAGGFI RSDSEFAWPN LQYHFLPLAI SYNGSHAVKS HSFQAHVGSM RSPSRGRIRL TSRDPHAHPS ILFNYMSADQ DWREFRVAIR ITREIMAQPA LDPYRGREIS PGAALRSDAE LDAFVRAHAE TAYHPSCSCK MGEDAMAVVD GRGRVHGLEG LRVVDASIMP QIVTGNLNAP TIMIAEKIAD DIRGRTPLPR SQAAYFVAGD TPPRRRPVRA S
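The relationship between the Gene sequence and Protein sequence fields can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the pipeline that produced the record. It assumes the Gene sequence is already given in coding orientation (it starts with ATG and ends with TGA, although the gene is on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for internal codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses these same assignments; it differs from the standard
# table only in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the sequence starts with ATG, so no special handling of
    alternative start codons (for example GTG or TTG) is attempted.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the Gene sequence field above.
prefix = "ATGACGCAGG AATACGATTT CATCATCGTC"
print(translate(prefix))  # prints: MTQEYDFIIV
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence string to `gc_fraction` and `translate` gives values that can be compared with the GC content and Protein sequence fields.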