Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3803 |
Symbol | |
ID | 4785972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 4021802 |
End bp | 4023448 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640092386 |
Product | putative choline dehydrogenase lipoprotein oxidoreductase |
Protein accession | YP_001022991 |
Protein GI | 124268987 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0363471 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACG ACGTCACCCC CACCTTCGAT CACGTGATCA TCGGCGGCGG CACGGCCGGC TGCCTGCTGG CCAACCGCCT GAGCGCCGAT CCGGCCAAGC GCGTGCTGCT GCTCGAGGCC GGTGGCCGCG ACGACTACCA CTGGATCCAC ATCCCGGTCG GCTACCTGCA CTGCATCGGC AACCCGCGCA CCGACTGGCT CTACCAGACC GAGCCCGATC CGGGGCTGAA CGGCCGCTCG CTGCGCTACC CGCGCGGCAA GGTGCTGGGC GGCTGCTCGA GCATCAACGG CATGATCTAC ATGCGCGGCC AGTCGCGCGA CTACGACCAC TGGGCGGCGG TGACCGGCGA CGATGGCTGG CGCTGGGACG CCTGCCTGCC GGTCTTCAAG CAGCACGAGG ACCACCACGG TGGCGCCGAC GAGATGCATG GCGCCGGCGG CGAATGGCGG GTCGAGAAGC AGCGCCTGCG CTGGGACGTG CTGGAGGCGT TCGCGTTGGC CGCGCAGCAG GCCGGCATCC CGGCGAGCGC CGACTTCAAC CGCGGCAACA ACGAGGGTGT GGGCTACTTC GAGGTCAATC AGCGCGCCGG CTGGCGCTGG AACACGGCCA AGGCCTTCCT CCGACCGACC TGCTATGGCC GGCCCAACTT CGAGATGTGG ACCGGCGTGC AGGTGACCCG GCTGCTGATC GAGCGCGGTG CCGACGGCGC GCTGCGCTGC ACCGGCTGCG ATGTGCAGAC GCCCCACGGC CCCGAGACCG TCCGGGCCAG TGCCGAGGTG CTGCTCAGCG CCGGTGCGGT GGGTTCGCCG CAGCTGCTGC AGCTGTCGGG CCTGGGACCG GCGGGGCTGC TGCAGCAGTA CGGCATCGCG GTGGCGCAGG ACCTGCCCGG CGTCGGCGAG AACCTGCAGG ACCATCTCCA GATTCGCGCC GTCTTCAAGC TGCAGGGCGT GCCGACGCTC AACACGCTGT CGGCGTCGTG GTGGGGCAGG GCCCGCATCG GCCTGGAATA CGCGTTCAAG CGCAGCGGAC CGATGAGCAT GGCGCCATCG CAGCTCGGTG CCTTCACCCG CTCGTCGCCG GACCACACGT GGCCCAACCT CGAGTACCAC GTGCAACCGC TGAGCCTGGA GGCGTTCGGT GAGCCATTGC ATCGCTTCGA CGCCTTCACG GCGAGCGTCT GCAACCTCAA TCCGACCAGC CGCGGCGCAA TCCGCATCCG CAGCCCGCGC TTCGAGGACG CGCCGCTCAT CGCGCCGCGC TACCTGTCCA CCGAAGCCGA CCGCCAGGTG GCGGCCGACA GCCTGCGGGT GACGCGGCGC ATCGCGGCGC AGCCGGCGTT GGCCAAATAC CGGCCCGAGG AGGTGAAGCC TGGCGTGCAG TTCCAGACCG ACGCCGAGCT GGCGCGGCTG GCCGGCGACA TCGGCACGAC GATCTTCCAC CCGGTGGGCA CCTGCCGGAT GGGGCGCGAC GACGACCCGC AGGCTGTCGT CGATGCGCAG CTGCGGGTAC GCGGCGTGGC CGGCCTGCGG GTGGTCGACG CCAGCGTGAT GCCCACGATC ACCAGCGGCA ACACCAACAG CCCCACGCTG ATGATTGCGG AGAAGGCGGC CCAGTGGATT CGGGCCGATC ACGGCCACCG GCGTTGA
|
Protein sequence | MNDDVTPTFD HVIIGGGTAG CLLANRLSAD PAKRVLLLEA GGRDDYHWIH IPVGYLHCIG NPRTDWLYQT EPDPGLNGRS LRYPRGKVLG GCSSINGMIY MRGQSRDYDH WAAVTGDDGW RWDACLPVFK QHEDHHGGAD EMHGAGGEWR VEKQRLRWDV LEAFALAAQQ AGIPASADFN RGNNEGVGYF EVNQRAGWRW NTAKAFLRPT CYGRPNFEMW TGVQVTRLLI ERGADGALRC TGCDVQTPHG PETVRASAEV LLSAGAVGSP QLLQLSGLGP AGLLQQYGIA VAQDLPGVGE NLQDHLQIRA VFKLQGVPTL NTLSASWWGR ARIGLEYAFK RSGPMSMAPS QLGAFTRSSP DHTWPNLEYH VQPLSLEAFG EPLHRFDAFT ASVCNLNPTS RGAIRIRSPR FEDAPLIAPR YLSTEADRQV AADSLRVTRR IAAQPALAKY RPEEVKPGVQ FQTDAELARL AGDIGTTIFH PVGTCRMGRD DDPQAVVDAQ LRVRGVAGLR VVDASVMPTI TSGNTNSPTL MIAEKAAQWI RADHGHRR
|
| |