Gene Information |
Locus tag | P9303_12251 |
Symbol | |
ID | 4776371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1065563 |
End bp | 1067218 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640086734 |
Product | glucose-methanol-choline (GMC) oxidoreductase:NAD binding site |
Protein accession | YP_001017239 |
Protein GI | 124022932 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
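The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1065563, 1067218)
print(length_bp)                  # 1656
print(protein_length(length_bp))  # 551
```

Because the strand is "-", the start and end values are positions on the replicon's forward strand; the length calculation does not depend on strand.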
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTCAGC ACCCTTATGA GGTGATCGTG ATCGGCTCTG GTGCTACTGG AGGGGTTGCT GCTCTAACCC TTGCAGAAGC TGGTGTACGT GTGCTCGTGG TAGAAGCTGG GCCAGATTTG TCTGCTCAAA AAGCCCTGGG CTCAGAACCT GGAAACACCC TTAGACGTTT GGATGGTTTA TGTAGCGGCA AGCATCGATC TCAGGCTCAA CATCCTGGCT ATTGGAAAGC GAATCCGTTG CTCTACGCGA ATGAAAAGGA GAATCCCTAT ACCTATCCTT CTGAACACCC CTTCATCTGG ACCCAGGGTC GTCAGGTGGG GGGGCGCAGC CTCACTTGGG GTGGAATCAC TCTTCGTCTT TCAGATCAGG ACTTAAAGGC TTCGCGCAGA GATGGTTATG GGCCTGAATG GCCACTGCAA TACAGCGAGT TAGCTCCTCA TTATTCCGCC CTAGAAGAGC GCTTGAAGGT TCATGGTCAT GTGGATGGGT TGGAACAGTT ACCGGATGGC AACTACATCG CCCCATTACC GTTCACAGCT AGTGAACAAC AGTTCGCTAG CGCCGTTGAT ACTGAACTTG GTTATCCAGT CATTCATTCA CGAGGGTTCG GACCTCACCA GCCTTCAGTT GATGGACCTT GGCCTCGTTC AAGCAGTCCG GGCAGCACCT TACAGATGGC ACTTGCCACA GGCAAAGTAG AGATCCTCAG CAACCACAAG GCTGAGCGGT TGCTGATGCA TCCAGATCAT GAAGCAGCCC GAGGGGTGCT GGTGATTGAT CAGCGCAACG GCAACCGACA AGAGCTCCAT GGTGAGCTTG TGGTGCTTTG TGCATCGACA ATTCAGAGTC TTCGACTCCT GCTGAGTTCC GAAGTGAGCC ATCACAGCGC GGGGTTTACT GACCCCTCGG GCAACCTCGG TTGCTATTTG ATGGACCACG TCTCCACCTG TCGTTTCTTT GCTCTGCCAC GTAGCCAAGT GAAGCAGGTG TCTGAGACTG ACTCAACGGC GAATGTGCTT TCTGGAGCTG GCAGTTTTTT TCTTCCTTTC GGTGCTTGCT TAGAGCCTAA AAATCAGTTG AAGTTCTTGC GGGGTTATGG ACTCTGGGGA GGGATTGATC GCTTTGAACC CCCAGATTGG TTGAAACGTA AACCAGACAC AGCTACAGGT TTCCTGATTG GGCATGGTGA AGTGTTGCCT TCACCTCACA ACAAAGTGAC GCTGTCAAGC ACTTTGGATC GCTGGGGTGT TCCTGTGCCA CATATCGATT GTCGATGGGG AGAGAACGAG CAAGCCATGG TTGATCACAT GCAAGACACG ATCAAGACAG CGATCCAGTC AGCTGGGGGA ACAATGTTGC CGCTCAAGGA GCTGATTAAT TTGATGTTTC TCGAACCCCT TCTCGATGGT GCGCTAGCTC TCAGTGAAAC GTCTCCACCG CCGGGGTATT ACATCCATGA AGTTGGCGGT GCTGCGATGG GAGAACGTGA AGATTGCAGT GTGGTGGATC GTTGGAACCG TCTTTGGCGA TGTCCAAATG TGCTTGTTGT GGATGGAGCG TGTTGGCCAA CATCCGCCTG GCAGAGTCCC ACTCTGACAA TGATGGCGAT TACAAGAAGG GCTTGTCTAC AAGCCCTTAA GCCTCGGCGT GGCTGA
|
Protein sequence | MIQHPYEVIV IGSGATGGVA ALTLAEAGVR VLVVEAGPDL SAQKALGSEP GNTLRRLDGL CSGKHRSQAQ HPGYWKANPL LYANEKENPY TYPSEHPFIW TQGRQVGGRS LTWGGITLRL SDQDLKASRR DGYGPEWPLQ YSELAPHYSA LEERLKVHGH VDGLEQLPDG NYIAPLPFTA SEQQFASAVD TELGYPVIHS RGFGPHQPSV DGPWPRSSSP GSTLQMALAT GKVEILSNHK AERLLMHPDH EAARGVLVID QRNGNRQELH GELVVLCAST IQSLRLLLSS EVSHHSAGFT DPSGNLGCYL MDHVSTCRFF ALPRSQVKQV SETDSTANVL SGAGSFFLPF GACLEPKNQL KFLRGYGLWG GIDRFEPPDW LKRKPDTATG FLIGHGEVLP SPHNKVTLSS TLDRWGVPVP HIDCRWGENE QAMVDHMQDT IKTAIQSAGG TMLPLKELIN LMFLEPLLDG ALALSETSPP PGYYIHEVGG AAMGEREDCS VVDRWNRLWR CPNVLVVDGA CWPTSAWQSP TLTMMAITRR ACLQALKPRR G
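The relationship between the two sequence fields can be sketched in plain Python. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code but permits alternative start codons such as GTG, which are read as methionine at the initiator position. The sketch assumes the gene sequence is shown 5' to 3' on the coding strand (consistent with the protein shown, although the record does not state it) and that the first codon is a valid initiator; all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; the first codon is assumed to be an initiator and read as M."""
    dna = clean(seq)
    if len(dna) == 0 or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 nt of the gene sequence above.
print(translate_cds("GTGATTCAGC ACCCTTATGA GGTGATCGTG"))  # MIQHPYEVIV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.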