Gene Information |
Locus tag | P9515_07141 |
Symbol | |
ID | 4719530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 648917 |
End bp | 650557 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640080392 |
Product | glucose-methanol-choline (GMC) oxidoreductase:NAD binding site |
Protein accession | YP_001011030 |
Protein GI | 123965949 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
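The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene is an unspliced CDS ending in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the stop codon is not translated
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(648917, 650557))  # (1641, 546)
```

Under those assumptions the result agrees with the listed Gene Length (1641 bp) and Protein Length (546 aa).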
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.915082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATATAA ATCCTTATGA TGCAATCGTA GTAGGTTCAG GTGCCACAGG AGGAGTAGCT GCACTTACTT TAGCCGAACA AGGAATAAGA GTGCTAGTAA TAGAAGCTGG TCCTCAAATT AAAAGAACTG AGGCTAGCAG TAATGAGCCA AAAGATACCT TAAACAGATT ATCAGGAATA ATATCAAAAA AACACGCTAA TCAATGTCAG CATCCCGGTT ATTGGAAAAA TAATCCTAAT TTATATAAAA ACGAATTAAA ACATCCTTAT GTTCAACAAA AAAACAAGCC ATTCCTTTGG ACTCAAGGAA ACCAATATGG AGGAAGATCA TTAACGTGGG GAGGTATTAC CTTAAGATTT TCTAGAGAGG ATTTTCATCC ATCAAAAAAA GATGGATATG GGCCAGATTG GCCTATTTCC TACGACGAAT TATCACCTCA TTATGATTTT ATAGAAAATT TTTGTGGAAT ATATGGACAT AAAGATAACA TCAAAGAAGT CCCAAATGGT AAATATATTG GGAAGATACC TCTTACAAAA ATTGAAAGTA TTTTTGGCAA TCAAGTCAAA TCGAAATTAA ACTATCCCTT TATTCAATCT AGAGGATTCG ATCGTAATTC TTCAGTAAAA GAGGAACAAT GGCCAAAATC TTCGAGTGTA GGGACCACAT TCAAAAAAGC ACTAGAGACT GGAAATGTCC AAATACTCTC TAATCACTTA GTAGAATCAT TCGAAACGGA TAAAATAACA GAGCTTGCTT CAAAAATAAT CATCGTAAAT GTTGAAAATG GTAAAAGAAA ATCATTAAAT TGCGATTTAA TTATTCTATG TGCATCAACA ATCTCAACAT TGAGGATACT ACTTAACTCT GAAACCAAAT CAAATTCTTC TGGTTTTAAA GATACATCTG GAAAATTAGG AAAATTTCTA ATGGATCATA TTTCTGTCTG TAGGTTTTTT TCAGTTCCAA ATACAACTCA AAGAAACAAT ATATCTAATT CGTATCCTGA TCTTTCCGGA GCTGGGAGTT TTTTCATACC CTTTGGGACA AATCTGCCCA AACCTGAAAG TATTAATTTT TTACGGGGTT ATGGAATTTG GGGAGCAATT GACCGTCTAG GAATACCAAA GTTTTTGCAA AAAGACTTAA ATTCTTCTAC AGGTTTTCTA ATCGCTCATG GTGAGGTCCT ACCAAGAGAA GAAAATTCAG TCTCCCTTTC TGACAAAACA GACCGATGGG GGATTCCGGT CCCTCATATT GAATTCGAAT GGAGTGAAAA TGAATTAAAT ATGGCTAAAC ATATGGAGAG AACGATGCGA GATTCAATAA AAGCTGCAGA TGGAAAAATC AAAGGGATCG ATGAACTTAT AAAAATCCCA TATGCAGGGT TGTTTACAAA AAAATCAATT GCTCTTTCAG GAAATCCGCC ACCCCCGGGA TACTATATCC ATGAAGTAGG AGGAGCACCA ATGGGATTTA GAGAGGAAGA TAGTGTTGTA AATAAATCAA ATAGACTTTG GAGATGTAAG AATGTTCTTG TATTAGATGG TGCATGTTGG CCCACTTCTT CATGGCAGAG TCCAACTTTA ACAATGATGG CTATAAGCAG AAGAGCTTGT TTAAAAGTTA AAAAGACTTA G
|
Protein sequence | MDINPYDAIV VGSGATGGVA ALTLAEQGIR VLVIEAGPQI KRTEASSNEP KDTLNRLSGI ISKKHANQCQ HPGYWKNNPN LYKNELKHPY VQQKNKPFLW TQGNQYGGRS LTWGGITLRF SREDFHPSKK DGYGPDWPIS YDELSPHYDF IENFCGIYGH KDNIKEVPNG KYIGKIPLTK IESIFGNQVK SKLNYPFIQS RGFDRNSSVK EEQWPKSSSV GTTFKKALET GNVQILSNHL VESFETDKIT ELASKIIIVN VENGKRKSLN CDLIILCAST ISTLRILLNS ETKSNSSGFK DTSGKLGKFL MDHISVCRFF SVPNTTQRNN ISNSYPDLSG AGSFFIPFGT NLPKPESINF LRGYGIWGAI DRLGIPKFLQ KDLNSSTGFL IAHGEVLPRE ENSVSLSDKT DRWGIPVPHI EFEWSENELN MAKHMERTMR DSIKAADGKI KGIDELIKIP YAGLFTKKSI ALSGNPPPPG YYIHEVGGAP MGFREEDSVV NKSNRLWRCK NVLVLDGACW PTSSWQSPTL TMMAISRRAC LKVKKT
|
| |
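The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG can act as an initiation codon. The sketch below illustrates that relationship and also shows one way a GC fraction could be computed from the sequence. It is an illustration only, not the database's own pipeline: the helper names `translate_cds` and `gc_fraction` are hypothetical, the codon table is built by hand from the standard amino-acid assignments, and the listed sequence is assumed to be already given in coding (5' to 3') orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Codon -> amino acid assignments shared by NCBI tables 1 and 11,
# in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def _clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = _clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = _clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon always encodes Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence listed above.
print(translate_cds("TTGGATATAA ATCCTTATGA"))  # MDINPY
```

The printed prefix matches the start of the listed protein sequence (MDINPYDAIV ...). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.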