Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07591 |
Symbol | |
ID | 5730639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 663866 |
End bp | 665515 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641285122 |
Product | glucose-methanol-choline (GMC) oxidoreductase:NAD binding site |
Protein accession | YP_001550644 |
Protein GI | 159903300 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00259994 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAAATA CTTACGAAGC AATTGTAATT GGATCAGGTG CCACTGGGGG GGTAGCTTCT CTTACCCTTG CGAAAGCAGG AGTAAGAGTT CTTTTAGTAG AGGCTGGGCC TCTTTTAACT GCGAAACAAG CACTAGGAAC AGAGCCTTTA AATACATTCA AAAGATTATT ATCAATATTT AATGGAAATC ATAGAAAACA AATTCAACAT CCTGGTTATT GGAAGGCAAA TCCTTCTTTA TATATAAATG AGAAAGAGAA CCCATATATA TATCCAAAAG AAAAGCCATT TATTTGGACA CAAGGAAGAC AAGTTGGAGG TAGAAGTCTT ACTTGGGGGG GGATAACATT AAGACTTTCA AATAGTGATC TAAAAGCAGC AACTAAAGAT GGATATGGTC CCGAATGGCC TATAAGCTAT TCAGATCTTG AACCTCATTA CAATATTTTA GAAAGATTTT TCAAAGTGCA TGGTTGTAAT GATGGACTAA CACAATTACC TGACGGATAT TTTATTAAAA ACCTACCTTT TACTAATTCT GAGTCTCTAT TTGCTAGCGA AGTAAAAAGA AAGCTCGGTT ATCCAATAAT CCATTCAAGA GGTTTTGGTC CCCATGCTCA TGTCAATGAT GAAGAATGGC CCAGATCAAG TAGCCCTGGG AGTACGCTTA AAGTTGCTTT GGCCACAGGA AAAGTTAATT TGCTTCCTGA GCATATAGCA GAAACATTAA TAATAAATAA ATCGAATTCA ATTGCCGAAG GAATAATAGT TATTGACAAA GAAACAGGGG CTCGTAAAGA ACTTAAGAGT AAATTAATCG TACTATGTGG CTCTACAATA CAAAGCCTTA GACTTCTACT AAATTCACAA GAAAAGTATA ATAATAAAAG ATTAATAGAT CTTTCAGAAA GTCTAGGATG TAATTTAATG GATCATATTT CCACCTGTAG ATTTTTTGCA ATGCCAACAA AAACTGAGTC TAAAGACTCT AAATTTTTAA CAGCCCAAAA GTTATCAGGT GCAGGTAGCT TCTTTATTCC ATTTGGCAAT AAGTTAGATT CAACAGATGA TGTTGATTTT CGAAGAGGCT ATGGTATTTG GGGAGCAATA GATCGATTTG AGCCTCCTGG AATACTAAAA AGAAAACCAA ATTCAAAAAT AGGTTTTTTA ATAGGACATG GTGAAGTACT TCCCTATAAA GAGAACAAGG TCACTCTTTC TACTCAACTA GATAAGTGGG GTGTTCCAAT ACCAAGTATT GAATGCGAAT GGAAGACTAA CGAGATAAAG ATGGTTGCTC ATATGAATAA AACAATTCAA AAATGTATTT CCGCAGCTGG AGGTGAAATA CTCCCGCTAA AAGAACTAAT AAAAATGCCT TTTGTAGAAC CTATTATTAA CAGTGCAATG GCAATACAAG ACAAAGCTCC TCCGCCTGGG TATTACATAC ATGAAGTAGG TGGAGCTCCT ATGGGGTATT GTCCAGAATC TAGCGTTTTA GACCCATTAA ATAGATTATG GGCCTGTCCA AATGTATTAG TAGTAGATGG ATCTTGTTGG CCAACCTCTT CTTGGCAAAG TCCAACTTTG ACAATGATGG CTATATCAAG AAGAGCTTGC TTAGGAGCTA TTAAGAATCA GAGAGCTTAA
|
Protein sequence | MTNTYEAIVI GSGATGGVAS LTLAKAGVRV LLVEAGPLLT AKQALGTEPL NTFKRLLSIF NGNHRKQIQH PGYWKANPSL YINEKENPYI YPKEKPFIWT QGRQVGGRSL TWGGITLRLS NSDLKAATKD GYGPEWPISY SDLEPHYNIL ERFFKVHGCN DGLTQLPDGY FIKNLPFTNS ESLFASEVKR KLGYPIIHSR GFGPHAHVND EEWPRSSSPG STLKVALATG KVNLLPEHIA ETLIINKSNS IAEGIIVIDK ETGARKELKS KLIVLCGSTI QSLRLLLNSQ EKYNNKRLID LSESLGCNLM DHISTCRFFA MPTKTESKDS KFLTAQKLSG AGSFFIPFGN KLDSTDDVDF RRGYGIWGAI DRFEPPGILK RKPNSKIGFL IGHGEVLPYK ENKVTLSTQL DKWGVPIPSI ECEWKTNEIK MVAHMNKTIQ KCISAAGGEI LPLKELIKMP FVEPIINSAM AIQDKAPPPG YYIHEVGGAP MGYCPESSVL DPLNRLWACP NVLVVDGSCW PTSSWQSPTL TMMAISRRAC LGAIKNQRA
|
| |