Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_06751 |
Symbol | |
ID | 4912804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 602142 |
End bp | 603782 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640160256 |
Product | glucose-methanol-choline (GMC) oxidoreductase:NAD binding site |
Protein accession | YP_001090899 |
Protein GI | 126696013 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATATAA GTCCTTATGA TGCAATTGTT GTTGGTTCTG GAGCTACAGG AGGAATAGCA GCACTTACAT TGGCAGAACA AGGGATCAAA GTTTTAGTAA TAGAAGCAGG GCCTCAAGTT AAAAGGGATG AAGCTAGTAA TCATGAGCCA AAAAGTACAT TAAAAAGATT ATCAGGACTA ATAACAAAAA AAAATGCCAA TCAGTGCCAA CATCCTGGTT ATTGGAAAAA TAATCCTGAC TTATATTCAA ATGAATTGAA GCATCCTTAT GACTTCCCAA AAAAAAAGCC ATTTCTTTGG ACACAAGGTA AACAATTTGG GGGGAGATCA TTAACCTGGG GAGGCATAAC TTTAAGACTT TCCTCAGAAG ACTTTCATCC AGCTAAAAAA GACGGATTCG GGCCAAACTG GCCTATCTCG TACGAAGAAA TCTCCCCGCA CTATGATTTT ATTGAAAATT TCTGCGGAAT TTATGGCCGA AAAGATGATA TCAAGGAGGT CCCAAACGGT AAATATATTG GTGAGATTCC TCTTACAGAA AACGAAAATG TTTTTGGCAG CAAAGTTAAA TCAAAATTAA ACTATCCATT TATCCAATCA AGAGGATTTG ACCGTAATTC ATCTGTAAAA GAAAAAAATT GGCCCAAGTC CTCTAGCTTA GGAAGCTCTT TTAAAAAAGC TTTAGATACT GGAAATGTAC AAATAATCTC TAATTACCTA GTGGAGTCTT TTGAGATTAA CAAGATAACA GAGCTTGCCT CAAAACTAAC GATTGTAAAC CTAGAAAATG GATACAAAGA AGTATTGGAT TGTGATTTGA TTCTTCTTTG CGCATCAACA ATTTCAACAT TGAGAATACT ATTAAACTCA GAATACAAAT CAAATTCCTC AGGGTTTAAA GATAATTCTG GGAAATTAGG TAAATATCTA ATGGATCACA TATCTATATG TAGATTTTTT TCAGTCCCAA AAGCAAAAAA CTCAGATAAA TCACTAGATA ATCCTCCCGA TCTTTCTGGA GCAGGCAGCT TCTTTATTCC TTTTGGTTCA AATTTGCCAG AAATTGATGA CATAAATTTC CATAGAGGTT ATGGGATCTG GGGGGCAATT GATAGATTAG GGATACCTAA ATTTTTACAA AAAGACGTAA ACAAATCCAT TGGCTTTCTT ATCGCCCATG GTGAAGTCCT TCCAAGAGAG AAAAACTCAG TTTCTCTCTC AAGAAAAACA GATGAATGGG GAATACCAAT TCCCTACATT GAATTCGAAT GGAGCGAGAA TGAGTTAAAT ATGGCAAAAC ATATGGAAAA AACAATACAA AGATCAGTCA AAGCTGCAAA TGGGAAAATA AAAAATATTG ATGAACTAAT GAATATTCCA CTAGGGAGTT TATTTACAAA AAATTTGATA GCACTTTCAG ATAGTCCTCC TCCTCCTGGA TATTATATTC ATGAGGTAGG GGGAGCACCG ATGGGGTTAA ATGAAGAAAA TAGCGTAGTT GATAAATTTA ATAGATTGTG GAGATGTAAG AATGTACTGG TACTAGATGG AGCATGCTGG CCCACATCAT CTTGGCAAAG CCCCACACTT ACAATGATGG CCTTGAGTAG AAGAGCCTGT TTAAATATTA AAAAGACTTA G
|
Protein sequence | MDISPYDAIV VGSGATGGIA ALTLAEQGIK VLVIEAGPQV KRDEASNHEP KSTLKRLSGL ITKKNANQCQ HPGYWKNNPD LYSNELKHPY DFPKKKPFLW TQGKQFGGRS LTWGGITLRL SSEDFHPAKK DGFGPNWPIS YEEISPHYDF IENFCGIYGR KDDIKEVPNG KYIGEIPLTE NENVFGSKVK SKLNYPFIQS RGFDRNSSVK EKNWPKSSSL GSSFKKALDT GNVQIISNYL VESFEINKIT ELASKLTIVN LENGYKEVLD CDLILLCAST ISTLRILLNS EYKSNSSGFK DNSGKLGKYL MDHISICRFF SVPKAKNSDK SLDNPPDLSG AGSFFIPFGS NLPEIDDINF HRGYGIWGAI DRLGIPKFLQ KDVNKSIGFL IAHGEVLPRE KNSVSLSRKT DEWGIPIPYI EFEWSENELN MAKHMEKTIQ RSVKAANGKI KNIDELMNIP LGSLFTKNLI ALSDSPPPPG YYIHEVGGAP MGLNEENSVV DKFNRLWRCK NVLVLDGACW PTSSWQSPTL TMMALSRRAC LNIKKT
|
| |