Gene Information |
Locus tag | P9301_01691 |
Symbol | |
ID | 4912571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 157638 |
End bp | 159233 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640159735 |
Product | NAD(P)H-quinone oxidoreductase subunit 4 |
Protein accession | YP_001090393 |
Protein GI | 126695507 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
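The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function name `check_gene_coordinates` is hypothetical and is not part of the source database.

```python
def check_gene_coordinates(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree."""
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    span = end_bp - start_bp + 1
    # An unspliced CDS is a whole number of codons; the last one is the stop codon.
    codons, remainder = divmod(span, 3)
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


# Values taken from the record: 159233 - 157638 + 1 = 1596 bp, 1596 / 3 - 1 = 531 aa.
print(check_gene_coordinates(157638, 159233, 1596, 531))  # True
```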
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.588128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTGGGAG CTGGATTGTC TAACTTTCCT TGGTTGTCTG CCTCAATTTT ATTCCCAATT GGTAGTGCAT TTGTAATACC TTTTTTCCCA GATAAAGGGG ATGGCAAAGA AGTGAGATGG TTCGCATTAT CTATTGCATT AATAACTTTT TTAATAACTG TAGGTTCATA TATTAATGGC TTTGATATCA ATAATGAAAA TGTTCAACTG AAAGAAAATA TTAGTTGGCT CCCGGATTTA GGGCTTACTT GGTCTGTTGG TGCTGATGGC ATGTCTATGC CTTTAATACT ATTAACTAGT TTTATAACTG CTTTAGCAGT TCTTGCTGCA TGGCCAGTCA AGTTCAAACC AAAGTTATTT TTCTTTTTGA TATTGGTTAT GGATGGTGGG CAAATCGCTG TATTTGCTGT ACAAGATATG CTTTTATTCT TTCTAACTTG GGAACTGGAG TTGATTCCAG TATATTTATT ACTGGCCATT TGGGGTGGCA AAAATCGACA ATATGCCGCA ACTAAATTCA TTATCTATAC GGCTGGCAGT TCTATTTTCA TTCTTTTAGC AGCATTAGCA ATGGGTTTCT ATGGTACAGA AATTCCTAAC TTTGAGTTTT CTCACTTGGC AGCTCAAGAT TTTAATCAGA AATTCCAAAT ATTCTGTTAT GTAGGGCTAC TAATTGCATT TGGAGTAAAA CTCCCAATAG TACCTCTTCA TACATGGCTT CCAGATGCAC ATGGAGAGGC TACAGCTCCT GTTCATATGC TTCTTGCAGG AATTTTATTA AAGATGGGAG GATATGCTCT TTTAAGATTT AATGCACAAT TATTACCCGT TGCTCATGCT CAATTTGCCC CTTTGTTAAT AGTTTTAGGG GTAGTCAATA TAATTTATGC TGCATTAACT TCTTTTGCTC AAAGAAATCT TAAAAGAAAA ATTGCATACA GTTCGATAAG TCATATGGGT TTCGTTCTCA TTGGAATAGG TAGTTTTAGT AGCCTTGGAA CAAGCGGAGC TATGCTGCAA ATGGTTAGTC ATGGATTAAT TGGCGCAAGT TTATTTTTTC TTGTTGGGGC TACCTATGAC AGAACAAAGA CTCTTAAACT TGATGAAATG AGTGGCGTTG GACAAAAAAT GAGAATAATG TTTGCCCTTT GGACTGCTTG CTCCCTGGCT TCCCTCGCTT TACCTGGTAT GAGTGGATTT GTTTCCGAAT TGATGGTATT TACAGGATTT GTTACTGATG AAGTGTATAC ACTTCCTTTT AGGGTAGTGA TGGCCTCTTT AGCTGCTATC GGTGTAATAC TTACCCCTAT TTATCTACTT TCTATGTTAA GAGAGATTTT CTTTGGTAAA GAAAATCCTA AATTAATCGA AGAACGAAAA CTTATAGATG CAGAGCCAAG GGAAGTTTAT ATTATAGCTT GTTTACTTTT GCCTATTATC GGAATAGGTT TGTACCCAAG ATTAGTTACT GAAAGTTATA TTGCTTCTAT AAATAATTTG GTCGATCGAG ATTTAACTGC AGTTAAAAGT GCAGTTAAAA CAAATATTTT TTCAGGAACT AAGACAAATG ATATTCTAAA AGCTCCAACA ATATAA
Protein sequence | MLGAGLSNFP WLSASILFPI GSAFVIPFFP DKGDGKEVRW FALSIALITF LITVGSYING FDINNENVQL KENISWLPDL GLTWSVGADG MSMPLILLTS FITALAVLAA WPVKFKPKLF FFLILVMDGG QIAVFAVQDM LLFFLTWELE LIPVYLLLAI WGGKNRQYAA TKFIIYTAGS SIFILLAALA MGFYGTEIPN FEFSHLAAQD FNQKFQIFCY VGLLIAFGVK LPIVPLHTWL PDAHGEATAP VHMLLAGILL KMGGYALLRF NAQLLPVAHA QFAPLLIVLG VVNIIYAALT SFAQRNLKRK IAYSSISHMG FVLIGIGSFS SLGTSGAMLQ MVSHGLIGAS LFFLVGATYD RTKTLKLDEM SGVGQKMRIM FALWTACSLA SLALPGMSGF VSELMVFTGF VTDEVYTLPF RVVMASLAAI GVILTPIYLL SMLREIFFGK ENPKLIEERK LIDAEPREVY IIACLLLPII GIGLYPRLVT ESYIASINNL VDRDLTAVKS AVKTNIFSGT KTNDILKAPT I
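The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, the sequence is assumed to be on the coding (+) strand, and the codon assignments are those of NCBI translation table 11. It does not handle alternative start codons, which table 11 reads as methionine at the first position; that does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG codon order ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGTTGGGAG CTGGATTGTC TAACTTTCCT"))  # MLGAGLSNFP
```

Pasting the full gene sequence into `translate` and `gc_content` would allow a comparison with the protein sequence and the reported GC content.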