Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_1667 |
Symbol | |
ID | 3607068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | + |
Start bp | 332815 |
End bp | 334995 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637688548 |
Product | selenide,water dikinase |
Protein accession | YP_292858 |
Protein GI | 72383503 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0709] Selenophosphate synthase [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | [TIGR00476] selenium donor protein [TIGR03169] pyridine nucleotide-disulfide oxidoreductase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.854134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAGTG ATCATCTAGT TTTAGCTGGT GGTGGTCATA CTCATGCCCT TGTGTTGCTC CAGTGGGCAA TGAATCCAAA ATTGAAGCCT GCTGGAATGA TTACTTTAGT TAATAAAGCA AGTACAACTG TTTATTCTGG AATGTTTCCA GGTGTCATAG CAGGTAAATA CAAGATAGAT GAAATACTAA TTGATTTGAG GAAACTTGCT TTAAAAGCAG GAGTTTCATT TGTGATGGCA GAAATTGAGG GGATCAATCT TAAGGAAAAA AAGTTACTTT TAGAAGGACG GCCAGAAATT GAATATTCTC TATTATCCCT AAATATAGGA ACAAAAACTA ATATAAATTC TAAACTTTTT AATAGAGGTG ATAAAGATTT AGCTGTTCCA ATTAAACCTT TTTCTGAATC CTATAAATTT ATTGTCGATC AAGATATCCA CAAGAATGAT TCTTCTGCAA AACCATTTGT AATTATTGGT GGTGGATTTG CTGGAATAGA AATAGCTTTT TCTTTAAGAA AAAGATGGCC AAAAAGGCCT ATTTTATTAA AAGTCAAATC AGGAAGAAAT ATAAATAAAA ACCTGCTAAG AAATTTAAAG GCCTTAGATA TTGAAATCAC ACAAAAGCAA CCATCTATTT TATATCCAAA ATTAATATGT ACCGGTAATA AATCATTTAA TTGGTTAAAG GATAGTGGTT TACCTATAGA TGAGAATGGG AGAGTTCTAA CTAAAAAAAC TCTTCAAGTC CTTAACTATC CAGAGTTATT TGCTGTAGGA GATTGTGGTG TCATTAAGGA TTATCCTCGA CCTTCTTCTG GAGTATGGGC AGTTCGTGCA GCAAAATCAC TTGCAAATAA TTTAGAGTTT ATAACTAAGG GTTTAAAACT AGAGGAATGG AAACCTCAAA GAAAAGCAAT ACAACTGTTG GATATCAACA TTAGAAAAAA GAAATCTAAA GCTTTTATTT CCTGGGGTGA AGTTATTATT GGTCCTTTTG ATTTTTTATC AAGTTTAAAA GAATTAATTG ATAAACAATT TATCTCTAAA TTTGATCTAG TTAAAGATAT AAATTCAGAT ATGTCTTCTG AAGAAGACAT GATTAAATGT AGAGGATGTG CGGCTAAATT AGCTTTTACT CCATTAAGTT CAGCATTAAA AAAAGTAGAT TTAATAGAAT CTTCAAAAGA TGATTCTATA AATATAGGGA TATTAAATTC TGATAAAACT TTGATACAAA GTGTAGATGG ATTCCCTTCT TTAATTAGTG ATCCTTGGTT AAATGGAAGA CTTTTGGCGT TTCATTCCTG TTCTGATATT TGGGCATGCG GAGGATCTGT GATCTCTGCA CAGTCTCTTA TTAATTTACC ATCCATATCT AATAATTTAC AGCAAGAATT GTTATATCAA GTTTTGGAAG GTATTAATTC TGCTTTAACT ATCCAAGGTG CAAATCTTAT AGGTGGACAT ACATTAGAAT CAAGAAAAAT ATCTGAAGAG CCTTTTTCCC TAGGAATAGA AAGCTCATTA ACTGTAAATG GGGTTATTGA TGATAAAAAA TATTTTTGGC CTAAGGGAGG AATGAGAAAT GGAGATGAAA TTTTAATTAG TCGTTCTTTG GGAACTGGAA TTATTTTTTC TGCATTTATG AATGGTCAAG TAAAACCTTA TATACTTGAT AATGTCTTAA AAGAAATGAA TAAAAGTCAG CATGAGATTG TTAATTATAT TAATCAATTA ACAAATCTAA ATCCACGCTC AAAAATTGTT AATGCATGTA CTGATATAAC TGGATTTGGT TTGCTAGGTC ATTTGTCAGA AATGTTGGAA TCTACAAATA GTGATCAATT AAAGATGAAT TCAGAACCAT TTAAAGTCAA TCTTGAACTG GATAAAATAC CATTATATGA TGGTGTGAAA GAACTTTTAG ATAAAGGCTT TGAAAGCACT TTGGCCCCTG CAAATCAAAT TTTCCTAAAA AATATTGATG GAGATAAAAA CTTAAGGTTT GAGCTCACAT CTAATGATTC TTGCTCTAAT AGATCCTTTT ATAATGCCAT GCTGAAAATC TTAGTAGACC CACAAACTTG TGGCCCTTTG GTTGTTTGTT GTTCATCGAT TTATTCAGAA AAACTTATAC AGCAAGGACC TTGGATTAAA ATAGGTTTTA TTTCCAAATA A
|
Protein sequence | MFSDHLVLAG GGHTHALVLL QWAMNPKLKP AGMITLVNKA STTVYSGMFP GVIAGKYKID EILIDLRKLA LKAGVSFVMA EIEGINLKEK KLLLEGRPEI EYSLLSLNIG TKTNINSKLF NRGDKDLAVP IKPFSESYKF IVDQDIHKND SSAKPFVIIG GGFAGIEIAF SLRKRWPKRP ILLKVKSGRN INKNLLRNLK ALDIEITQKQ PSILYPKLIC TGNKSFNWLK DSGLPIDENG RVLTKKTLQV LNYPELFAVG DCGVIKDYPR PSSGVWAVRA AKSLANNLEF ITKGLKLEEW KPQRKAIQLL DINIRKKKSK AFISWGEVII GPFDFLSSLK ELIDKQFISK FDLVKDINSD MSSEEDMIKC RGCAAKLAFT PLSSALKKVD LIESSKDDSI NIGILNSDKT LIQSVDGFPS LISDPWLNGR LLAFHSCSDI WACGGSVISA QSLINLPSIS NNLQQELLYQ VLEGINSALT IQGANLIGGH TLESRKISEE PFSLGIESSL TVNGVIDDKK YFWPKGGMRN GDEILISRSL GTGIIFSAFM NGQVKPYILD NVLKEMNKSQ HEIVNYINQL TNLNPRSKIV NACTDITGFG LLGHLSEMLE STNSDQLKMN SEPFKVNLEL DKIPLYDGVK ELLDKGFEST LAPANQIFLK NIDGDKNLRF ELTSNDSCSN RSFYNAMLKI LVDPQTCGPL VVCCSSIYSE KLIQQGPWIK IGFISK
|
| |