Gene Information |
Locus tag | A9601_03251 |
Symbol | |
ID | 4717012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 297504 |
End bp | 299675 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640078027 |
Product | selenide, water dikinase |
Protein accession | YP_001008720 |
Protein GI | 123967862 |
COG category | [C] Energy production and conversion; [E] Amino acid transport and metabolism |
COG ID | [COG0709] Selenophosphate synthase; [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | [TIGR00476] selenium donor protein; [TIGR03169] pyridine nucleotide-disulfide oxidoreductase family protein |
|
|
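The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length includes the stop codon; the record does not state either convention, but both are consistent with its values. The function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(297504, 299675)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed Gene Length (2172 bp) and Protein Length (723 aa).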
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.844451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTTA ATCATCTGGT ACTAATTGGA GGAGGGCACT CAAATGTTTC TTTATTGAAG AAATGGTTAA TGTTTCCGAA ATTAATGCCA GAAATTCCTG TTTCAATTAT ATCTAGAGAT TCTCATTTAG TTTACTCGGC TTTATTCCCA TCGGTAATTT CAAAATCAAT CACATTAGAA GAGAGTTTAA TTGATATAAA ATCTTTAGCA AAAAATGCAA AAGTATCTTT TATAGAAGAA GAAGTAAAGG ATATTGATTT CAATTTAAAG AAAATTGTTT TAAGTAATAG ACCTTCAGTT AATTATTCGA AGTTGGTGCT TAATTATGGA AGTCAAACAA TAATTCCAAA AGAATTTGAA TCACAAGTTA AAAATCGAAA TGCTTTTTCA ATTAAACCTT TTTTAAGTGC TTATGACTCA ATACTTAAAG AGGACATTTT TGATTCAGTT AATGAACTTC CATTTGTAAT TGTTGGGAGT GGACTCGCTG CAATTGAAGT ATCATATGCT TTGAGAAAAA GATGGGGAGA TAGACCTTTA AAACTATTAT GTGATTCAAG AAAAATTAAT AATAAAATCC TAAAAAGTTT ACGGAATTCC AATATTGAAT TAGTTGAAAA ACTTCATTTT GATTATGGTA AGCTTCTTTT ATGCACTGGA AATACGTCTC CGTTATGGGT ACAAAAAAAA TTATTAGATT CGGATTCTCA TAGCAGAATA ATTACAAATC AGAATTTGCA GATAAAAAGT TTCTCTGGAA TCTTTGCTGT CGGTGATTGT GCAGTTGTAG GTTCAGCAAA AAGACCAGCA TCGGGAGTTT TTGCAGTAAA AGTTGTAAAT ACATTAGTAC AAAATCTAAA AAAAGATGTA GAAGGTAGAT CATTAAAAAA GTGGTTTCCT CAAAAGATCG GATTGCAAAT ACTAAATATA TTTCCAAGCC ATCATCCAAA GGCTTTTGCA ATTTATCACA ATTTTGTTTT CGGCCCTTCT TTTATTTTTT GGATTTTAAA GCAAAAAATT GATATCAACT TTATTAAAAA GTTCAGATCA AAAAGGCTAA TTATGAAAAG TAGTGAAAAA AGTATCTCAA TTAATGATTG CAGAGGATGT GCAGCTAAAA TTCCTCAATT TGTTTTGAAC AAATCATTAA TAAATTCTAA TTTAAATTCT TTTGCCTCAT CACCTGAAGA TTCAGTTGAG ATATATAAAA ATGGTCAAGA TATTATATTG CAAAGTGTAG ATGGATTTCC TGCTTTGGTA AGTGATCCTT GGCTTAATGC AAAAATTACT ACTTTGCATG CTTGCTCAGA TTTGTGGGCA TGTGGAGCAA AACTTTCATC AGCGCAGGCT TTAATTTCAT TACCAAAAGT TGAAAGGGAA TTTCAGAGTT ACCTCTTTAC TCAATCACTT CAAGGTATTA AATCAACAGT TGAAGATCAT GGAGGCGAAC TACTTGGAGG CCATACTTTC GAGGCAAGAA GTTTTGTAAA TAAACCTTAT TCATTAGGAA TAGATATTTC TTTAACAGTT CAAGGTATTT TAAAAAATGG AGCAAAACCA TGGCTTAAAT CTGGAATGAA TATTGGAGAT ATTCTCATGA TGTCTAGACC TCTAGGCGTT GGGATCTACT TTGCCGGTCA AATGCAAAAT ATTAATATGC TAGGTAGTTC TTCTGAAGTA ATTAATAATT TAGTAAAGAG TCAGCAATAT TTGATTGATG AAATTTATCT TTTTCAAAAT GAATTTAAAG AATCATTAGT AAATGCTGCC ACTGACATTA CTGGATATGG ATTTATTGGA 
CATCTTAAAG AAATGGTTGA ATCATCTAAT TTATATCGGC AAAGTAATAA TCTTGAGCCA ATAAAAGTTT TATTAGATTT ATTTGCATTT AAAGCTTATC CTGGAGTATT TGATCTAATA AGAAAAGATG TTAAAAGTAC TTTCTTTGAA TCTAATAGAG AAATATTTGA CAAAATTTAT AAAGTAAATA ATCAAAAAAG AATAATTAAT TTTTTAAACG AAAAGTCATT AGATAAAAAG ACTTTTAACG AGAGAATATC ACTATTATTA GATCCTCAAA CATGTGGCCC CTTGTTGATT AGTTGCAACC GTAAATATGA AAATGTTCTA AAGGATAAAT GGTACAAGGT TGGAGAGGTT GTAAAAATGT AA
|
Protein sequence | MTFNHLVLIG GGHSNVSLLK KWLMFPKLMP EIPVSIISRD SHLVYSALFP SVISKSITLE ESLIDIKSLA KNAKVSFIEE EVKDIDFNLK KIVLSNRPSV NYSKLVLNYG SQTIIPKEFE SQVKNRNAFS IKPFLSAYDS ILKEDIFDSV NELPFVIVGS GLAAIEVSYA LRKRWGDRPL KLLCDSRKIN NKILKSLRNS NIELVEKLHF DYGKLLLCTG NTSPLWVQKK LLDSDSHSRI ITNQNLQIKS FSGIFAVGDC AVVGSAKRPA SGVFAVKVVN TLVQNLKKDV EGRSLKKWFP QKIGLQILNI FPSHHPKAFA IYHNFVFGPS FIFWILKQKI DINFIKKFRS KRLIMKSSEK SISINDCRGC AAKIPQFVLN KSLINSNLNS FASSPEDSVE IYKNGQDIIL QSVDGFPALV SDPWLNAKIT TLHACSDLWA CGAKLSSAQA LISLPKVERE FQSYLFTQSL QGIKSTVEDH GGELLGGHTF EARSFVNKPY SLGIDISLTV QGILKNGAKP WLKSGMNIGD ILMMSRPLGV GIYFAGQMQN INMLGSSSEV INNLVKSQQY LIDEIYLFQN EFKESLVNAA TDITGYGFIG HLKEMVESSN LYRQSNNLEP IKVLLDLFAF KAYPGVFDLI RKDVKSTFFE SNREIFDKIY KVNNQKRIIN FLNEKSLDKK TFNERISLLL DPQTCGPLLI SCNRKYENVL KDKWYKVGEV VKM
|
| |
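The gene and protein sequences above are printed in blocks of ten characters, so whitespace has to be stripped before they are used. The sketch below shows one way to do that, translate the gene, and compute its GC content. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG. The helper names are hypothetical.

```python
# Codon table in the usual TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record above.
print(translate("ATGACTTTTA ATCATCTGGT ACTAATTGGA"))
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is a quick consistency check; `gc_percent` on the same input can be compared with the GC content field.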