Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_25471 |
Symbol | |
ID | 4778754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2239144 |
End bp | 2241336 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640088068 |
Product | selenide,water dikinase |
Protein accession | YP_001018543 |
Protein GI | 124024236 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0709] Selenophosphate synthase [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | [TIGR00476] selenium donor protein [TIGR03169] pyridine nucleotide-disulfide oxidoreductase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCTG ATCACCTCGT TCTGGCCGGG GGGGGACATA GCCATGCCCT AATGCTGCGT CGCTGGGCCA TGCGTCCCCA ACTCCGACCT GCAGGACTGA TCACCCTGAT CAACCGCCAC AGCACCACCC TCTACTCCGG CATGGTGCCT GGTCTCATTG CCGGTCATTA CAGACACAGT GAAATCGCGA TCGACCTCCG TCGCCTCACC GACCGAGCTG GCGTAGCACT CATCATTGCC GAAATCACAG CAGTGGAGGC CCACCACAAT CGCCTGCTCC TCGCCCAGCG TCCTCCCATA CATTTCCAAC GAATTAGTTT CGATGTGGGC GCCGAAACCT TCAACAAGGG CCCTTACCTT GAACGAAGCC AGGCTGCACT GGCAATGCCA ATCAAGCCTT TGGAGCCTGC CCTGGCATGG CTTGAACAGC AAGACAGTCA GGTGCTGCTC AATGATTCCA CGCCACTGAC GGTGATCGGG GCTGGCCTGG CGGGAGTAGA AGTGGCGCTC GCCCTTCGTC ATCGCTGGCC CAAGCGCCCT TTAAATCTGC AAGCGCATCA TGGGCAACCC AGACCAGCTC TAAAACAAGC CCTCTCCAAG GCCGCAATCG TAATAGTGCC AAGCGGAACC CCCTTGAGTG GCCCCGCCCT GCTCTGCACT GGCAGCCAAG CCCCCGCTTG GCTTGCTACC AGTGGCTTTC CCGTGGATCC CCTTGGACGT GTTCGCACCA CTAAAACACT GCAAGTCATC AACCATCCCC ACTGCTTCGC CGTCGGCGAC TGTGCGGTGA TTGATAAGGC CCAGCGGCCA GCCGCAGGGG TATGGGCCGT GCAAGCCGCA AAGCCTCTTG CCCAAAACCT AGAGCGGCTC AGCCGTAGAC AACCCACCCG TCCATGGCAG CCACAACAGC TTGCCTTGCA AATACTGGGA GGTCAGCTGA GCTCAGGGAG GTTCACCGCC TGGGCTTTTT GGGGCAACCT GATCATCGGG CCCCATCCCT GGCTTTGGTA CTGGAAAGAA GCTATCGATC GGCGCTTCAT GGGGAGCTTT AACGAACTCC CAAGCATGAG CGGAGTTCTC AAGCGACAGG AGAGCATGGC TTGCCGAGGT TGCGCAGCCA AATTGGCTGA GAAGCCGTTA AACGATGCCC TAAAGCAGGC AGGCCTAGGA GCTCTTGGGC AACAACCTGA GGACGCTGCC TTGATTGCCA GCACATCCTC AGGCGACAGC TTGCTTCAAA GTGTCGATGG TTTCCCTGCA CTGATCAGCG ACCCCTGGCT TAATGGGCGC TTAACCACAC TGCATGCCTG CTCAGATCTT TGGGCCAGTG GTGCGCATGT GATCTCTGCA CAGGCTGTCA TCACCCTGCC CAAGGTGTCC TCTGAACTTC AACAAGAGTT GTTGGTCCAA ACGCTCAAAG GAATCCAATC AACCCTCGAG CCGCAAGGCG CCAAATTAAT TGGAGGGCAT ACCCTCGAAG CCCGCAGCAT TCCGCCCCAA CCAATCAATC TTGGAATCCA GCTCACCCTT AGCGTTAACG GCAAGGTGGC TTCTGGGCGT GTGCCTTGGA GCAAAGGCAA GCTGCAATCG GGAGATGTCC TGCTGCTCAG CCGCCCCATA GGCAGCGGCG TGATCTTTGC TGCAGCCATG GCAGGAGAAG CTCACCCAGA GGATCTCGAC GCTGCACTTG CGCAGATGAC AATCAGCCAG CACAACCTCC TGACAGCGCT GCGCAGCCTT GAAGAGAGAC ATACAGGAAT GCAAACCATT CATGCATGTA CCGACATCAC CGGCTTCGGA CTACTGGGCC ATCTCGGAGA AATGCTCAGT GCAAGCAATC ACCAACGCCA TAGAGCAGGG CTACAACCTC TACGTCTCCT CCTCGAAGCC GCCGCCATTC CCTCCCTGCA GGGTGCTCTA CTGCTACTCA AAGCTGGCTA TTCGAGCACC CTGGCCCCAG CCAATCGCCG CAACTGGCAC TTGCTCAATC CAAGCATCAA CGGCGAAGCC GCACCGATCG AAATCGCACT GAACGATGTA ACACCAGGCA GTGAACACCA CCAGGCACTG CTCGAACTCA TGGTGGATCC ACAAACCTGC GGCCCATTAC TCCTGGCCTG TTCAACCAAG ATCGCCTCAG AACTTCTCAG GGATGGGCCT TGGCAACGCA TCGGCCAGGT TCAGCCGATG TAG
|
Protein sequence | MTADHLVLAG GGHSHALMLR RWAMRPQLRP AGLITLINRH STTLYSGMVP GLIAGHYRHS EIAIDLRRLT DRAGVALIIA EITAVEAHHN RLLLAQRPPI HFQRISFDVG AETFNKGPYL ERSQAALAMP IKPLEPALAW LEQQDSQVLL NDSTPLTVIG AGLAGVEVAL ALRHRWPKRP LNLQAHHGQP RPALKQALSK AAIVIVPSGT PLSGPALLCT GSQAPAWLAT SGFPVDPLGR VRTTKTLQVI NHPHCFAVGD CAVIDKAQRP AAGVWAVQAA KPLAQNLERL SRRQPTRPWQ PQQLALQILG GQLSSGRFTA WAFWGNLIIG PHPWLWYWKE AIDRRFMGSF NELPSMSGVL KRQESMACRG CAAKLAEKPL NDALKQAGLG ALGQQPEDAA LIASTSSGDS LLQSVDGFPA LISDPWLNGR LTTLHACSDL WASGAHVISA QAVITLPKVS SELQQELLVQ TLKGIQSTLE PQGAKLIGGH TLEARSIPPQ PINLGIQLTL SVNGKVASGR VPWSKGKLQS GDVLLLSRPI GSGVIFAAAM AGEAHPEDLD AALAQMTISQ HNLLTALRSL EERHTGMQTI HACTDITGFG LLGHLGEMLS ASNHQRHRAG LQPLRLLLEA AAIPSLQGAL LLLKAGYSST LAPANRRNWH LLNPSINGEA APIEIALNDV TPGSEHHQAL LELMVDPQTC GPLLLACSTK IASELLRDGP WQRIGQVQPM
|
| |