Gene Information |
Locus tag | NATL1_03811 |
Symbol | |
ID | 4779850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 350624 |
End bp | 352804 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640083649 |
Product | selenide, water dikinase |
Protein accession | YP_001014210 |
Protein GI | 124025094 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0709] Selenophosphate synthase [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | [TIGR00476] selenium donor protein [TIGR03169] pyridine nucleotide-disulfide oxidoreductase family protein |
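The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon; neither convention is stated in this record. `feature_lengths` is a hypothetical helper name used only for illustration.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(feature_lengths(350624, 352804))  # (2181, 726)
```

Under those assumptions the result agrees with the listed Gene Length (2181 bp) and Protein Length (726 aa).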
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.599686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.186769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAGTG ATCATCTCGT TCTAGCTGGT GGTGGTCATA CTCATGCCCT TGTGTTGCTC CAGTGGGCAA TGAATCCAAA ATTGAAGCCT GCTGGAATGA TTACTTTAGT TAATAAAGCA AGTACAACTG TTTATTCTGG AATGTTTCCA GGTGTCATAG CAGGTAAATA CAAGATAGAT GAAATACTAA TTGATTTGAG GAAACTTGCT TTAAAAGCAG GAGTTTCATT TGTGATGGCA GAAATTGAGG GAATCAATCT TAAGGAAAAA AAGTTACTTT TAGCAGGACG GCCAGAAATT GAATATTCTC TATTATCCCT AAATATAGGA ACAAAAACTA ATATAAATTC TAAACTTTTT ATTAGAGGTG ATAAAGATTT AGCTGTTCCA ATTAAACCTT TTTCTGAATC CTATAAATTT ATTGTCGATC AAGATATCCA CAAGAATGAT TCTTCTGCAA AACCATTTGT AATTATTGGT GGAGGATTCG CTGGAATAGA AATAGCTTTT TCTTTAAGAA AAAGATGGCC AAAAAGGCCT ATTTTATTAA AAGTCAAATC AGGAAGAAAT ATAAATAAAA ATCTTTTAAG AAATTTAAAG GCTTTAGATA TTGAAATTAC ACAAAAGCAA CCATCTATTT TATATCCAAA ATTAATATGT ACCGGTAATA AATCATTTAA TTGGTTAAAG GATAGTGGTT TACCTATAGA TGAGAATGGG AGAGTTCTAA CTGAAAAAAC TCTTCAAGTC CTTAACTATC CAGAGTTATT TGCTGTAGGA GATTGTGGTG TCATTAAGGA TTATCCTCGA CCTTCTTCTG GAGTATGGGC GGTTCGTGCA GCAAAACCAC TTGCAAATAA TTTAGAGTTT ATAACTAAAG GTTTAAAACT AGAGGAATGG AAACCTCAAA GAAAAGCAAT ACAACTTTTG GATATCAATA TTAGAAAAAA GAAATCTAAA GCTTTTATTT CCTGGGGTGA AGTTATTATT GGTCCTTTTG ATTTTTTATC AAGTTTCAAA GAATTAATTG ATAAACAATT TATCTCTAAA TTTGATCTAG TTAAAGATAT AAATTCAGAT ATGTCTTCTG AAGAAGAGAT GATTAAATGT AGAGGATGTG CGGCAAAATT AGCTTTTACT CCATTAAGTT CAGCATTAAA AAAAGTAGAT TTAATAGAAT CTTCAAAAGA TGATTCTATT AATATAGGGA TATTAAATTC TGATAAAACT TTGATACAAA GTGTAGATGG ATTCCCTTCT TTAATTAGTG ATCCTTGGTT AAATGGAAGA CTTTTGGCGT TTCATTCCTG TTCTGATATT TGGGCATGCG GAGGATCTGT GATCTCTGCA CAGTCTCTTA TCAACTTACC ATCCATATCT AATAATTTAC AGAAAGAATT GTTATATCAA GTTTTGGAAG GTATTAATTC TGCTTTAACT ATCCAAGGTG CAAATCTTAT AGGTGGGCAT ACATTAGAAT CAAGAAAAAT ATCTGAAGAG CCTTTTTCCC TAGGAATAGA AAGCTCATTA ACTGTAAATG GGGTTATTGA TGATAAAAAA TATTTTTGGC CTAAGGGAGG AATGAGAAAT GGAGATGAAA TTTTAATTAG TCGTTCTTTG GGAACTGGAA TTATTTTTTC TGCATTTATG AATGGCCAAG TAAAACCTTA TATACTTGAT AATGTCTTAA AAGAAATGAA TAAAAGTCAG CATGAGATTG TCAATTATAT TAATCAATTA ACAAATCTAA ATCCACGCTC AAAAATAGTT AATGCATGTA CTGATATAAC TGGATTTGGT 
TTGCTAGGTC ATTTGTCAGA AATGTTGGAA TCTACTAATA GTGATCAATT AAAGATGAAT TTAGAACCAT TTAAAGTCAC TCTTGAACTG GATAATATAC CATTATATGA TGGTGCAAAA GAACTTTTAG ATAAAGGCTT TGAAAGCACT TTAGCCCCTG CAAATCAAAT TTTTCTAAAA AATATTGATG GAGATAAAAA CTTAAGGTTT GAGCTCAAAT CTAATGATTC TTCCTCTAAC AGATCCTTTT ATAATGCCAT GCTAAAAATC TTAGTAGACC CACAAACTTG TGGCCCTTTG GTTGTTTGTT GTTCATCGAT TTATTCAGAA AAACTTATAC AGCAAGGACC TTGGATTAAA ATAGGTTTTA TTTCTAAATA A
|
Protein sequence | MFSDHLVLAG GGHTHALVLL QWAMNPKLKP AGMITLVNKA STTVYSGMFP GVIAGKYKID EILIDLRKLA LKAGVSFVMA EIEGINLKEK KLLLAGRPEI EYSLLSLNIG TKTNINSKLF IRGDKDLAVP IKPFSESYKF IVDQDIHKND SSAKPFVIIG GGFAGIEIAF SLRKRWPKRP ILLKVKSGRN INKNLLRNLK ALDIEITQKQ PSILYPKLIC TGNKSFNWLK DSGLPIDENG RVLTEKTLQV LNYPELFAVG DCGVIKDYPR PSSGVWAVRA AKPLANNLEF ITKGLKLEEW KPQRKAIQLL DINIRKKKSK AFISWGEVII GPFDFLSSFK ELIDKQFISK FDLVKDINSD MSSEEEMIKC RGCAAKLAFT PLSSALKKVD LIESSKDDSI NIGILNSDKT LIQSVDGFPS LISDPWLNGR LLAFHSCSDI WACGGSVISA QSLINLPSIS NNLQKELLYQ VLEGINSALT IQGANLIGGH TLESRKISEE PFSLGIESSL TVNGVIDDKK YFWPKGGMRN GDEILISRSL GTGIIFSAFM NGQVKPYILD NVLKEMNKSQ HEIVNYINQL TNLNPRSKIV NACTDITGFG LLGHLSEMLE STNSDQLKMN LEPFKVTLEL DNIPLYDGAK ELLDKGFEST LAPANQIFLK NIDGDKNLRF ELKSNDSSSN RSFYNAMLKI LVDPQTCGPL VVCCSSIYSE KLIQQGPWIK IGFISK
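The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before they are used. The following is a minimal sketch of how the GC content and Protein sequence fields could be re-derived from the gene sequence. It is not the method used to generate this record. The function names `clean`, `gc_percent` and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (table 11 differs only in the permitted start codons).

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first six codons of the gene sequence above, followed by a stop codon.
print(translate("ATGTTCAGTG ATCATCTCTA A"))  # MFSDHL
print(gc_percent("ATGC"))  # 50.0
```

Passing the full Gene sequence field to `gc_percent` and `translate` would allow a comparison with the GC content and Protein sequence fields listed in this record.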