Gene NATL1_03811 details


Gene Information

Locus tag: NATL1_03811
Symbol:
ID: 4779850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 350624
End bp: 352804
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 31%
IMG OID: 640083649
Product: selenide, water dikinase
Protein accession: YP_001014210
Protein GI: 124025094
COG category: [C] Energy production and conversion
    [E] Amino acid transport and metabolism
COG ID: [COG0709] Selenophosphate synthase
    [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID: [TIGR00476] selenium donor protein
    [TIGR03169] pyridine nucleotide-disulfide oxidoreductase family protein
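
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 350624
end_bp = 352804
gene_length_bp = 2181
protein_length_aa = 726


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions the record is internally consistent: 352804 - 350624 + 1 = 2181 bp, and 2181 / 3 = 727 codons, of which one is the stop codon, leaving 726 residues.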


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.599686
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.186769
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGTG ATCATCTCGT TCTAGCTGGT GGTGGTCATA CTCATGCCCT TGTGTTGCTC 
CAGTGGGCAA TGAATCCAAA ATTGAAGCCT GCTGGAATGA TTACTTTAGT TAATAAAGCA
AGTACAACTG TTTATTCTGG AATGTTTCCA GGTGTCATAG CAGGTAAATA CAAGATAGAT
GAAATACTAA TTGATTTGAG GAAACTTGCT TTAAAAGCAG GAGTTTCATT TGTGATGGCA
GAAATTGAGG GAATCAATCT TAAGGAAAAA AAGTTACTTT TAGCAGGACG GCCAGAAATT
GAATATTCTC TATTATCCCT AAATATAGGA ACAAAAACTA ATATAAATTC TAAACTTTTT
ATTAGAGGTG ATAAAGATTT AGCTGTTCCA ATTAAACCTT TTTCTGAATC CTATAAATTT
ATTGTCGATC AAGATATCCA CAAGAATGAT TCTTCTGCAA AACCATTTGT AATTATTGGT
GGAGGATTCG CTGGAATAGA AATAGCTTTT TCTTTAAGAA AAAGATGGCC AAAAAGGCCT
ATTTTATTAA AAGTCAAATC AGGAAGAAAT ATAAATAAAA ATCTTTTAAG AAATTTAAAG
GCTTTAGATA TTGAAATTAC ACAAAAGCAA CCATCTATTT TATATCCAAA ATTAATATGT
ACCGGTAATA AATCATTTAA TTGGTTAAAG GATAGTGGTT TACCTATAGA TGAGAATGGG
AGAGTTCTAA CTGAAAAAAC TCTTCAAGTC CTTAACTATC CAGAGTTATT TGCTGTAGGA
GATTGTGGTG TCATTAAGGA TTATCCTCGA CCTTCTTCTG GAGTATGGGC GGTTCGTGCA
GCAAAACCAC TTGCAAATAA TTTAGAGTTT ATAACTAAAG GTTTAAAACT AGAGGAATGG
AAACCTCAAA GAAAAGCAAT ACAACTTTTG GATATCAATA TTAGAAAAAA GAAATCTAAA
GCTTTTATTT CCTGGGGTGA AGTTATTATT GGTCCTTTTG ATTTTTTATC AAGTTTCAAA
GAATTAATTG ATAAACAATT TATCTCTAAA TTTGATCTAG TTAAAGATAT AAATTCAGAT
ATGTCTTCTG AAGAAGAGAT GATTAAATGT AGAGGATGTG CGGCAAAATT AGCTTTTACT
CCATTAAGTT CAGCATTAAA AAAAGTAGAT TTAATAGAAT CTTCAAAAGA TGATTCTATT
AATATAGGGA TATTAAATTC TGATAAAACT TTGATACAAA GTGTAGATGG ATTCCCTTCT
TTAATTAGTG ATCCTTGGTT AAATGGAAGA CTTTTGGCGT TTCATTCCTG TTCTGATATT
TGGGCATGCG GAGGATCTGT GATCTCTGCA CAGTCTCTTA TCAACTTACC ATCCATATCT
AATAATTTAC AGAAAGAATT GTTATATCAA GTTTTGGAAG GTATTAATTC TGCTTTAACT
ATCCAAGGTG CAAATCTTAT AGGTGGGCAT ACATTAGAAT CAAGAAAAAT ATCTGAAGAG
CCTTTTTCCC TAGGAATAGA AAGCTCATTA ACTGTAAATG GGGTTATTGA TGATAAAAAA
TATTTTTGGC CTAAGGGAGG AATGAGAAAT GGAGATGAAA TTTTAATTAG TCGTTCTTTG
GGAACTGGAA TTATTTTTTC TGCATTTATG AATGGCCAAG TAAAACCTTA TATACTTGAT
AATGTCTTAA AAGAAATGAA TAAAAGTCAG CATGAGATTG TCAATTATAT TAATCAATTA
ACAAATCTAA ATCCACGCTC AAAAATAGTT AATGCATGTA CTGATATAAC TGGATTTGGT
TTGCTAGGTC ATTTGTCAGA AATGTTGGAA TCTACTAATA GTGATCAATT AAAGATGAAT
TTAGAACCAT TTAAAGTCAC TCTTGAACTG GATAATATAC CATTATATGA TGGTGCAAAA
GAACTTTTAG ATAAAGGCTT TGAAAGCACT TTAGCCCCTG CAAATCAAAT TTTTCTAAAA
AATATTGATG GAGATAAAAA CTTAAGGTTT GAGCTCAAAT CTAATGATTC TTCCTCTAAC
AGATCCTTTT ATAATGCCAT GCTAAAAATC TTAGTAGACC CACAAACTTG TGGCCCTTTG
GTTGTTTGTT GTTCATCGAT TTATTCAGAA AAACTTATAC AGCAAGGACC TTGGATTAAA
ATAGGTTTTA TTTCTAAATA A
 
Protein sequence
MFSDHLVLAG GGHTHALVLL QWAMNPKLKP AGMITLVNKA STTVYSGMFP GVIAGKYKID 
EILIDLRKLA LKAGVSFVMA EIEGINLKEK KLLLAGRPEI EYSLLSLNIG TKTNINSKLF
IRGDKDLAVP IKPFSESYKF IVDQDIHKND SSAKPFVIIG GGFAGIEIAF SLRKRWPKRP
ILLKVKSGRN INKNLLRNLK ALDIEITQKQ PSILYPKLIC TGNKSFNWLK DSGLPIDENG
RVLTEKTLQV LNYPELFAVG DCGVIKDYPR PSSGVWAVRA AKPLANNLEF ITKGLKLEEW
KPQRKAIQLL DINIRKKKSK AFISWGEVII GPFDFLSSFK ELIDKQFISK FDLVKDINSD
MSSEEEMIKC RGCAAKLAFT PLSSALKKVD LIESSKDDSI NIGILNSDKT LIQSVDGFPS
LISDPWLNGR LLAFHSCSDI WACGGSVISA QSLINLPSIS NNLQKELLYQ VLEGINSALT
IQGANLIGGH TLESRKISEE PFSLGIESSL TVNGVIDDKK YFWPKGGMRN GDEILISRSL
GTGIIFSAFM NGQVKPYILD NVLKEMNKSQ HEIVNYINQL TNLNPRSKIV NACTDITGFG
LLGHLSEMLE STNSDQLKMN LEPFKVTLEL DNIPLYDGAK ELLDKGFEST LAPANQIFLK
NIDGDKNLRF ELKSNDSSSN RSFYNAMLKI LVDPQTCGPL VVCCSSIYSE KLIQQGPWIK
IGFISK
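
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace stripped before use. The sketch below shows one way to do that, compute a GC fraction, and translate the first three blocks of the gene sequence. It relies on translation table 11 sharing its codon assignments with the standard genetic code (the two differ in which start codons are permitted); alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The helper names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTTCAGTG ATCATCTCGT TCTAGCTGGT"
print(translate(first_blocks))  # prints: MFSDHLVLAG
```

The translated residues match the first block of the protein sequence, MFSDHLVLAG. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.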