Gene NATL1_02161 details

Gene Information

Locus tag: NATL1_02161
Symbol: pds
ID: 4779454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 200780
End bp: 202168
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 39%
IMG OID: 640083481
Product: phytoene desaturase
Protein accession: YP_001014045
Protein GI: 124024929
COG category: [S] Function unknown
COG ID: [COG3349] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02731] phytoene desaturase
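
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 200780
end_bp = 202168
gene_length_bp = 1389
protein_length_aa = 462

# Assumption: coordinates are 1-based and inclusive at both ends.
computed_gene_length = end_bp - start_bp + 1

# Assumption: one trailing stop codon, which does not appear in the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the recorded gene length (1389 bp) and protein length (462 aa) agree with the start and end coordinates.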


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTAG CAATCGCTGG AGCCGGATTG GCAGGACTCT CATGTGCAAA ATACTTAGCC 
GATGCTGGTC ATACGCCATT TGTTTATGAA GCAAGAAACG TACTTGGCGG AAAAGTTGCT
GCTTGGAAAG ATGATGATGG TGACTGGTAT GAGACTGGAT TACATATATT TTTTGGAGCT
TATCCAAATA TGCTCCAGCT TTTTAAAGAA CTAGATATTG AAGATCGTCT TCAATGGAAA
AGTCATTCCA TGATTTTCAA CCAACCAGAA GAACCTGGGA CATATAGCCG TTTCGACTTC
CCTGATCTTC CTGCTCCAAT CAATGGAGTG GCAGCGATTT TAAGCAACAA TGACATGCTT
AGCTGGCCAG AAAAAATTTC GTTTGGACTG GGACTAGTAC CAGCTATGTT GCGTGGCCAA
AATTATGTAG AGGATTGTGA TAAGTACTCT TGGACGGAAT GGCTGAAAAA ACAAAATATC
CCCGAAAGAG TCAATGATGA AGTTTTTATA GCAATGAGTA AGGCACTTAA TTTTATAGGT
CCTGATGAAA TTTCCTCAAC AGTATTGCTA ACTGCATTAA ACCGCTTCTT ACAAGAAAAA
AACGGATCAA AAATGGCATT TCTTGATGGA GCTCCACCAG AACGACTTTG TCAACCAATT
GTTGATCACA TCAGAGCTTT AGGAGGCGAC GTATTTTTAA ATAGCCCACT AAAAAAAATA
AATTTACAAC AAGATGGATC TGTTGAAAAT TTCTTAATAG GTAGTGCCAA AGAACCTCAG
GGAAAAGAAA TCCAAGCAGA CGCGTATGTC AGCGCAATGC CCGTTGATAT TTTCAAAACA
ATTTTGCCCA ATGAATGGGC CTCTCAAGAT ATTTTCAGAA AACTTGAGGG ACTGAAAGGA
GTCCCAGTTA TTAATATTCA TCTTTGGTTC GATCGAAAAC TTACAAATAT TGATCACCTG
TTATTCAGCA GATCTCCACT TTTAAGTGTC TATGCCGACA TGAGCATAAC TTGTAAAGAA
TATGAAGATC CCAATCGATC AATGCTTGAA TTAGTTTTTG CTCCTGCAAA AGACTGGATT
GGTCGTAAAG ACGAGGAAAT AATTGATGCA ACAATGCAAG AATTGAAGAA ACTTTTTCCC
ATGCATTTCT CTGGGGAAAA TCAAGCTAAA TTGAGAAAAT ATAAAGTAAT AAAAACACCA
AAATCAGTCT ACAAAGCTGT TCCTGGATGC CAAGATTTAA GGCCAGACCA AAAGACTCCA
ATAAGAAACT TTTTCTTAAC TGGTGATTAC ACAATGCAAA AATACCTCGC TTCCATGGAA
GGTGCAGTCC TAAGTGGAAA AATATGTGCA GAAAAAATCC AAATCTCGAC TGACATAGGT
TCTTCTTAG
 
Protein sequence
MRVAIAGAGL AGLSCAKYLA DAGHTPFVYE ARNVLGGKVA AWKDDDGDWY ETGLHIFFGA 
YPNMLQLFKE LDIEDRLQWK SHSMIFNQPE EPGTYSRFDF PDLPAPINGV AAILSNNDML
SWPEKISFGL GLVPAMLRGQ NYVEDCDKYS WTEWLKKQNI PERVNDEVFI AMSKALNFIG
PDEISSTVLL TALNRFLQEK NGSKMAFLDG APPERLCQPI VDHIRALGGD VFLNSPLKKI
NLQQDGSVEN FLIGSAKEPQ GKEIQADAYV SAMPVDIFKT ILPNEWASQD IFRKLEGLKG
VPVINIHLWF DRKLTNIDHL LFSRSPLLSV YADMSITCKE YEDPNRSMLE LVFAPAKDWI
GRKDEEIIDA TMQELKKLFP MHFSGENQAK LRKYKVIKTP KSVYKAVPGC QDLRPDQKTP
IRNFFLTGDY TMQKYLASME GAVLSGKICA EKIQISTDIG SS
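
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a hedged sketch rather than the database's own method. It assumes the sequence contains only A, C, G and T, and it uses the amino acid assignments of translation table 11, which match the standard genetic code; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon; '*' marks a stop codon.

    Any trailing bases that do not form a full codon are ignored.
    """
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGCGTAG CAATCGCTGG AGCCGGATTG GCAGGACTCT CATGTGCAAA ATACTTAGCC"
print(translate(first_line))  # MRVAIAGAGLAGLSCAKYLA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` should give the listed protein followed by a single `*` for the final TAG codon, and `gc_percent` can be applied to the full sequence to compare against the GC content field.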