Gene P9303_21081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21081 
Symbol 
ID4776904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1869188 
End bp1870330 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content44% 
IMG OID640087616 
Productfatty acid desaturase, type 2 
Protein accessionYP_001018108 
Protein GI124023801 
COG category[I] Lipid transport and metabolism 
COG ID[COG3239] Fatty acid desaturase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.250719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCTG GCCTCAATGG AAAGGCGCTC AAGGTTTCCA AAGCGGATAT ACCTGATCTG 
GCTGCAATTA AGGCTGTTTT GCCTCATCAA TGCTTGAATT GCAGCACCAA CACTTCCCTT
GCTTATCTAG CCCAATCTCT GACGATACAG GTCATAGTAA TTTCCATAGG GATGTCCATC
CCTCTCAATG TGGAAATATT GCCTGTTTGG GTTCTCTACT GGCTTGTATC TGGCACAACA
GCAATGGGCT TATGGGTCAT CGCACATGAA TGCGGACACG GTGCATTTTC GAAAAATAGG
AAACTAGAAA CATTCGTTGG CTATGTGTTG CATTCGATGC TTCTTGTTCC ATATTTCTCA
TGGCAGCGTT CACATTTAGT TCACCATACT TATACGAACC ATATCGCCAA TGGTGAGACC
CACGTGCCTT TAGTCATTCG TGGCAACGGA ATTGATGAGC AGGCTGGTGG AGAAAAAGAT
ATTGCCATAG CAGGCAGATT AGGAAAAGTT CAATATGGTG TATTTCAGCT TGTGCTTCAT
CTGGTTTTTG GCTGGCCAGC CTATTTGCTG ACTGGGAAGA CGGGAGGCCC AAAATATGGC
CTATCGAATC ACTTCTGGCC GATAGCACCT TTCTCTAGAA AATTGTGGAC AAAAAAATGG
ATAAATAAAG TCTGGCTTTC CGATTGGGGG ATATGTCTGG CTCTGTTTGC ATTGATTGCC
TGGAGCCTGC ATGATGGCTT TGTTACTGTT TTTGCAATTT ATTTAGCTCC TCTGTTAGTA
GTCAATATCT GGCTAGTCAC CTATACGTGG CTGCATCATA CTGATACTGA TGTTCCCCAC
CTTGGCGGTT CAGACTTCTC CCAATTGCGA GGTGCGTTCC TATCGATTGA TAGGCCATAT
GGAAAAGTAA TTGATTTCCT CCATCACAAG ATAGGCTCTA CTCATGCCAT TCATCACATA
GCACCTTGGA TGCCTCATTA CCATGCAGGC AAAGCCACTA TTGCCCTAAA AAATGCTTTC
CCAAAGGTAT ATCTTTACAA TCAAACACCA ATTCTTCAGG CTCTCTGGCT TATTTCTACT
AACTGCATAG CTGTCACTCG GGAAGAGAAC AGTGGACGCT ATGTCTGGAA AAATCCTTGG
TGA
 
Protein sequence
MMSGLNGKAL KVSKADIPDL AAIKAVLPHQ CLNCSTNTSL AYLAQSLTIQ VIVISIGMSI 
PLNVEILPVW VLYWLVSGTT AMGLWVIAHE CGHGAFSKNR KLETFVGYVL HSMLLVPYFS
WQRSHLVHHT YTNHIANGET HVPLVIRGNG IDEQAGGEKD IAIAGRLGKV QYGVFQLVLH
LVFGWPAYLL TGKTGGPKYG LSNHFWPIAP FSRKLWTKKW INKVWLSDWG ICLALFALIA
WSLHDGFVTV FAIYLAPLLV VNIWLVTYTW LHHTDTDVPH LGGSDFSQLR GAFLSIDRPY
GKVIDFLHHK IGSTHAIHHI APWMPHYHAG KATIALKNAF PKVYLYNQTP ILQALWLIST
NCIAVTREEN SGRYVWKNPW